FOREWORD


The  Eighth  Army  Conference  on  Applied  Mathematics  and  Computing  was  held  on 
19-22  June  1990,  at  Cornell  University,  Ithaca,  New  York.  Four  years  earlier,  the  fourth  in 
this  series  of  conferences  was  also  held  at  Cornell  University.  At  that  time  the 
Mathematical  Sciences  Institute  was  established.  At  each  of  these  meetings,  there  were 
several  invited  speakers  that  addressed  vital  research  areas.  At  the  1990  conference,  invited 
talks  covered  topics  such  as  new  ideas  about  turbulence,  nonlinear  dynamical  systems, 
symbolic  methods,  domain  decomposition  methods  for  partial  differential  equations,  and 
variational  methods  for  free  boundary  problems  with  discontinuities  and  interfaces.  Special 
Sessions  were  organized  on  wavelet  transforms  for  image  analysis,  geometric  modeling, 
symbolic  methods,  and  adaptive  methods  for  high  performance  architectures.  In  the  eleven 
Technical  Sessions  and  one  Poster  Session,  more  than  sixty  papers  were  contributed.  An 
informal  discussion  session  was  held  to  discuss  research  issues  in  geometric  modeling  for 
vulnerability  analysis. 

The  subcommittee  of  the  Army  Mathematics  Steering  Committee  that  oversees  these 
conferences  was  very  pleased  with  the  high  scientific  quality  of  the  contributed  papers. 
Many  of  these  papers  provided  the  attendees  a  chance  to  see  scientific  developments  taking 
place  in  the  Army  laboratories.  Through  these  meetings  techniques  developed  at  one 
installation  are  brought  to  the  attention  of  scientists  at  other  places,  thereby  reducing 
duplication  of  effort.  Another  important  phase  of  these  meetings  is  presenting  the  members 
of  the  audience  an  opportunity  to  hear  nationally  known  scientists  discuss  recent 
developments  in  their  own  field.  This  year  the  invited  speakers,  together  with  the  titles  of 
their  talks,  are  listed  below. 

SPEAKER AND AFFILIATION                     TITLE OF ADDRESS

Professor T. Brooke Benjamin                New Ideas about the Origins
University of Oxford                        of Turbulence
Oxford, England

Professor Ivar Ekeland                      Variational Methods and
University of Paris IX Ceremade             Dynamical Systems
Paris, France

Professor James H. Bramble                  On the Analysis of Domain
Cornell University                          Decomposition Methods for
Ithaca, New York                            Elliptic Partial Differential
                                            Equations

Professor Sanjoy Mitter                     Variational Problems with
Massachusetts Institute of Technology       Free Discontinuities and
Cambridge, Massachusetts                    Nonlinear Diffusions

Professor Bruno Buchberger                  Symbolic Computation:
Johannes Kepler University                  Theory and Practice
Linz, Austria

Professor George R. Sell                    Approximation Dynamics for
University of Minnesota                     the Navier-Stokes Equations
Minneapolis, Minnesota

The  benefits  derived  from  these  conferences  depend  a  great  deal  on  the  host’s 
Chairman  on  Local  Arrangements.  The  attendees  at  this  meeting  were  fortunate  to  have 
Professor  Anil  Nerode,  Director  of  Mathematical  Sciences  Institute,  serving  in  this  capacity. 
He,  together  with  members  of  his  capable  staff,  provided  all  those  amenities  such  as  coffee, 
projection  equipment,  travel  information,  etc.,  needed  for  an  enjoyable  and  profitable 
symposium. 


TABLE OF CONTENTS


TITLE

Foreword

Table of Contents

Philip J. Holmes and Pieter J. Swart

BREAKUP OF A VISCOUS LIQUID JET
S. P. Lin and E. A. Ibrahim

COMPUTATIONS
Dan C. Marinescu, John R. Rice, and Emmanuel A. Vavalis

MODELING
Abstract

We study the global existence, regularity and boundedness of solutions of the two-dimensional periodic
Kuramoto-Sivashinsky equation with a thin periodicity rectangle $Q_\varepsilon = [0,2\pi] \times [0,2\pi\varepsilon]$. The main result is
that for a large set of initial conditions, the solution exists and is uniformly bounded. This implies the
existence of a local compact attractor with a basin of attraction which expands to the whole space as $\varepsilon \to 0$.
We state various theorems and give only brief indications of the proofs. A full treatment will be published
elsewhere.


1.  Introduction 


Our goal in this paper is to present some results on the global asymptotic
behavior of the Kuramoto-Sivashinsky (K-S) equation

(1.1)    $u_t + \nu \Delta^2 u + \Delta u + \tfrac{1}{2} |\nabla u|^2 = 0$

in spatial dimension two, where $u = u(y,t) = u(y_1,y_2,t)$ satisfies the periodic boundary
condition

(1.2)    $u(y_1 + 2\pi, y_2, t) = u(y_1, y_2 + 2\pi\varepsilon, t) = u(y_1, y_2, t)$ for all $y$ in $\mathbb{R}^2$ and $t > 0$

and the periodic initial condition

(1.3)    $u(y,0) = u_0(y)$.

Here $0 < \varepsilon \le 1$ is a small parameter, so that the basic periodicity cell
$Q_\varepsilon = [0,2\pi] \times [0,2\pi\varepsilon]$ is a thin domain. The dissipativity of the general two (and higher) dimensional
problem has been open for some time, the essential difficulty being the lack of a proof
of the existence of an absorbing set. In fact, if such a set exists, then there exists a global
attractor, and one can prove very precise regularity results (cf. [NST2], [Te]).

The approach we shall adopt is based on the intuitive idea that (1.1)-(1.3) should
be close to one-dimensional, and is close to the methods introduced by [HR1,2] and
[RS]. The odd-periodic case was studied by [NST1,2], [FNST]. The same authors treat
[NST1] a Neumann type of problem without symmetry, and the rigid Dirichlet problem
is studied by [Ta], again without any symmetry assumptions. The general periodic
one-dimensional problem has only recently been proved to be dissipative [I].

We start by changing (1.1) into a system by means of the hodograph
transformation

    $D_{y_i} u = U_i$, $i = 1,2$, and $U = (U_1, U_2)$.

Equality of the mixed partials requires the condition curl $U = 0$. Notice also that the
average of $U$ over the periodicity cell is zero. After this transformation we obtain

(1.4)    $U_t + \nu \Delta^2 U + \Delta U + (U \cdot \nabla) U = 0$.

Functional Setting

A point in $Q_\varepsilon = [0,2\pi] \times [0,2\pi\varepsilon]$ will be denoted by $y = (y_1,y_2)$. Let us define
$x = (x_1,x_2)$ by $x_1 = y_1$ and $x_2 = \varepsilon^{-1} y_2$. This maps $Q_\varepsilon$ onto the square $Q_2 = [0,2\pi]^2$.
Define also the rescaled operators

    $\nabla_\varepsilon = (D_{x_1}, \varepsilon^{-1} D_{x_2})$,    $\Delta_\varepsilon = D_{x_1}^2 + \varepsilon^{-2} D_{x_2}^2$.

Note that these become singular when $\varepsilon$ is small. Finally, we define a new function
$u = u(x)$ by $u(x) = U(y)$, where $x$ and $y$ are related as above. In the sequel, we write $(\cdot,\cdot)$
and $|\cdot|$ for the standard inner product and norm in $L^2(Q_2)$.

Given $u \in L^2(Q_2)$ we define the projection operator $M$ as follows:

    $v = Mu$, where $v = v(x_1) = \frac{1}{2\pi} \int_0^{2\pi} u(x)\, dx_2$, and $w = (I - M)u$.

This averaging operation maps $L^2(Q_2)$ onto the closed subspace formed by functions
of $x_1$ alone and it is an orthogonal projection. The complementary projection $I - M$
defines $w = (I - M)u$. Notice that $Mw = 0$, so that $w$ has zero average with respect to $x_2$.
Also, $M$ and $I - M$ commute with the Laplacian on its domain, and they preserve
periodicity. After rescaling, Eq. (1.4) becomes

(1.5)    $u_t + \nu \Delta_\varepsilon^2 u + \Delta_\varepsilon u + (u \cdot \nabla_\varepsilon) u = 0$,
         $u = u(x,t)$ $2\pi$-periodic in $x_1, x_2$,
         $u(x,0) = u_0(x)$, also $2\pi$-periodic.
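The averaging projection $M$ can be illustrated on a grid. The following is a minimal sketch, assuming a uniform $n \times n$ sampling of $Q_2$ and replacing the integral over $x_2$ by the mean over grid points; the test function and the helper names `project_M` and `inner` are illustrative choices, not part of the paper.

```python
import math

def project_M(u):
    """Discrete analogue of M: average u[i][j] ~ u(x1_i, x2_j) over the x2 index."""
    n2 = len(u[0])
    avg = [sum(row) / n2 for row in u]      # v(x1_i)
    return [[a] * n2 for a in avg]          # extended as a function constant in x2

def inner(u, v):
    """Discrete L2 inner product (the constant grid weight is omitted)."""
    return sum(a * b for ru, rv in zip(u, v) for a, b in zip(ru, rv))

n = 16
xs = [2 * math.pi * i / n for i in range(n)]
# Hypothetical test function: a part depending on x1 only plus a part with zero x2-average.
u = [[math.sin(x1) + math.cos(x1) * math.sin(2 * x2) for x2 in xs] for x1 in xs]

v = project_M(u)                                                   # v = Mu
w = [[a - b for a, b in zip(ru, rv)] for ru, rv in zip(u, v)]      # w = (I - M)u
```

In this sketch `v` recovers the $x_1$-only part, `project_M(w)` vanishes up to rounding, and `inner(v, w)` is zero up to rounding, which is the discrete counterpart of $M$ being an orthogonal projection with $Mw = 0$.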




Without loss of generality, we shall set $\nu = 1$.

Let us now apply the projections $M$ and $I - M$ to (1.5). This gives

(1.6)    $v_t + \Delta_\varepsilon^2 v + \Delta_\varepsilon v + M[(u \cdot \nabla_\varepsilon) u] = 0$
         $w_t + \Delta_\varepsilon^2 w + \Delta_\varepsilon w + (I - M)[(u \cdot \nabla_\varepsilon) u] = 0$.

We wish to associate with (1.6) a simpler problem which, for small $\varepsilon$, will turn out to
govern to a large extent the dynamics of the system. We thus define the reduced problem
to be the one obtained from (1.6) by setting $(v,w) = (v,0)$ in (1.6) and taking as initial
condition the projection of $u_0$ onto the $v$-space. This gives

(1.7)    $v_t + \Delta_\varepsilon^2 v + \Delta_\varepsilon v + (v \cdot \nabla_\varepsilon) v = 0$
         $v(x,0) = v_0 = M u_0$.

Let us write $v_i$, $i = 1,2$ for the components of $v$. Then these satisfy

    $v_{1,t} + \Delta_\varepsilon^2 v_1 + \Delta_\varepsilon v_1 + v_1 D_{x_1} v_1 = 0$
    $v_{2,t} + \Delta_\varepsilon^2 v_2 + \Delta_\varepsilon v_2 + v_1 D_{x_1} v_2 = 0$.

Notice that the first component satisfies the one-dimensional K-S equation (the
dependence of the equations on $\varepsilon$ is illusory), while the second one satisfies a linear
equation. By the results of [I], [NST2], if the initial condition $u_0 \in H^k$ then there exists
an absorbing set in $H^s$ for $0 < s < k$. Moreover, the same property holds, even for
dimensions 2 and 3, as soon as a solution is uniformly bounded in time in $L^2$.

The above dissipativity result for the 1D problem, together with general theorems
on attractors of asymptotically smooth nonlinear semigroups [Ha], imply the existence of
a global attractor of finite Hausdorff and fractal dimensions [I], [NST2].


The  evolutionary  equation 

In what follows, we will always assume that the initial condition has zero
average over the periodicity cell; it is then easy to prove that the solutions have the same
property for all times for which they exist. Let us define the unbounded linear operator
$A_\varepsilon$ by $A_\varepsilon u = \Delta_\varepsilon^2 u$ on the closed subspace $X$ of $L^2(Q_2)$ consisting of functions with
zero average and whose ($\varepsilon$-rescaled) curl is zero. The operator defined above is
self-adjoint, has a dense domain and, when restricted to the subspace of periodic functions
with zero average, is positive definite. It is also well-known that its resolvent is compact.
These properties imply that the fractional powers of $A_\varepsilon$ are well-defined. In particular,
one has

    $A_\varepsilon^{1/2} = -\Delta_\varepsilon$, and $|A_\varepsilon^{1/4} u|^2 = |\nabla_\varepsilon u|^2$.

We will also introduce rescaled versions of the classical bilinear and trilinear forms
associated with the nonlinear term in our equation ([Te], [Li]). We thus consider

    $B_\varepsilon(u,v) = (u \cdot \nabla_\varepsilon) v$, and $b_\varepsilon(u,v,w) = (B_\varepsilon(u,v), w)$.

The trilinear form satisfies the following inequality, where $C$ is independent of $\varepsilon$:

(1.8)    $|b_\varepsilon(u,v,w)| \le C\, |u|^{1/2}\, |A_\varepsilon^{1/4} u|^{1/2}\, |A_\varepsilon^{1/4} v|\, |w|^{1/2}\, |A_\varepsilon^{1/4} w|^{1/2}$.

In particular, when $u = v$ this gives

(1.9)    $|b_\varepsilon(u,u,w)| \le C\, |u|^{1/2}\, |A_\varepsilon^{1/4} u|^{3/2}\, |w|^{1/2}\, |A_\varepsilon^{1/4} w|^{1/2}$.

With the above definitions, equation (1.5) can be written in evolutionary form as
follows

    $u' + A_\varepsilon u - A_\varepsilon^{1/2} u + B_\varepsilon(u,u) = 0$

and the projections $v$ and $w$ satisfy the system

(1.10)   $v' + A_\varepsilon v - A_\varepsilon^{1/2} v + M B_\varepsilon(u,u) = 0$,
         $w' + A_\varepsilon w - A_\varepsilon^{1/2} w + (I - M) B_\varepsilon(u,u) = 0$.




2.  Regularity  results  for  a  thin  domain 

We now turn to the problem of global existence and regularity of solutions of the
K-S equation on a thin domain (local existence and uniqueness are a consequence of
classical theorems for sectorial evolutionary equations [P]). Our strategy is to prove
existence and regularity for solutions with initial conditions in a large set, over intervals
whose length is independent of the initial condition chosen in this set. This enables us to
patch up local solutions and thus construct a global solution. In order to do this we shall
need some results on the 1D equation, as well as on the reduced 2D equations.


The  dynamics  of  the  reduced  2D  problem 

We recall the reduced 2D Kuramoto-Sivashinsky equations

(2.1)    $v_{1,t} + \Delta_\varepsilon^2 v_1 + \Delta_\varepsilon v_1 + v_1 D_{x_1} v_1 = 0$
         $v_{2,t} + \Delta_\varepsilon^2 v_2 + \Delta_\varepsilon v_2 + v_1 D_{x_1} v_2 = 0$.

We observe that (2.1) is independent of $\varepsilon$ and $v_1$, $v_2$ are independent of $x_2$.
Notice that $v_1$ satisfies the 1D K-S equation. In order to study the dynamics of the
system, we first solve the equation for $v_1$. We note that the absorbing property holds for
$v_1$ in $L^2[0,2\pi]$. Also, since the $\varepsilon$-curl of $v$ is zero, $v_2$ is also independent of $x_1$, hence
this function depends only on time. It then follows from the equation satisfied by $v_2$ that
it must be constant. However, its average is zero, hence $v_2$ is identically zero.
Therefore, the reduced 2D K-S equation has a global attractor, namely $\mathcal{A} \times \{0\}$,
where $\mathcal{A}$ is the global attractor of the 1D K-S equation.


Growth  Estimates  for  the  reduced  equation 

Let us recall some results regarding the one-dimensional K-S equation

(2.2)    $u_t + \Delta^2 u + \Delta u + u u_x = 0$.

We know ([I], [NST1,2]) that (2.2) has a global attractor $\mathcal{A}$ in $X$, and that there exist
constants $\rho_0$ and $\rho_1$ such that

    $\limsup_{t \to \infty} |S(t)u_0|^2 < \rho_0^2$,
    $\limsup_{t \to \infty} |A^{1/4} S(t)u_0|^2 < \rho_1^2$,

where $S(t)$ is the solution semigroup of (2.2). Moreover, there exist absorbing balls of
radii $2\rho_0$ and $2\rho_1$ in $X$ and $X^{1/4} = D(A^{1/4})$. The next results are of a technical nature,
and make more precise the above absorbing properties.

Lemma 2.1

Consider Eq. (2.2) in 1D with a periodic boundary condition and initial condition.
Then there exist absolute positive constants $\gamma$ and $L$, and a function $D$, real analytic in $|u_0|$,
such that the solution semigroup satisfies the estimate

    $|A^{1/4} S(t)u_0|^2 < e^{-2\gamma t} D + L$ for $t > 0$.

This result is an easy consequence of the dissipativity estimates for (2.2)
[NST1,2].

Lemma 2.2

Given $k$ such that $0 < k < 1$, there exist absolute positive constants $b_i$, $i = 1,2$,
such that for all $u_0 \in X^{1/4}$ one has

    $|A^{1/4} S(t)u_0|^2 < L + k\, |A^{1/4} u_0|^2$ for $t > T_0$, where $L$ is a constant and
    $T_0 = b_1 \exp(b_2 |A^{1/4} u_0|^4)$.

Lemma  2.3 

Assume that $B_1 > L$, with $L$ as above. Then there exists a $K_0 > 1$ such that for
all $\eta$ such that $0 < \eta < 1$ and for all $u_0 \in D(A^{1/4})$ satisfying $L < |A^{1/4} u_0|^2 < B_1^2$ the
following holds for $t > 0$:

IA1/4S(t)uol2  <  Kq  IA1/4uol2  q*  ' 2  where  Tj*  - 2  =  exp  (ai  exp  ^Bi4^- 4) 
and  aj,a2  are  constants. 

The  proofs  of  these  results  are  very  similar  to  those  in  [RS]. 

The  growth  condition  "G" 

We will often consider functions $\eta = \eta(\varepsilon)$ for which $\eta^{-1}$ blows up at a certain
"sufficiently slow" rate as $\varepsilon \to 0^+$. Instead of giving a list of the requirements on $\eta$, we
note that a function that satisfies these growth conditions is, for example,

    $\eta(\varepsilon) = \{A + B \log\log\log\log(C \varepsilon^{-1})\}^{-1/4}$ for appropriate constants $A$, $B$, $C$.

Our first result is a lower bound on the blow-up time of solutions. Let us define, for a
given initial condition $u_0$, the blow-up time in $D(A_\varepsilon^{1/4})$:

    $T^* = T^*(u_0) = \sup\{ t : \sup_{0 < s < t} |A_\varepsilon^{1/4} u(s)| < \infty \}$.

Then, by a fairly crude estimate, one has

    $T^* \ge \dfrac{K}{|A_\varepsilon^{1/4} u_0|^2}$,

where $K$ is a constant, independent of the initial conditions. As an easy consequence we
have:

Lemma  2.4 

Let $R_0$ and $N > 1$ be given. Then, for any $u_0 \in D(A^{1/4})$ such that $|A^{1/4} u_0|^2 < R_0$
the following holds:

    $|A^{1/4} S(t)u_0|^2 < N R_0$ for $0 < t < \dfrac{N-1}{N} \cdot \dfrac{K}{R_0}$,

where $K$ is a constant, independent of $u_0$.

For simplicity, we shall henceforth assume that $K = 1$.
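The time horizon in Lemma 2.4 has the form one would get from a quadratic (Riccati-type) comparison. The sketch below assumes, purely for illustration, that $y(t) = |A^{1/4} S(t)u_0|^2$ is dominated by the solution of $y' = y^2/K$; this comparison equation is an assumption made here to explain the shape of the bound and is not taken from the authors' proof. Under it, $y(t) \le y_0/(1 - y_0 t/K)$, which blows up no earlier than $K/y_0$ and stays below $N R_0$ up to time $\frac{N-1}{N}\frac{K}{R_0}$.

```python
def riccati_bound(y0, t, K=1.0):
    """Solution of the assumed comparison ODE y' = y**2 / K, y(0) = y0, for t < K / y0."""
    if t >= K / y0:
        raise ValueError("t is at or past the blow-up time K / y0")
    return y0 / (1.0 - y0 * t / K)

def safe_time(R0, N, K=1.0):
    """Time horizon appearing in Lemma 2.4: (N - 1) / N * K / R0."""
    return (N - 1.0) / N * K / R0

# Hypothetical values of R0 and N, chosen only to exercise the formulas.
R0, N = 5.0, 4.0
T = safe_time(R0, N)
worst_case = riccati_bound(R0, T)   # equals N * R0 when y0 = R0
```

With `y0` strictly below `R0`, the comparison solution stays strictly below `N * R0` on the whole interval `[0, T]`.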

In  the  following  Lemma,  we  give  a  short-time  result  which  will  be  essential  in 
the  proof  of  the  main  global  existence  theorem. 

Lemma  2.5 

Let $B_0 > 0$, $C_0 > 0$ be given, and consider a function $\eta$ satisfying the growth
hypothesis (G) indicated above. Then there exist $\varepsilon_0$, $0 < \varepsilon_0 < 1$, $B_1 > B_0$, $C_1 > C_0$ and
$T_1 = T_1(\varepsilon) > 0$ such that the following hold for all $\varepsilon$ such that $0 < \varepsilon < \varepsilon_0$:

J_  _t_ 

If  I  A4vQl2  <  B2q'2  and  IA4w0l2  <  e ,  then 

j_  2. 

IA^vCTj)!2  <  B2tt2  and  IaVcTj/  <dxt\2  . 

The quantities $\varepsilon_0$, $B_1$, $C_1$ depend on $B_0$, $C_0$, but not on $\varepsilon$, and $T_1(\varepsilon) \to 0$ as $\varepsilon \to 0^+$.

The proof follows by taking the inner product of equations (1.6) for $v$ and $w$ with $A^{1/2} v$
and $A^{1/2} w$, respectively. The growth rate of $\eta$ plays a particularly important role in the
estimates.

As  a  direct  consequence  of  Lemma  2.5  we  have  the  following  result: 

Lemma  2.6 

Let $B_0 > 0$ and $C_0 > 0$ be given, and let $\eta$ satisfy hypothesis (G). Then there
exist $\varepsilon_0$, $0 < \varepsilon_0 < 1$, $B_1 > B_0$, $C_1 > C_0$ and $T_1 > 0$ such that the following holds for all $\varepsilon$
satisfying $0 < \varepsilon < \varepsilon_0$:
If  1A^v0I2  <  B2  rj-2  and  |A£4wQl2  ^  C^e-1,n~1  then

i  i 

IA^vCTj)!2  <  B2  Tp2  and  !A4  wCT^I2  <  C^e. 

Lemma  2.7 

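Fix $B_1$ and $C_1$ and let $u_0 = (v_0, w_0)$ be chosen such that

    $|A_\varepsilon^{1/4} v_0|^2 \le B_1^2 \eta^{-2}$ and $|A_\varepsilon^{1/4} w_0|^2 < C_1^2 \varepsilon$.

Let $K_0 > 1$ be given by Lemmas 2.2 and 2.3 with, say, $k = 1/8$ and set $N = 4K_0$, and
define $T_0 = T_0(\varepsilon) = b_1 \exp(b_2 B_1^4 \eta^{-4})$ so that one has

    $|A^{1/4} v(t)|^2 < L + \tfrac{1}{8} |A^{1/4} v_0|^2$, $t > T_0$.

Next, define $\tau_N$ by

    $\tau_N = \sup\{ \tau > 0 : |A_\varepsilon^{1/4} u(t)|^2 \le N D_1^2 \eta^{-2}$ for $0 < t < \tau \}$

where $D_1^2 = B_1^2 + C_1^2$.

Then there exists $\varepsilon_0$, $0 < \varepsilon_0 < 1$, such that for all $\varepsilon$ satisfying $0 < \varepsilon < \varepsilon_0$ the
following hold:

    $T_0 < \tau_N$,
    $|A_\varepsilon^{1/4} v(T_0)|^2 \le B_1^2 \eta^{-2}$,
    $|A_\varepsilon^{1/4} w(T_0)|^2 < C_1^2 \varepsilon$.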

The  estimate  for  w  is  straightforward,  while  the  estimate  for  v  relies  heavily  on  a 
comparison  between  v  and  the  corresponding  solution  of  the  reduced  problem. 

The  existence  and  ultimate  boundedness  of  solutions  are  now  an  easy 
consequence  of  this  Lemma. 

Theorem  2.8 

Consider the Kuramoto-Sivashinsky equation with periodic boundary conditions
given by the thin periodicity cell $[0,2\pi] \times [0,2\pi\varepsilon]$ and a periodic initial condition. Then
there exist $\varepsilon_0$, $0 < \varepsilon_0 < 1$, and a constant $B_0$, and on this range there are real-valued
functions $R(\varepsilon)$ and $K(\varepsilon)$ such that $R(\varepsilon) > 0$, $K(\varepsilon) > 1$ and $R(\varepsilon) \to \infty$ as $\varepsilon \to 0^+$, and such
that for all $U_0 \in D(A^{1/4})$ such that $|U_0|_{1/4} < R(\varepsilon)$ one has $U(t) \in D(A^{1/4})$ for all $t > 0$ and

    $|U(t)|_{1/4} < K(\varepsilon) R(\varepsilon)$ for all $t > 0$,

    $\limsup_{t \to \infty} |U(t)|_{1/4} < B_0$.

This result follows by repeatedly applying Lemma 2.7. Notice that we are stating
the result in terms of the original function $U$.

Existence  of  a  local  attractor 

We recall that a set $\mathcal{A}$ is a local attractor for a nonlinear semigroup $\{S(t)\}$ if it is
compact, invariant, and there exists a bounded neighborhood $B$ of $\mathcal{A}$ such that $\mathcal{A}$ attracts
$B$. Let us also recall the following result:

Lemma 2.10 ([Ha], Lemma 3.2.1)

Let $\{S(t), t > 0\}$ be an asymptotically smooth semigroup in a Banach space $X$
and let $B$ be a nonempty set in $X$ such that the semiorbit $\gamma^+(B)$ is bounded. Then $\omega(B)$ is
nonempty, compact, invariant, and it attracts $B$. If $B$ is connected, so is $\omega(B)$.

By using this result for $B = B_\varepsilon = \{U : |U|_{1/4} < R(\varepsilon)\}$ we see that $\mathcal{A}_\varepsilon = \omega(B_\varepsilon)$ is
a local attractor whose basin of attraction contains at least the set $B_\varepsilon$.
Abstract

We summarize results on a dissipative infinite dimensional evolution equation having
an associated Energy (Liapunov) function which possesses no classical minimizers.
The equation has a countable set of equilibria, all unstable, which form a minimizing
sequence, and our main result implies that a large, dense set of solutions explore this
sequence. In doing so, energy approaches its global minimum via an escape to arbitrarily
high wavenumbers in Fourier space. We describe the asymptotics of this process
and illustrate it with numerical computations. This equation exhibits a remarkably
subtle dependence on initial data very different from that in classical finite dimensional
“chaos”.


Introduction 

This study concerns the behavior of the equation

    $u_{tt} = (\|u_x\|^2 - 1) u_{xx} - \alpha u + \beta u_{xxt}$,    (1)

which was motivated by the simple model of a one-dimensional nonlinear viscoelastic bar
bonded to a rigid substrate, namely

    $u_{tt} = (u_x^3 - u_x + \beta u_{xt})_x - \alpha u$.    (2)

Further details on the background to these problems can be found in [1] and full details
are in [2], from which most of the following is adapted. Here the displacement $u = u(x,t)$
is defined on $x \in (0,\pi)$ with Dirichlet boundary conditions $u(0,t) = u(\pi,t) = 0$ and
$\|u\|^2 = \int_0^\pi u^2\, dx$. The term $\alpha u$ penalizes large displacements and tends to promote the
formation of microstructure. The $\beta u_{xxt}$ term represents viscoelastic damping. The more
tractable model (1) is obtained by replacing the nonlinear term $u_x^2 u_{xx}$ in (2) by the spatially
averaged term $\|u_x\|^2 u_{xx}$. We remark that such non-local terms do arise, for example in
ferromagnetism [3].

*Supported by the U.S. Army Research Office under ARO DAAG 29-85-C0018 (Mathematical Sciences
Institute) and NSF under DMS 87-03656. The computations were performed using the Cornell National
Supercomputer Facility, which receives major funding from NSF and the IBM Corporation.

We henceforth restrict our attention to (1). This model can also be interpreted as the
Euler-Lagrange equation corresponding to the nonlocal “strain” energy

    $E[u,u_t] = \tfrac{1}{2} \|u_t\|^2 + \tfrac{1}{4} (\|u_x\|^2 - 1)^2 + \tfrac{\alpha}{2} \|u\|^2$    (3)

to which has been added viscoelastic dissipation, which constantly bleeds off energy at the
rate

    $\dfrac{dE}{dt} = -\beta \|u_{xt}\|^2$.    (4)

The minimum of this energy cannot however be attained by any classical solution, since
the conflicting requirements $u = 0$ and $\|u_x\|^2 = 1$ cannot be met. In other similar problems
with nonconvex energies, the equilibrium solutions are characterized by a severe loss of
uniqueness. It is by studying the long term dynamical behavior of such systems that we
hope to shed some light on their equilibrium solutions.


Equilibrium  States 


Expanding $u(x,t)$ in the Fourier sine series

    $u(x,t) = \sum_{k=1}^{\infty} a_k(t) \sqrt{2/\pi}\, \sin kx$    (5)

we obtain the infinite set of ODE's

    $\ddot{a}_k + \beta k^2 \dot{a}_k + k^2 \Big( \dfrac{\alpha}{k^2} - 1 + \sum_{j=1}^{\infty} j^2 a_j^2 \Big) a_k = 0$,  $k = 1, 2, \ldots$    (6)

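In addition to the trivial solution $u = u_0 = 0$ the equilibria of (1) are easily seen to occur
in “pure-mode” pairs

    $a_k = \pm \dfrac{1}{k} \sqrt{1 - \dfrac{\alpha}{k^2}}$,  $a_j = 0$, $j \ne k$,  $k^2 > \alpha$.    (7)

We therefore have the countable set of equilibria

    $u_0 = 0$,  $u_k^{\pm} = \pm \dfrac{1}{k} \sqrt{\dfrac{2}{\pi}} \sqrt{1 - \dfrac{\alpha}{k^2}}\, \sin kx$,  $k = K, K+1, \ldots$    (8)

where $K = K(\alpha) = \min\{k \mid k^2 > \alpha\}$. Since $E[u_k^{\pm},0] = \frac{\alpha}{2k^2}\big[1 - \frac{\alpha}{2k^2}\big] \searrow 0$ as $k \to \infty$, it follows
that $\{u_k^{\pm}\}$ is a minimizing sequence for this Liapunov function.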
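The equilibrium energies can be checked directly. The following is a small sketch, assuming the orthonormal sine basis of (5), so that a single active mode $k$ with amplitude $a_k$ has $\|u\|^2 = a_k^2$ and $\|u_x\|^2 = k^2 a_k^2$; the value of `alpha` is a hypothetical choice made only for illustration.

```python
import math

def equilibrium_amplitude(k, alpha):
    """a_k from (7): (1/k) * sqrt(1 - alpha / k**2), defined when k**2 > alpha."""
    return math.sqrt(1.0 - alpha / k**2) / k

def energy_at_equilibrium(k, alpha):
    """Energy (3) with u_t = 0 and a single active mode k."""
    a = equilibrium_amplitude(k, alpha)
    strain = (k * a) ** 2                      # ||u_x||^2 for a pure mode
    return 0.25 * (strain - 1.0) ** 2 + 0.5 * alpha * a**2

def closed_form(k, alpha):
    """The value (alpha / 2k^2) * (1 - alpha / 2k^2) quoted in the text."""
    x = alpha / (2.0 * k**2)
    return x * (1.0 - x)

def restoring_term(k, alpha):
    """Last term of (6) evaluated at the pure-mode state; it vanishes at an equilibrium."""
    a = equilibrium_amplitude(k, alpha)
    return k**2 * (alpha / k**2 - 1.0 + (k * a) ** 2) * a

alpha = 2.5                                               # hypothetical parameter value
K = min(k for k in range(1, 100) if k * k > alpha)        # K(alpha) = min{k : k^2 > alpha}
energies = [energy_at_equilibrium(k, alpha) for k in range(K, K + 6)]
```

In this sketch the list `energies` decreases with `k` and agrees with `closed_form`, matching the statement that the equilibria form a minimizing sequence.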

A local stability analysis shows that every equilibrium $u_k^{\pm}$ is exponentially unstable,
albeit increasingly weakly as the wavenumber $k$ is increased. Moreover, if $k_2 > k_1 \ge K$,
then $u_{k_2}^{\pm}$ lie in the unstable manifolds of $u_{k_1}^{\pm}$. To see this, note that each $2N$-dimensional
subspace of the form $X_N = \{(u,u_t) \mid (u,u_t) = \sum_{j=1}^{N} (a_j, \dot{a}_j) \sin jx\}$ is invariant for (6) (and
(1)). This fact is used in the proof of Theorem 1 below.
Asymptotic behavior

We establish a dichotomy which implies that solutions behave either in a “finite dimensional”
fashion, essentially involving only a finite set of Fourier modes, or that all Fourier modes
are active and that energy cascades out to infinity in wave number space. Since typical
initial data contains arbitrarily high Fourier wavenumbers, almost all solutions will contain
“unstable” Fourier components and hence realize the second alternative.

Theorem 1 (J.M.Ball) Let $(u,u_t) \in X = H_0^1 \times L^2$ solve (1). Then as $t \to \infty$, either
$(u,u_t) \to (u_k^{\pm},0)$ strongly for some equilibrium $u_k^{\pm}$ and $E(t) \to \frac{\alpha}{2k^2}\big[1 - \frac{\alpha}{2k^2}\big]$, or

    $\|u_t\| \to 0$, $u \rightharpoonup 0$ weakly in $H_0^1$,
    $\|u_x\| \to 1$ and $E(t) \to 0$.

The  first  alternative  is  realized  for  initial  conditions  in  a  set  of  first  category  and  dense 
in  the  phase  space,  whilst  the  second  alternative  is  realized  for  all  initial  conditions  in  its 
complement,  a  set  of  second  category  and  also  dense  in  the  phase  space. 

This  is  a  striking  result.  Arbitrary  initial  data  can  be  arbitrarily  close  to  orbits  realizing 
either  alternative,  implying  a  sensitive  dependence  on  initial  data  that  is  quite  different 
from  that  in  chaotic  dynamical  systems,  being  truly  infinite  dimensional  and  without  any 
recurrence. 


Energy  Transport  to  higher  wavenumbers 

Since “most” solutions do minimize the energy by developing some form of microstructure
we now describe how this happens. It is most conveniently stated in terms of the Fourier
components of the “strain” $u_x = \sum_k b_k \sqrt{2/\pi}\, \cos kx$, $c_k = \dot{b}_k$.

Theorem 2 Assume that the second alternative of Theorem 1 holds and pick any $\nu > 0$
and $K < \infty$. Then there exists a time $T = T(\nu, K, \alpha, \beta) < \infty$ such that, for all $t > T$ and
$k < K$ the solutions of (1) satisfy

    $|(b_k,c_k)(t)| < \nu$ and $\big|1 - \sum_{k=1}^{\infty} b_k^2\big| < \nu^2$.    (9)

Moreover, for all $k \ne l$ with $k,l > K$ and $t > T$, the modal ratio $\rho_{kl} = b_k/b_l$ satisfies

    $\rho_{kl}(t) = \exp\Big[ \dfrac{\alpha}{\beta} \Big( \dfrac{1}{l^2} - \dfrac{1}{k^2} \Big) P_{k,l}(t-T) \Big] \rho_{kl}(T)$    (10)

where $P_{k,l}(s) = s(1 + O(1/K^2) + O(l^2/k^2K^2))$.

The first assertion follows from Theorem 1. The key to the proof of the second result
is the realization that, for each large $k$, (6) is a singularly perturbed second order ODE.
Noting that $b_k = k a_k$, we may rewrite (6) as

    $\dot{b}_k = c_k$,    (11)

    $\dot{c}_k = k^2 \Big( \big(1 - \dfrac{\alpha}{k^2} - \sum_{j=1}^{\infty} b_j^2\big) b_k - \beta c_k \Big)$,    (12)

solutions of which rapidly enter and thereafter remain near the slow manifold
$c_k = (1 - \frac{\alpha}{k^2} - \sum_{j=1}^{\infty} b_j^2)\, b_k/\beta$. Substitution of this into (11) yields

    $\dot{b}_k = \dfrac{1}{\beta} \Big( 1 - \dfrac{\alpha}{k^2} - \sum_{j=1}^{\infty} b_j^2 \Big) b_k$    (13)

and differentiation of $\rho_{k,l}$ and use of (13) yields (10) with $P_{k,l} = s$. Justification of these
formal manipulations and derivation of the error estimates may be found in [2].

This shows that any specific Fourier mode $b_j$ eventually dies and it describes how the
energy escapes to $k = \infty$. In fact, for $k > l > K$ this shows that high modes grow
exponentially at the expense of low modes and that every mode eventually decays at an
exponential rate. We can use this to illustrate the delicate influence of initial data on modal
dynamics.
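The modal ratio law can be checked numerically on the reduced dynamics. The following is a minimal sketch, assuming the slow-manifold equation $\dot b_k = \beta^{-1}(1 - \alpha/k^2 - \sum_j b_j^2)\, b_k$ truncated to a few modes and integrated with a classical fourth-order Runge-Kutta step; the parameter values, the initial data and the step size are hypothetical, and the stiff full system (6) is not treated here. Because the nonlocal sum is common to all modes, the ratio $b_k/b_l$ should grow exactly like $\exp[(\alpha/\beta)(1/l^2 - 1/k^2)\,t]$.

```python
import math

def rhs(b, alpha, beta):
    """Right-hand side of the reduced equation, truncated to len(b) modes (k = 1..len(b))."""
    s = sum(x * x for x in b)
    return [(1.0 - alpha / (k * k) - s) * x / beta for k, x in enumerate(b, start=1)]

def rk4_step(b, h, alpha, beta):
    """One classical Runge-Kutta step of size h."""
    k1 = rhs(b, alpha, beta)
    k2 = rhs([x + 0.5 * h * d for x, d in zip(b, k1)], alpha, beta)
    k3 = rhs([x + 0.5 * h * d for x, d in zip(b, k2)], alpha, beta)
    k4 = rhs([x + h * d for x, d in zip(b, k3)], alpha, beta)
    return [x + h / 6.0 * (p + 2 * q + 2 * r + s)
            for x, p, q, r, s in zip(b, k1, k2, k3, k4)]

alpha, beta, n_modes = 1.0, 1.0, 8                       # hypothetical parameters
b0 = [0.5 * math.exp(-0.3 * k) for k in range(1, n_modes + 1)]
b, h, steps = list(b0), 0.01, 500
for _ in range(steps):
    b = rk4_step(b, h, alpha, beta)
t = h * steps

k, l = 6, 3                                              # compare a high mode with a lower one
ratio_numeric = (b[k - 1] / b[l - 1]) / (b0[k - 1] / b0[l - 1])
ratio_formula = math.exp(alpha / beta * (1.0 / l**2 - 1.0 / k**2) * t)
```

For $k > l$ the exponent is positive, so `ratio_formula` exceeds one: the higher mode gains on the lower one, as described in the text.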


Figure  1:  Typical  evolution  of  the  modal  energy. 

Suppose first that the initial data is analytic, namely that $b_l(T) = A e^{-cl}$ for some
$A, c > 0$. Then, as $t \to \infty$, the peak of the energy in wavenumber space can be shown
to slowly move out to $k = \infty$ at a rate $\sim t^{1/3}/c^{1/3}$ and the “bump” spreads out with half
bandwidth increasing as $t^{1/6}/c^{2/3}$. This behavior is illustrated in Figure 1.
Secondly, suppose that $b_l(T) = A l^{-r}$, so that $u \in C^{r-1}$. In this case it follows
from (10) that the peak of the energy in wavenumber space moves out to $k = \infty$ at a
rate $\sim t^{1/2}/r^{1/2}$ and the “bump” spreads out with half bandwidth increasing as $t^{1/2}/r$ as $t \to \infty$.
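The outward drift of the energy peak for exponentially decaying data can be sketched numerically. The block below assumes the error-free form of the modal ratio law (that is, $P_{k,l}(s) = s$ and $T = 0$), so that up to a factor common to all modes $b_k(t) \propto e^{-ck} e^{-\alpha t/(\beta k^2)}$; maximizing the exponent over $k$ gives $k^3 = 2\alpha t/(\beta c)$. The function names and parameter values are illustrative only.

```python
import math

def log_profile(k, t, alpha, beta, c):
    """Log of exp(-c k) * exp(-alpha t / (beta k^2)); the common scaling factor is dropped."""
    return -c * k - alpha * t / (beta * k * k)

def peak_mode(t, alpha, beta, c, k_max=5000):
    """Integer wavenumber in 1..k_max carrying the largest modal amplitude at time t."""
    return max(range(1, k_max + 1), key=lambda k: log_profile(k, t, alpha, beta, c))

def predicted_peak(t, alpha, beta, c):
    """Continuous maximizer of log_profile: k = (2 alpha t / (beta c))**(1/3)."""
    return (2.0 * alpha * t / (beta * c)) ** (1.0 / 3.0)
```

Since the exponent has a single interior maximum in $k$, the integer peak returned by `peak_mode` lies within one unit of `predicted_peak`, and multiplying $t$ by eight roughly doubles the peak wavenumber.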

Numerical experiments (performed over large finite times and restricted to a large but finite
number of Fourier modes) showed these estimates to be very accurate [2]. The system (6),
truncated to $N$ modes, becomes increasingly stiff as $\|u_x\|^2 - 1$ decreases in size and was
numerically integrated using the backward differentiation algorithm DDEBDF from the
SLATEC subroutine library.


Numerical  Examples 

We  now  illustrate  the  sensitive  dependence  on  initial  conditions  with  some  simple  examples. 
In  order  to  avoid  the  rather  costly  numerical  integration  of  the  full  system  (6),  we  assume 
that  the  solution  is  already  on  the  slow  manifold  and  that  the  reduced  system  (13)  therefore 
provides  a  good  description  of  the  dynamics.  The  exact  solution  to  (13)  is  given  by 

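    $b_k(t) = A(t)\, e^{-\alpha t/(\beta k^2)}\, b_k(0)$,  $k = 1, 2, \ldots$    (14)

where

    $A(t) = \Big[ e^{-2t/\beta} + \sum_{j=1}^{\infty} \dfrac{b_j^2(0)}{1 - \alpha/j^2} \Big( e^{-2\alpha t/(\beta j^2)} - e^{-2t/\beta} \Big) \Big]^{-1/2}$    (15)

    $\approx \Big[ \sum_{j=1}^{\infty} \dfrac{b_j^2(0)}{1 - \alpha/j^2}\, e^{-2\alpha t/(\beta j^2)} \Big]^{-1/2}$.    (16)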

Note that the $e^{-\alpha t/(\beta k^2)} b_k(0)$ term represents the solution to the “truly backwards” heat
equation $\beta u_{xxt} = \alpha u$ and that the $A(t)$ term acts as a uniform scaling so as to achieve $\|u_x\| \approx 1$.
The modal ratio $\rho_{kl} = b_k/b_l$ for (14) now satisfies (10) but without any error terms.
Sensitive dependence on initial conditions is best illustrated using the long-term approximation
given by (16). A further simplification follows by assuming that the first $N$ modes have
already decayed to zero, and choosing $\alpha \ll N^2$ while keeping $\alpha/\beta$ fixed, so as to obtain


A(t)  = 


j= i 


1  +  0(  ■ 


,  _ 

+  0(e  J 


which provides an accurate description of the long-time evolution of (1) for initial data
containing at least one nonzero high frequency component. We now use this approximation
with 1024 Fourier modes and $\alpha/\beta = 1$ to generate some simple numerical examples
displaying sensitive dependence on initial conditions.
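A closed form for the scaling factor $A(t)$ can be sketched for a finite number of modes. The block below is a derivation made here, not a transcription of the authors' formulas: writing $b_k(t) = A(t) e^{-\alpha t/(\beta k^2)} b_k(0)$ in the reduced equation $\dot b_k = \beta^{-1}(1 - \alpha/k^2 - \sum_j b_j^2) b_k$ gives, for $y = A^{-2}$, the linear equation $\dot y = -(2/\beta)(y - G(t))$ with $G(t) = \sum_j b_j^2(0) e^{-2\alpha t/(\beta j^2)}$ and $y(0) = 1$, which is solved explicitly. It is assumed that $\alpha \ne j^2$ for every active mode; all parameter values are hypothetical.

```python
import math

def scaling_A(t, b0, alpha, beta):
    """A(t) from y = A**-2 solving y' = -(2/beta) * (y - G(t)), y(0) = 1."""
    fast = math.exp(-2.0 * t / beta)
    y = fast
    for j, bj in enumerate(b0, start=1):
        rate = 1.0 - alpha / (j * j)           # assumed nonzero (alpha != j^2)
        slow = math.exp(-2.0 * alpha * t / (beta * j * j))
        y += bj * bj * (slow - fast) / rate
    return 1.0 / math.sqrt(y)

def modes(t, b0, alpha, beta):
    """b_k(t) = A(t) * exp(-alpha t / (beta k^2)) * b_k(0), for k = 1..len(b0)."""
    A = scaling_A(t, b0, alpha, beta)
    return [A * math.exp(-alpha * t / (beta * k * k)) * bk
            for k, bk in enumerate(b0, start=1)]

b0 = [0.3, 0.2, 0.1]          # hypothetical initial strain coefficients
alpha, beta = 0.5, 1.0        # hypothetical parameters
late = modes(200.0, b0, alpha, beta)
```

With only three active modes, the square of the highest one in `late` approaches $1 - \alpha/9$ while the lower modes decay, in line with the behavior described for finitely many active modes.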

In our first example we illustrate the effect of ignoring the high-frequency components that are present in generic initial data. Figure 2 shows the evolution of analytical initial data $u_x(\cdot,0)$ with Fourier components $b_k(0) = C s_k e^{-k}$, where the "resonant" coefficients $s_k = \{1, 1, -1, -1, 1, 1, -1, -1, \ldots\}$ are chosen to force concentration in the interior of the interval $(0, \pi)$ and $C$ is chosen such that $\|u_x(\cdot,0)\|_2 = 1$. These initial conditions illustrate how solutions of (1) can focus spatially and form localized fine structure. Note that the increasingly finer oscillations observed as $t \to \infty$ are not a numerical artifact, but are characteristic of a solution $u$ that converges weakly to zero in $H_0^1$ while forced to satisfy $\|u_x(\cdot,t)\|_2 \approx 1$ (cf. Theorem 1). The first 20 modes of such initial data provide an excellent approximation at $t = 0$, and Figure 3 shows the evolution of this "truncated" problem. If there are initially only a finite number $N$ of active Fourier modes, then (14) implies that, after a time $t \sim O(\beta N^2/\alpha)$, all the modes except $b_N$ will be decaying exponentially, with the highest active mode $b_N(t) \to \pm\sqrt{1 - \alpha/N^2}$, preserving the sign of $b_N(0)$. This is clearly displayed by the large-time behavior shown in Figure 3. Note that the solutions only diverge after $t \sim 1000$, as expected.

Figure 5: Approximate numerical solution of (1) using 1024 nonzero Fourier modes obtained via (14) as described in text. Initial condition is given by $u_x(x,0) = -H_{0,200}(x - 0.7\pi) - 0.1\,H_{200,1024}(x - 0.3\pi)$.
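The takeover by the highest active mode can be illustrated with a short sketch. It assumes that the scaling $A(t)$ simply rescales the coefficient vector to unit Euclidean norm at every time, which is a simplification of (15) and (16) and not the paper's exact expression; the function names and the default `alpha_over_beta = 1` are likewise illustrative choices.

```python
import math

def reduced_modes(b0, t, alpha_over_beta=1.0):
    """Return b_k(t), k = 1..len(b0), for b_k(t) ~ exp(-(alpha/beta) t / k^2) b_k(0).

    Assumption: A(t) is replaced by an exact rescaling to unit Euclidean norm.
    """
    expo = [-alpha_over_beta * t / (k * k) for k in range(1, len(b0) + 1)]
    # Shift by the largest exponent among the active modes to avoid underflow.
    shift = max(e for e, b in zip(expo, b0) if b != 0.0)
    raw = [b * math.exp(e - shift) if b != 0.0 else 0.0 for e, b in zip(expo, b0)]
    norm = math.sqrt(sum(v * v for v in raw))
    return [v / norm for v in raw]

def resonant_data(n_modes):
    """Truncated data b_k(0) proportional to s_k e^{-k}, s_k = 1, 1, -1, -1, ..."""
    signs = [1.0 if ((k - 1) // 2) % 2 == 0 else -1.0 for k in range(1, n_modes + 1)]
    return [s * math.exp(-k) for s, k in zip(signs, range(1, n_modes + 1))]

if __name__ == "__main__":
    b0 = resonant_data(20)
    for t in (0.0, 1.0e3, 1.0e4, 1.0e6):
        b = reduced_modes(b0, t)
        dominant = max(range(len(b)), key=lambda i: abs(b[i])) + 1
        print(f"t = {t:9.1e}: dominant mode k = {dominant}")
```

With 20 active modes the lowest mode carries most of the norm at $t = 0$, while at very long times essentially all of it sits in mode 20, with the sign of $b_{20}(0)$ preserved.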

In our next example we show how the solutions of the reduced system (14) can be made to display strong spatial concentrations at arbitrarily chosen points and after arbitrarily long times. Figure 4 shows the solution corresponding to $u_x(x,0) = -H(x - 0.7\pi)$, where $H$ is the Heaviside step function, shifted to be of zero average and scaled such that $\|u_x(\cdot,0)\|_2 = 1$. This example also illustrates the "persistence of strain discontinuities" (cf. [2]): discontinuities in $u_x$ cannot be destroyed or created in finite time. Let $H_{k,l}$ be the function obtained by considering only the contribution of the $k$th through the $l$th Fourier modes of $H$ (rescaled to be of unit $L^2$ norm). $H_{k,\infty}(x - x_0)$ represents a "spike" at $x_0$ that can be localized by choosing $k$ large. Figure 5 shows the evolution of the initial data $u_x(x,0) = -H_{0,200}(x - 0.7\pi) - 0.1\,H_{200,1024}(x - 0.3\pi)$. The first term dominates initially, but decays after $t \approx 30000$. The second term, although initially an almost unnoticeable little "spike" at $x = 0.3\pi$, displays its presence after the first term has begun to decay.
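The band-limited step $H_{k,l}$ can be sketched as follows. The sketch assumes that $u_x$ is expanded in the cosine basis $\cos(jx)$ on $(0, \pi)$, which the text does not state explicitly; the function names are hypothetical.

```python
import math

def band_heaviside_coeffs(x0, k, l):
    """Unit-L2-norm cosine coefficients c_j, j = k..l, of the step H(x - x0) on (0, pi).

    Assumption: cosine basis; the j = 0 (mean) mode is dropped (zero-average data).
    """
    k = max(k, 1)
    # (2/pi) * integral_{x0}^{pi} cos(j x) dx = -2 sin(j x0) / (pi j)
    raw = {j: -2.0 * math.sin(j * x0) / (math.pi * j) for j in range(k, l + 1)}
    # Parseval on (0, pi): || sum_j c_j cos(j x) ||^2 = (pi / 2) * sum_j c_j^2
    norm = math.sqrt(0.5 * math.pi * sum(c * c for c in raw.values()))
    return {j: c / norm for j, c in raw.items()}

def evaluate(coeffs, x):
    """Evaluate sum_j c_j cos(j x) at the point x."""
    return sum(c * math.cos(j * x) for j, c in coeffs.items())

if __name__ == "__main__":
    spike = band_heaviside_coeffs(0.3 * math.pi, 200, 1024)
    grid = [math.pi * i / 2000 for i in range(2001)]
    x_peak = max(grid, key=lambda x: abs(evaluate(spike, x)))
    print("largest amplitude near x/pi =", x_peak / math.pi)
```

For $k = 200$ the largest amplitude sits within a few grid points of $x_0 = 0.3\pi$, which is the localization property used to build the initial data of Figure 5.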

We can therefore construct initial data of the form $\sum_j H_{k_j,l_j}(x - x_j)$, with $k_j < l_j \ll k_{j+1} < l_{j+1}$ and arbitrary $x_j$, for which the process observed in Figure 5 can be repeated as often as we wish, causing the slow and successive appearance and disappearance of spatial concentrations at $x = x_1, x_2, \ldots$. Typical solutions of (1) therefore need not stabilize pointwise and can display the formation of concentrations in a seemingly haphazard fashion. Even when adding stronger dissipation to the system, e.g. by adding a capillarity term $-\gamma u_{xxxx}$ to the right hand side of (1), this process can still continue for extremely long times before the solution eventually settles down to an equilibrium.


Conclusion 

While only of indirect physical interest, we believe the model discussed here provides a significant example of behavior characteristic of certain infinite dimensional evolution equations. In spite of the strong dissipation and the fact that the energy $E$ decreases monotonically along solutions, excluding any chaotic or time-periodic motions, the initial data exerts a remarkable influence on the dynamical behavior and approach to equilibrium. In "simple" dissipative systems possessing Liapunov functions, such as the Chafee-Infante problem:

$u_t = u_{xx} + \lambda f(u)$ (e.g. $f(u) = u - u^3$)

or the damped nonlinear wave equation:

$u_{tt} = u_{xx} - \beta u_t + \lambda f(u)$,

one expects almost all solutions to approach a classical equilibrium corresponding to a local minimum in energy. For models (1) and (2), however, the forward orbits $\{u(\cdot,t) \mid t \geq 0\}$ cannot be shown to lie in a compact set and the usual methods fail, cf. [1, 2]. For (1) almost all solutions minimize energy, with the nonlocal nonlinear energy term allowing new zeroes to appear in $u_x$ without appreciable kinetic energy expenditure. However, our asymptotic results show that the rate at which, and the manner in which, the "modal strain energy" $\|u_x\|^2$ escapes to arbitrarily high Fourier wavenumbers is controlled by the smoothness of the initial data. This rather delicate influence of the initial data suggests that there may be problems in
the  interpretation  of  numerically  determined  equilibrium  states  of  nonlinear  elastic  continua 
with  non-convex  strain  energies  by  means  of  “dynamic  relaxation”  methods.  Such  methods 
usually  ignore  the  inherent  dynamics  of  the  problem  and  identify  the  asymptotic  equilibrium 
states  with  the  minima  of  a  nonconvex  energy  functional.  Inertia  and  dissipation  terms  are 
then  added  in  order  to  be  able  to  apply  dynamic  relaxation,  and  the  resulting  initial  value 
problem  is  then  numerically  solved  for  various  initial  data  until  the  kinetic  energy  has 
numerically stabilized. If, as in model (1), initial data can so acutely affect either the fineness of the resulting equilibria or the rate at which fine structure develops, then such a dynamical process run for finite times from specific (sets of) initial data might yield results
of  doubtful  significance.  Secondly,  the  fact  that  such  a  process  appears  to  have  stabilized 
may  merely  be  the  consequence  of  a  long  period  of  extremely  slow  evolution  and  cannot 
rule  out  interesting  surprises  in  the  distant  future. 
ABSTRACT. The instability of a cylindrical liquid jet encapsulated by a viscous gas in a pipe is analysed in a parameter space spanned by the Reynolds number, the Froude number, the Weber number, the density ratio, the viscosity ratio, and the diameter ratio. A convergent solution of the problem is constructed by a Galerkin projection with two orthogonal sets of functions. Two distinctly different modes of instability are obtained. The first is the Rayleigh mode, which tends to break up the jet into drops of diameter comparable with the jet diameter. The second is the Taylor mode, which is due to the pressure fluctuation at the interface and tends to produce droplets of diameters much smaller than that of the jet. It is shown that the former mode appears when the Weber number is much larger than the gas to liquid density ratio. When this ratio is of order one, the instability can be due to either mode, depending on the values of the rest of the parameters. When the density ratio is much larger than the Weber number, Taylor's atomization mode replaces the Rayleigh mode.

INTRODUCTION.  The  instability  of  an  inviscid  liquid  jet  with  respect  to  temporally 
growing  disturbances  in  the  absence  of  gravity  and  ambient  gas  was  analyzed  by  Rayleigh 
(1879).  He  showed  that  the  disturbance  possessing  the  maximum  amplification  rate  could  cause 
the  jet  to  break  up  to  form  droplets  comparable  in  size  with  the  jet  diameter.  Chandrasekhar 
(1961)  showed  that  the  neglected  liquid  viscosity  can  only  reduce  the  amplification  rate  of 
disturbances but cannot suppress the instability caused by capillary pinching. The convective and absolute instability of a liquid jet was investigated by Keller et al. (1972), Leib and Goldstein (1986), and Lin and Lian (1989). Taylor (1963), Lin and Kang (1987) and Lin and Lian (1990) showed that when the gas to liquid density ratio, $Q$, is much greater than the Weber number, a viscous jet of radius $R_1$ may actually become unstable with respect to disturbances of wavelength $\lambda \ll R_1$. Lin and Creighton (1990) found that while the mechanism of Rayleigh's instability is capillary pinching, the mechanism of Taylor's mode is the interfacial pressure fluctuation. However, the effects of the interfacial shear on the Rayleigh and the Taylor modes of the jet instability remain unknown, since the gas viscosity is neglected in all of the above mentioned theories. The effect of the viscosity of a motionless surrounding fluid on the breakup of a motionless viscous cylindrical thread was investigated theoretically by Tomotika (1934). The breakup mechanism for this case remains capillary pinching.

Joseph  et  al.  (1984)  investigated  the  instability  of  two  immiscible  liquids  of  the  same 
density  but  of  different  viscosities  in  a  pipe.  The  interfacial  tension  was  neglected.  The  effects 
of  surface  tension  and  density  stratification  in  the  absence  of  gravity  were  later  included  in  the 
investigations  of  Preziosi  et  al.  (1989)  and  Hu  and  Joseph  (1989).  Smith  (1989)  investigated 
the  instability  of  two  immiscible  fluids  of  the  same  viscosity  but  of  different  densities  in  a 
vertical  pipe.  These  works  are  of  fundamental  importance,  because  they  isolate  the  effects  of 
the density and viscosity discontinuities at the interface. However, they cannot be applied to infer the coupled effects of surface tension, interfacial shear, gravitational acceleration and pressure fluctuation on the Rayleigh and the Taylor modes of instability.


FORMULATION. Consider the stability of a cylindrical liquid jet of radius $R_1$. The jet is surrounded by a viscous gas enclosed in a vertical circular pipe of radius $R_2$ which is concentric with the jet. For the jet to maintain a constant radius, the pressure gradients in the steady liquid and gas flows must be equal to the same constant. This allows the pressure force difference across the liquid-gas interface to be exactly balanced by the surface tension force, as required. Such coaxial flows of liquid and gas, in the presence of gravity, which satisfy exactly the Navier-Stokes equations are given by

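$$W_1(r) = -1 + \frac{N r^2}{N - (1 - l^2)} \left\{ 1 - \frac{(1-Q)\,Re}{4N\,Fr} \left[ 2\ln l + (1 - l^2) \right] \right\},$$

$$W_2(r) = -\frac{l^2 - r^2}{N - (1 - l^2)} \left\{ 1 - \frac{(1-Q)\,Re}{4N\,Fr} \left[ 2\ln l + (1 - l^2) \right] \right\} + \frac{(1-Q)\,Re}{4N\,Fr} \left[ l^2 - r^2 - 2\ln\left(\frac{l}{r}\right) \right], \qquad (1)$$

$N = \mu_2/\mu_1$, $l = R_2/R_1$, $Q = \rho_2/\rho_1$,

$Re$ = Reynolds number = $\rho_1 W_0 R_1/\mu_1$,

$Fr$ = Froude number = $W_0^2/(g R_1)$, $R = Re/Fr$,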

where the subscript 1 or 2 stands for the liquid or the gas phase respectively, $W_0$ is the magnitude of the jet velocity on the z-axis (cf. Fig. 1), $r$ is the radial distance normalized with $R_1$, $W(r)$ is the axial velocity distribution, $\mu$ is the dynamic viscosity, $\rho$ is the density, and $g$ is the gravitational acceleration in the negative z-direction. Some velocity distributions in a water jet and in the surrounding air flow under one atmosphere are given in figure 1 for various values of $Re/Fr$. Note the large difference in the slopes of the velocity profiles in the liquid and the gas phases, due to the large difference in their viscosity, when $R$ is relatively large.
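The basic state can be checked numerically with a minimal sketch. It assumes the profiles have the form $W_1 = -1 + N r^2 \{\cdot\}/[N - (1 - l^2)]$ and $W_2 = -(l^2 - r^2)\{\cdot\}/[N - (1 - l^2)] + K[l^2 - r^2 - 2\ln(l/r)]$, with $K = (1-Q)R/(4N)$; the shorthand `K`, the function name `basic_flow` and the value of $R$ used in the demonstration are introduced here for illustration only.

```python
import math

def basic_flow(N, l, Q, R):
    """Return (W1, W2), the dimensionless axial velocities of liquid and gas.

    N = mu2/mu1, l = R2/R1, Q = rho2/rho1, R = Re/Fr. Assumed form of (1).
    """
    K = (1.0 - Q) * R / (4.0 * N)                      # shorthand, not in the paper
    brace = 1.0 - K * (2.0 * math.log(l) + (1.0 - l * l))
    denom = N - (1.0 - l * l)

    def W1(r):
        return -1.0 + N * r * r / denom * brace

    def W2(r):
        return (-(l * l - r * r) / denom * brace
                + K * (l * l - r * r - 2.0 * math.log(l / r)))

    return W1, W2

if __name__ == "__main__":
    # N and Q as quoted for a water jet in air; the value of R is an arbitrary choice.
    W1, W2 = basic_flow(N=0.018, l=10.0, Q=0.0013, R=0.01)
    print("axis velocity     :", W1(0.0))
    print("interface mismatch:", W1(1.0) - W2(1.0))
    print("wall velocity     :", W2(10.0))
```

The three printed quantities correspond to the normalization $W_1(0) = -1$, continuity of velocity at the interface $r = 1$, and no slip at the pipe wall $r = l$.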

The stability of the basic state described by (1) with respect to a normal mode axisymmetric disturbance is governed by the well known Orr-Sommerfeld equation (Drazin and Reid, 1985),

$$[\omega - (N'/Re)D^2]\,D^2\phi_i(r) + ikW_i(r)\,D^2\phi_i(r) - ikr\,d[dW_i(r)/r]\,\phi_i = 0, \quad (i = 1, 2) \qquad (2)$$

$D^2 = d^2 - r^{-1}d - k^2$, $d = d/dr$, $N' = \nu_i/\nu_1$,

where $\nu$ is the kinematic viscosity, the subscript $i$ stands for the liquid phase or the gas phase depending on whether $i = 1$ or $i = 2$, $\omega$ and $k$ are respectively the dimensionless complex frequency and the wave number of the disturbance, and $\phi_i$ is the amplitude of the normal mode disturbance related to the Stokes stream function by

$$\psi_i(r,z,t) = \phi_i(r)\,e^{ikz + \omega t},$$

where $t$ is time normalized with $R_1/W_0$. The Stokes stream function is related respectively to the radial and axial components of the disturbance velocity by

$$u_i = \psi_{iz}/r \quad \text{and} \quad w_i = -\psi_{ir}/r,$$

where the subscripts $z$ and $r$ denote partial differentiations.

The boundary conditions at the perturbed liquid-gas interface $r = 1 + \eta$ can be linearized by use of the Taylor series expansions of all variables involved about $r = 1$, retaining only terms of the first order in perturbations. Hence, the interfacial conditions are to be evaluated at $r = 1$ with $\eta$ as an additional unknown. Since the interface is a material surface, $\eta$ must satisfy at $r = 1$ the kinematic condition

$$\eta_t + W_1\eta_z = \psi_{1z}.$$

Other interfacial kinematic conditions are the continuity of the radial and tangential components of the velocity across the interface, given respectively by

$$[\psi_{iz}]_2^1 = [\psi_{1z} - \psi_{2z}]_{r=1} = 0, \quad \text{and}$$

$$[W_{ir}\eta - \psi_{ir}]_2^1 = 0.$$

The balancing of forces per unit area of the interface in the tangential and normal directions leads respectively to the dynamic conditions at $r = 1$,

$$[N_i\{\eta W_{irr} - (\psi_{ir}/r)_r + \psi_{izz}\}]_2^1 = 0, \quad \text{and}$$

$$[p_i - (2/Re)\,N_i(\psi_{iz}/r)_r]_2^1 + (\eta + \eta_{zz})We = 0,$$

where $p_i$ is the disturbance pressure,

$We \equiv$ Weber number $= S/(\rho_1 W_0^2 R_1)$, $N_i = \mu_i/\mu_1$,

in which $S$ is the surface tension. Thus, $We$ signifies the ratio of surface tension force to the inertia force per unit area of the interface. The boundary condition at the pipe wall is the no-slip condition at $r = l$,

$$\psi_{2z} = 0, \quad \psi_{2r} = 0.$$

The normal mode axisymmetric pressure disturbance and interfacial displacement are written as

$$[p_i, \eta] = [C_i(r), \xi]\,e^{ikz + \omega t}. \qquad (3)$$

Substituting (3) and the normal mode of $\psi_i$ into the above boundary conditions, we rewrite them in the same order of appearance:

$$(\omega + ikW_1)\xi - ik\phi_1 = 0, \qquad (a)$$

$$[\phi_i]_2^1 = 0, \qquad (b)$$

$$[W_{ir}\xi - \phi_{ir}]_2^1 = 0, \qquad (c)$$

$$[N_i B\phi_i]_2^1 - (1-Q)R\,\xi = 0, \qquad B = d^2 - d/r + k^2, \qquad (d)$$

$$[C_i - (2ik/Re)\,N_i(\phi_{ir} - \phi_i)]_2^1 + \xi(1 - k^2)We = 0, \qquad (e)$$

$$d\phi_2(l) = 0, \qquad (f)$$

$$\phi_2(l) = 0. \qquad (g)$$

The last term in (d) arises from the second derivatives of the basic flows. The pressure amplitude discontinuity in (e) can be obtained from the linearized Navier-Stokes equations, and is found to be

$$[C_i]_2^1 = \left[ Q_i\{(\omega + ikW_i)\phi_{ir} - ikW_{ir}\phi_i\} - N_i(D^2\phi_i)_r/Re \right]_2^1 (ik)^{-1}, \qquad Q_i = \rho_i/\rho_1.$$

Nontrivial solutions of (2) with its boundary conditions (a) to (g) for given flow parameters $Re$, $Fr$, $We$, $Q$, $N$, $l$ and $k$ exist only for certain eigenvalues $\omega$. The real part of $\omega$ determines the stability of the flow, and the imaginary part of $\omega$ determines the characteristic frequency of the disturbance.
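The origin of the last term in (d) can be made explicit with a short worked calculation. It assumes that the basic flow (1) can be written as $W_1 = -1 + a_1 r^2$ and $W_2 = -(a_1/N - K)(l^2 - r^2) - 2K\ln(l/r)$, where $K = (1-Q)R/(4N)$ and $a_1$ is the coefficient of $r^2$ in $W_1$; both $a_1$ and $K$ are shorthand introduced here and are not symbols used in the paper.

```latex
\begin{aligned}
W_{1rr} &= 2a_1, \\
W_{2rr} &= 2\left(\frac{a_1}{N} - K\right) - \frac{2K}{r^2}
  \;\;\Longrightarrow\;\; W_{2rr}\big|_{r=1} = \frac{2a_1}{N} - 4K, \\
\left[N_i W_{irr}\right]_2^1 &= W_{1rr} - N\,W_{2rr}\big|_{r=1}
  = 2a_1 - (2a_1 - 4NK) = 4NK = (1-Q)R .
\end{aligned}
```

Under these assumptions the jump in $N_i W_{irr}\,\xi$ across the interface is $(1-Q)R\,\xi$, which is the term carried, with a sign change, into the normal-mode condition (d).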

SOLUTION. The solution of the problem formulated in the previous section will be expanded in an orthogonal set of functions in each of the flow fields in the liquid and in the gas (the lighter incompressible fluid). The two orthogonal sets are associated with the same differential operator in (2), i.e. $D^2$, but with different domain boundaries. By use of the change of variable

$$\phi_i = r f_i, \quad (i = 1, 2),$$

we have

$$D^2\phi_i = r(L - k^2)f_i,$$

where

$$L = r^{-1}d(r\,d) - r^{-2}.$$

The orthogonal functions will be chosen among the solutions of the Bessel equation of the first order with the parameter $k_{in}$,

$$(L + k_{in}^2)F_{in} = 0, \quad (n = 1, 2, \ldots, M_i), \qquad (4)$$

where $F_{in}$ stands for $F_i(k_{in}r)$, and $M_i$ is an arbitrarily large integer. The bounded solutions of (4) which form an orthogonal set of functions in $r < 1$ are

$$F_{1n} = J_1(k_{1n}r), \qquad (5)$$

where $k_{1n}$ are the roots of

$$k_{1n}J_0(k_{1n}) - J_1(k_{1n}) = 0. \qquad (6)$$

With these values of $k_{1n}$, we have

$$\int_0^1 r F_{1m}F_{1n}\,dr = \delta^{(1)}_{mn}, \qquad (7)$$

where $\delta^{(1)}_{mn} = 0$ if $m \neq n$; if $m = n$ it is given by

$$\delta^{(1)}_{nn} = 0.5\,(k_{1n}^2 - 1)\,J_0^2(k_{1n}). \qquad (8)$$
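The orthogonality of the liquid-side basis can be verified numerically with the sketch below. It uses only the Python standard library: the Bessel functions are evaluated from Bessel's integral with a midpoint rule, the roots of (6) are bracketed by a simple scan and refined by bisection, and the integrals use Simpson's rule. These numerical choices and all function names are illustrative and are not the authors' procedure.

```python
import math

def bessel_j(n, x, m=200):
    """J_n(x) for integer n from (1/pi) * int_0^pi cos(n*tau - x*sin(tau)) dtau."""
    h = math.pi / m
    return sum(math.cos(n * (j + 0.5) * h - x * math.sin((j + 0.5) * h))
               for j in range(m)) / m

def lhs_of_6(k):
    """Left-hand side of (6): k J0(k) - J1(k)."""
    return k * bessel_j(0, k) - bessel_j(1, k)

def roots_of_6(count, start=0.5, step=0.1):
    """First `count` positive roots of (6), by sign-change scan plus bisection."""
    roots, a = [], start
    while len(roots) < count:
        b = a + step
        if lhs_of_6(a) * lhs_of_6(b) < 0.0:
            lo, hi = a, b
            for _ in range(60):
                mid = 0.5 * (lo + hi)
                if lhs_of_6(lo) * lhs_of_6(mid) <= 0.0:
                    hi = mid
                else:
                    lo = mid
            roots.append(0.5 * (lo + hi))
        a = b
    return roots

def weighted_inner(km, kn, intervals=400):
    """Simpson approximation of int_0^1 r J1(km r) J1(kn r) dr."""
    h = 1.0 / intervals
    total = 0.0
    for j in range(intervals + 1):
        r = j * h
        w = 1 if j in (0, intervals) else (4 if j % 2 else 2)
        total += w * r * bessel_j(1, km * r) * bessel_j(1, kn * r)
    return total * h / 3.0

if __name__ == "__main__":
    ks = roots_of_6(3)
    print("roots of (6):", ks)
    print("off-diagonal:", weighted_inner(ks[0], ks[1]))
    print("diagonal    :", weighted_inner(ks[0], ks[0]),
          "closed form :", 0.5 * (ks[0] ** 2 - 1.0) * bessel_j(0, ks[0]) ** 2)
```

Condition (6) is equivalent to requiring $dF_{1n}/dr = 0$ at $r = 1$, which is why the weighted integrals of distinct basis functions vanish.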

The bounded solutions of (4) which form an orthogonal set of functions in the domain $1 < r < l$ are

$$F_{2n} \equiv F_2(k_{2n}r) = J_1(k_{2n}r)\,Y_1(k_{2n}l) - J_1(k_{2n}l)\,Y_1(k_{2n}r), \qquad (9)$$

where $k_{2n}$ are the roots of

$$F_2(k_{2n}) - k_{2n}F(k_{2n}) = 0, \qquad (10)$$

$$F(k_{2n}r) = J_0(k_{2n}r)\,Y_1(k_{2n}l) - J_1(k_{2n}l)\,Y_0(k_{2n}r). \qquad (11)$$

With the values of $k_{2n}$ thus determined, we have

$$\int_1^l r F_{2n}F_{2m}\,dr = \delta^{(2)}_{nm}, \qquad (12)$$

where $\delta^{(2)}_{nm} = 0$ if $m \neq n$; otherwise it is given by the integral on the left side of the above equation with $F_{2n}$ given by (9). Note that

$$F_2(k_{2n}l) = 0. \qquad (13)$$

We now expand the eigenvector in a truncated series of the above orthogonal functions,

$$\phi_i = r\,a_{in}F_{in}, \quad (i = 1, 2), \qquad (14)$$

where the repeated index $n$ denotes summation over $n = 1$ to $n = M_i$ ($i = 1, 2$). The number of terms $M_i$ required in the two flow domains may not be the same for the required accuracy. The components of the eigenvector will be obtained by use of the Galerkin projection. The following formula, which can be derived with integration by parts, will be used repeatedly in the reduction of the Galerkin projection,

$$\int rG\,L(g)\,dr = \int rg\,L(G)\,dr - [rg\,d(G) - rG\,d(g)], \qquad (15)$$

where $g$ and $G$ are functions of $r$ and the bracket is evaluated between the limits of integration. The Galerkin projection of (2) gives

f/  rFimKL-k2-Re03,)(L-k2)fi  +  ik(viA'i)ReWi(L-k2)fi]dr  =  0  ,  (16) 

i 

where $\omega' = \omega(\nu_1/\nu_i)$. By use of (15), the orthogonality conditions, and the following relations

[rUf1)dn=|m)-rFlmdUfi)]r=Sl  =  0. 

F2<k2m'2>  =  0  • 
we  can  reduce  (16)  to 

cimnain  *  vtl^li[(dFim)t  L  a'^Flm^t1  Pi 

-  vs252i[(t2/s2)  (dF2m>t  2  Y  +  GWs  2  5  ’  <d  F2m>s2  £1  =  0 
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(m,  n=l 


(17) 


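where $\nu = \nu_i/\nu_1$, the subscripts of parentheses denote the values of $r$ at which the parenthesized functions are to be evaluated, and

$$e_{imn} = \delta^{(i)}_{mn}\left[k^2(\nu k^2 + Re\,\omega) + (2\nu k^2 + Re\,\omega)k_{im}^2 + \nu(k_{in}k_{im})^2\right] + ik\,Re\,(k^2 + k_{in}^2)\int_{s_i}^{t_i} r W_i F_{im}F_{in}\,dr - ik\,\delta_{2i}\left[(1-Q)R\,Re/N\right]\int_{s_i}^{t_i} F_{im}F_{in}\,r^{-1}\,dr,$$

$\alpha = [L(f_1)]_{t_1}$, $\beta = [dL(f_1)]_{t_1}$,

$\gamma = [L(f_2)]_{t_2}$, $\delta = [dL(f_2)]_{s_2}$, $\varepsilon = [L(f_2)]_{s_2}$.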

It is known that termwise differentiations of truncated series representations of functions do not provide as high an accuracy for the derivatives of functions as for the functions themselves. For this reason we treat $\alpha$ to $\varepsilon$ in (17), which involve derivatives higher than second, as five additional unknowns. Thus (17) is a system of $M_1 + M_2$ equations in $M_1 + M_2 + 5$ unknowns. The required additional equations are provided by the six boundary conditions (a) to (f), which contain an additional unknown $\xi$. Note that boundary condition (g) is already satisfied, because of (13).

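Substituting the series solution (14) into (a) to (f) we have

$$ikF_1(k_{1n})\,a_{1n} - [\omega + ikW_1(1)]\,\xi = 0, \qquad (a)'$$

$$[a_{in}F_{in}]_2^1 = 0, \qquad (b)'$$

$$[W_{ir}\xi - a_{in}(F_{in} + dF_{in})]_2^1 = 0, \qquad (c)'$$

$$[N_i\{(k^2 - 1)F_{in} + dF_{in} + d^2F_{in}\}a_{in}]_2^1 - (1-Q)R\,\xi = 0, \qquad (d)'$$

$$\left[\left\{(\omega + ikW_i)\,Re\,Q_i(F_{in} + dF_{in}) - ik\,Re\,Q_i W_{ir}F_{in} + N_i[(k^2 + k_{in}^2)F_{in} + (3k^2 + k_{in}^2)\,dF_{in}]\right\}a_{in}\right]_2^1 + ik\,Re\,We\,(1 - k^2)\,\xi = 0, \qquad (e)'$$

$$a_{2n}\,[dF_{2n}]_{r=l} = 0. \qquad (f)'$$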


Equation (17) and the above boundary conditions form a system of $(M_1 + M_2 + 6)$ homogeneous linear equations in the same number of unknowns. Making the following identifications

$a_{1n} = x_n$ ($n = 1$ to $M_1$),

$a_{2n} = x_{n+M_1}$ ($n = 1$ to $M_2$), $M = M_1 + M_2$,

$(\alpha, \beta, \gamma, \delta, \varepsilon, \xi) = (x_{M+1}, x_{M+2}, x_{M+3}, x_{M+4}, x_{M+5}, x_{M+6})$,

this linear homogeneous system can be written in a standard form

$$(A_{mn} + \omega B_{mn})\,x_n = 0, \quad (m, n = 1, 2, \ldots, M+6),$$

where the elements of the matrices $A_{mn}$ and $B_{mn}$ can be identified easily from (17) and the above boundary conditions. A nontrivial solution, up to an arbitrary multiplicative constant, of this system exists only if the determinant of its coefficient matrix vanishes, i.e.,

$$|A_{mn} + \omega B_{mn}| = 0. \qquad (18)$$


To construct the eigenfunctions of the original system, which is not required in this work, we need only the eigenvectors $a_{1n}$ and $a_{2n}$. The values of $\alpha$, $\beta$, $\gamma$, $\delta$, $\varepsilon$ and $\xi$ are not required. A brief explanation of the numerical computation involved in the solution of (18) is in order. All computations are carried out in double precision on a Gould PN 9780 at Clarkson and at the supercomputer facility at the Cornell Theory Center. To construct the orthogonal functions $F_i$, we solve (6) and (10) with (11) respectively for $k_{1n}$ and $k_{2n}$ with the Muller (1956) method. All integrals involved in (17), except $\delta_{mn}$ which has a closed form expression, are evaluated with the Gauss-Kronrod quadrature. For a given set of parameters ($Re$, $We$, $Fr$, $Q$, $N$, $l$) the eigenvalue $\omega$ is obtained from (18) for various values of $k$ with the method by Kaufman (1974). In this complex eigenvalue solution, $M_1$ and $M_2$ are systematically increased until the eigenvalue corresponding to the most amplified or the least damped disturbance converges to the desired significant digits.
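The root-finding step can be sketched with a textbook version of Muller's method applied to the liquid-side condition (6). This is a generic implementation and not the authors' code; the starting points, the tolerance, and the quadrature-based Bessel evaluation are assumptions made for illustration.

```python
import cmath
import math

def bessel_j(n, x, m=200):
    """J_n(x) for integer n and real or complex x, from Bessel's integral (midpoint rule)."""
    h = math.pi / m
    return sum(cmath.cos(n * (j + 0.5) * h - x * math.sin((j + 0.5) * h))
               for j in range(m)) / m

def muller(f, x0, x1, x2, tol=1e-12, max_iter=50):
    """Muller's method: fit a parabola through three points and step to its nearest root."""
    for _ in range(max_iter):
        f0, f1, f2 = f(x0), f(x1), f(x2)
        h1, h2 = x1 - x0, x2 - x1
        d1, d2 = (f1 - f0) / h1, (f2 - f1) / h2
        a = (d2 - d1) / (h2 + h1)
        b = a * h2 + d2
        disc = cmath.sqrt(b * b - 4.0 * a * f2)
        # Pick the sign that gives the larger denominator (the smaller step).
        den = b + disc if abs(b + disc) >= abs(b - disc) else b - disc
        if den == 0:
            return x2
        dx = -2.0 * f2 / den
        x0, x1, x2 = x1, x2, x2 + dx
        if abs(dx) < tol:
            return x2
    return x2

def lhs_of_6(k):
    """Left-hand side of (6): k J0(k) - J1(k)."""
    return k * bessel_j(0, k) - bessel_j(1, k)

if __name__ == "__main__":
    # The three starting points are arbitrary guesses around the first root.
    print("first root of (6):", muller(lhs_of_6, 1.5, 1.8, 2.1))
```

Because the parabola may have complex roots, the iterates are carried as complex numbers; for (6) the converged root is real to rounding error.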


RESULTS. Table 1 gives a typical example which demonstrates the convergence of the method of determining the eigenvalues for a given set of flow parameters. It is seen that as $M_1$ and $M_2$ are increased respectively from 6 and 54 to 7 and 63, the eigenvalue remains the same up to the first four significant digits. Note that when $M_1 = 7$ and $M_2 = 63$ there are 76 eigenvalues for the given set of parameters. Only the one corresponding to the most amplified disturbance is given in the table. The same convergence test was carried out in every computation of the most amplified or the least damped eigenvalues for the various sets of parameters reported in this work. Preziosi et al. (1989) used Chebyshev polynomials as base functions for their solution of the special case of zero gravity in the present problem. They required 80 terms for satisfactorily convergent results. Thus the number of terms required in the present problem with gravitational effect is slightly smaller than that required in their problem of zero gravity. A finite element method was used by Hu and Joseph (1989) in their extension of the work of Preziosi et al. The finite element method seemed to be more efficient than the collocation method. Attempts have been made to test the accuracy and convergence by doubling the number of terms in the present problem. It was found that for such a large system, the numerical error of a double precision calculation dominates the reduced truncation error.

Figure 1 shows the velocity distribution in the basic state for various values of $R$ for the given parameters $l = 10$, $N = 0.018$, and $Q = 0.0013$. These values of $N$ and $Q$ correspond to a water jet in the atmosphere at room temperature. Figure 2 plots the growth rates $\omega_r$ against the wave number of the disturbance for various values of $We$ for the set of parameters specified in the figure caption. $R = Re/Fr = 0$ signifies the absence of gravity. It is clearly seen that as $We$ is decreased from 0.01 to $10^{-4}$ in steps, the amplification rates decrease for $k < 1$. For $k > 1$ the trend is reversed, although the growth rates are relatively small. The reversal of the trend can be easily understood by looking at the last term in (e). The factor $(1 - k^2)\xi$ in this term arises from the curvature of the interface. The $\xi$-term is associated with the interfacial curvature along a direction perpendicular to the jet axis, which gives rise to the necking at $\xi < 0$ and expansion at $\xi > 0$. The $-k^2\xi$ term is of opposite sign and is associated with the curvature in the axial direction. This curvature tends to pull the displaced interface back to its basic state position. When $k < 1$, the former destabilizing pinching effect dominates the latter stabilizing effect. When $k > 1$ the role of surface tension is reversed. For the given parameters the jet instability is clearly due to the Rayleigh mode of capillary pinching, since the maximum growth rate occurs at $k < 1$. As $We$ is decreased further from 0.0001, the wavelengths corresponding to the maximum growth rates
gradually  shift  to  the  region  k>l.  This  is  examplified  by  the  curve  for  We=10‘^  in  figure  2. 

Then the instability judged by the maximum amplification rate is no longer due to capillary pinching, but due to the Taylor mode. This mode will be expounded more clearly later when gravity is taken into account. Contrary to the dramatic effect of surface tension on the Rayleigh mode, the air viscosity has little effect on this mode. The destabilizing effect of the basic state shear rate is significant for shorter waves for which $k > 1$. Comparisons between our theoretical results and the experimental results of Goedde and Yuen (1970) and those of Donnelly and Glaberson (1966) are made in figure 3. Unfortunately, the values of $We$ and $Re$ corresponding to the experimental points were not reported. Two amplification curves were obtained from our theory with the parameters corresponding to the lower range of the jet velocity reported by Goedde and Yuen. Rayleigh's amplification curve is also included in this figure for comparison. While the slope of Rayleigh's curve has a discontinuity at $k = 1$, and the jet is neutrally stable for $k > 1$, our curve is continuous in slope and gives negative $\omega_r$ for $k > 1.1$. The good agreement between the experiments and both our curve for $Re = 3000$ and $We = 0.0013$ and Rayleigh's curve, which is independent of $We$ at $Re = \infty$, is probably fortuitous. It has already been shown that the amplification curves depend very sensitively on $We$, although less so on other parameters. For a better comparison with theories, complete records of all relevant parameters ($l$, $Q$, $We$, $Re$, $N$) for each observation of ($\omega_r$, $\omega_i$, $k$) are needed. Figure 4 further demonstrates the stabilizing effect of the interfacial tension. As $We$ is decreased to values much smaller than $Q$, both the amplification rates and the wave number of the unstable spectrum are increased dramatically. It is seen that the most unstable disturbances of Taylor's atomization mode are of wavelength several orders of magnitude smaller than the jet radius. Moreover, the wavelengths near the maximum growth rates of the amplification curves all scale with the capillary length $a = 2\pi S/(\rho_2 W_0^2)$. This can be verified by showing that the following equation is satisfied with the values of the wave number $k_m$, corresponding to the maximum growth rate, taken from each curve of figure 10,

$$2\pi R_1/k_m = a = 2\pi S/(\rho_2 W_0^2) = 2\pi R_1(We/Q).$$
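The scaling relation implies $k_m = Q/We$ for the dimensionless wave number of the most amplified Taylor-mode disturbance. A minimal numerical illustration follows; the value $We = 1.3 \times 10^{-5}$ is a hypothetical choice made only to show the arithmetic and is not taken from the figures, and the function names are illustrative.

```python
import math

def most_amplified_wavenumber(We, Q):
    """k_m implied by 2*pi*R1/k_m = 2*pi*R1*(We/Q), i.e. k_m = Q/We (dimensionless)."""
    return Q / We

def wavelength_over_jet_radius(We, Q):
    """Capillary length a divided by the jet radius R1: a/R1 = 2*pi*We/Q."""
    return 2.0 * math.pi * We / Q

if __name__ == "__main__":
    Q = 0.0013     # gas to liquid density ratio quoted for a water jet in air
    We = 1.3e-5    # hypothetical Weber number, much smaller than Q
    print("k_m    =", most_amplified_wavenumber(We, Q))
    print("a / R1 =", wavelength_over_jet_radius(We, Q))
```

For $We \ll Q$ the predicted wavelength is a small fraction of the jet radius, in line with the droplet sizes attributed to the atomization mode.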


Recall that in the Rayleigh mode, the most amplified waves scale with $R_1$ in length. Contrary to the situation in the Rayleigh mode, the air viscosity has a more significant effect on the atomization mode, as can be seen in figure 5. When $N$ is increased from 0.0018 to 0.018 the disturbances for which $k < 23$ are damped while the disturbances for which $k > 23$ are amplified. This seems to reflect the fact that the enhancement of the amplification rate due to the relative increase in gas viscosity more than compensates for the decrease in the damping rate due to the relative decrease in the liquid viscosity for shorter waves such that $k > 23$. The reverse is true for longer waves for which $k < 23$. This also reveals the crucial role played by the gas shear stress in the generation of small droplets. Neglecting the gas viscosity, Lin and Kang (1987) and Lin and Creighton (1990) showed that only pressure fluctuation can generate short waves scaling with the capillary length. It is clear now that the interfacial shear and pressure fluctuations are equally capable of generating short waves scaling with the capillary length. This view is further substantiated by figure 12, which shows qualitatively the same behavior as figure 10 when the ratio of inertia force relative to viscous force is raised respectively by raising the value of $Q$ and $Re$. Figure 7 shows the destabilizing effect of the basic state shear rate on the Taylor mode. A large basic state shear rate at the interface requires a large shear stress fluctuation when the interface fluctuates from the unperturbed cylindrical surface, in order to satisfy the condition of vanishing shear force at the interface (cf. Hinch, 1984; Kelly et al., 1989). This large shear stress fluctuation inevitably brings about a large pressure fluctuation, and causes the growth rate to increase. In contrast to the case of the Rayleigh mode, the radius ratio $l$ has a very significant effect on the Taylor mode (Fig. 8). A decrease in $l$ brings about a larger basic state shear rate, which again results in an increase in the growth rate. Unlike the case of $Q = 1$, $We = 0$, and $N < 1$ investigated by Joseph et al., we did not find stability near $k = 0$ when $l \to 1$ for finite values of $We$ and $Q \ll 1$. However, when we put $We = 0$, $Q = 1$, $N = 0.5$, $R = 0$ and $Re = 27.2$ we did find that the jet is stable for $k > 6$ and $k < 0.7$. This is consistent with the results of Joseph et al. (1984). Hooper and Boyd (1987) and Renardy (1985) also found a similar stable region for planar Couette flow of two superposed fluids of different viscosities but of the same density.

DISCUSSION.  It  should  be  pointed  out  that  the  instability  waves  near  k=0  in  the 
present  work  are  not  of  the  type  of  Yih  (1967),  since  Yih’s  long  shear  waves  are  not  apparent 
when N<1 (cf. Hooper & Boyd, 1987). The stability analysis of a viscous liquid jet in an
ambient gas reveals that there are two distinct mechanisms of jet breakup. The first is that of
Rayleigh  mode  by  capillary  pinching,  and  the  second  is  that  of  the  Taylor  mode  by  interfacial 
shear  and  pressure  fluctuations.  The  theory  is  not  yet  fully  substantiated  by  experiments.  The 
present  theory  predicts  that  the  growth  rate  of  disturbances  increases  significantly  with  the  Weber 
number  as  it  should,  since  the  instability  is  due  to  capillary  pinching.  Unfortunately,  the  known 
experiments  in  the  Rayleigh  mode  regime  failed  to  record  the  values  of  relevant  parameters 
including  We  for  each  experimental  point.  Only  the  ranges  of  velocity,  temperature,  and  jet 
diameter  were  reported.  Thus  only  the  ranges  of  the  parameters  encountered  in  experiments  can 
be estimated. This deprives us of a more complete comparison. Consequently the apparent
agreement between experiments and the present theory with We = 0.0013 and Re = 3000, and
with the Rayleigh theory, remains fortuitous. This value of We and the values of the rest of the
parameters  used  in  figure  7  are  in  the  lower  end  of  the  parameter  range  estimated  from  the 
reported  experimental  data.  It  is  possible  that  most  of  the  experimental  points  were  obtained  in 
the  lower  range  of  the  parameters  encountered  in  experiments.  The  theoretical  results  on  the 
Taylor  mode  are  only  qualitatively  substantiated  by  the  experiments  of  Reitz  and  Bracco  (1982). 
The  average  diameters  of  their  atomized  droplets  seem  to  all  scale  with  the  capillary  length  as 
predicted by our theory. Careful measurements of ($\omega_r$, $\omega_i$, k) for various given sets (We, Re, Q,
N, l) are required for a better comparison with the theory for both modes. There may exist other
modes  of  instability  in  the  parameter  ranges  not  considered  in  this  work.  A  possible  third  mode 
which may correspond to a dripping jet (cf. Lin and Lian, 1989) is yet to be explored by
considering the convective and absolute instabilities of spatially growing disturbances when
We ≫ Q. The known analyses of absolute and convective instabilities of a jet all ignore the effect
of gas viscosity (Leib and Goldstein, 1986; Lin and Lian, 1989). Blennerhassett (1980) showed
that  Tollmien-Schlichting  waves  are  more  stable  than  the  interfacial  waves  in  two  superposed 
viscous  fluids  flowing  over  a  plane.  The  same  situation  appears  to  happen  here.  The 
Tollmien-Schlichting  wave  will  probably  not  appear  until  Re  is  raised  to  a  value  much  greater 
than  those  considered  in  this  work. 

While  the  present  analysis  also  applies  to  the  case  of  N>1,  the  computation  for  this  case 
has  not  yet  been  carried  out.  The  extension  of  the  present  analysis  to  the  case  of 
non-axisymmetric  disturbances  is  quite  straightforward.  The  nonlinear  stability  analysis  of  the 
linearly  unstable  disturbances  described  in  this  work  will  be  useful  for  many  industrial  processes 
which  utilize  the  mechanisms  of  the  jet  breakup  either  in  Taylor’s  atomization  mode  or 
Rayleigh’s  ink-jet  mode. 

This work was supported in part by Grant No. DAAL03-89-K-0179 of ARO, Grant No.
MSM-8817372 of NSF and a New York State Science Grant. The computation was carried out
with the computer facility at Clarkson University and with the Cornell National Computer
facility, which is funded by the NSF, the State of New York, and IBM Corporation.


        m2      $\omega_r$      $\omega_i$
3       15      0.0238          0.7003
5       35      0.0239          0.7013
5       45      0.0239          0.7017
6       54      0.0239          0.7021
7       63      0.0239          0.7021

Table 1. Convergence to the most amplified mode. k=0.7, We = 0.0025, Re = 400.0, Re/Fr = 0.0, Q = 0.0013, l = 10.0, N = 0.018
Abstract

This paper investigates communication and control in SPMD parallel computations, and
introduces the Events/Threads of control (E/T) model, which allows qualitative and quantitative
analysis of parallel execution. The principal component of the E/T model is the characteristic
function g(P), which relates the number of events to the number P of threads of control (usually
processors). Many properties of a computation follow from the behavior of g(P), such as limits
on the potential speedup. The model includes the effects of reads and writes in communication,
algorithmic blocking, work intensity, etc. It is most appropriate for SPMD (Single Program
Multiple Data) computations that are common in scientific applications. An experiment is
described briefly which relates the detailed behavior of a parallel computation, observed
through monitoring, to the high level characterization provided by the E/T model.


1  Overview 

The E/T model describes a parallel computation C as a collection of P threads of control and E
events. Informally, a thread of control is an agent capable of performing some work on behalf of C, and
an event is an explicit action performed by a thread of control in order to coordinate its activity with
other threads of control. In a wider sense, an event is a change of state of a thread of control.

Modeling and analysis of numerical problems which lend themselves to the Single Program
Multiple Data (SPMD) paradigm are the focus of our investigation [11], [12]. Communication
and control latency can strongly influence the performance of these computations, and we use the
E/T model to analyze this influence. Informally, an SPMD computation is performed whenever all
processing elements (PEs) of a parallel machine execute the same program on different data. SPMD
computations lead to a collection of similar threads of control; therefore their modeling and analysis
seem an easier task than the analysis of non-homogeneous computations with a large number of
unrelated threads of control.

The E/T model can be used for qualitative analysis of a parallel computation C, an analysis based
upon the study of the characteristic function g, which relates the number of events, E, and the
number of threads of control, P, of C: E = g(P). An optimal parallel computation with P threads of
control is a computation characterized by a linear function g(P). If g(P) cannot be expressed as a
polynomial then there is little hope that C will ever be performed efficiently. Consider two parallel
computations $C_1$ and $C_2$ with P threads of control which represent two different implementations of
an algorithm A or implementations of two different algorithms $A_1$ and $A_2$ which perform the same
task. If the characteristic functions of $C_1$ and $C_2$ are in the relation $g_1(P) < g_2(P)$ for $P_1 < P < P_2$
then we have a high degree of confidence that $C_1$ performs better than $C_2$ in the range $P_1 < P < P_2$
for a wide variety of parallel architectures.

Whenever  more  information  about  a  parallel  computation  C  is  available,  for  example  when  the 
sequence  of  events  occurring  in  a  thread  of  control  can  be  identified  or  when  the  characteristics 
of the parallel machine executing C are known, then the E/T model is capable of providing more
accurate  assessments  about  the  expected  performance  of  C.  A  quantitative  analysis  can  only  be 
carried  out  if  the  threads  of  control  exhibit  some  form  of  invariance  to  data  dependencies,  in  other 
words  if  data  dependencies  can  only  alter  the  timing  but  not  the  order  of  events  in  a  thread  of 
control. 

A first type of quantitative analysis is a static analysis. This is an analysis of the mapping from
a directed acyclic graph, $\mathcal{D}$, to a parallel computation C. The E/T model is used to determine the
computation  and  communication  workload.  The  computation  workload  can  be  analyzed  at  different 
levels,  e.g.,  the  amount  of  computation  between  two  consecutive  events,  the  workload  per  thread 
of  control,  and  the  total  workload  of  C.  Similarly,  communication  workload  can  be  characterized 
by  the  amount  of  data  transferred  during  a  single  event,  the  amount  of  data  transferred  per  thread 
of  control,  and  the  total  amount  of  data  transferred  at  the  computation  level.  The  effects  of 
synchronization  and  blocking  are  not  captured  by  the  static  analysis. 

A  second  type  of  quantitative  analysis  is  the  dynamic  analysis  concerned  with  schedules  which 
associate  times  with  events.  At  this  stage  a  detailed  knowledge  of  the  hardware  is  necessary  in  order 
to  determine  the  time  required  to  perform  computations  and  the  time  to  send  and  receive  data. 
Alternatively, performance monitors and execution traces for a selection of data may provide
sufficient knowledge to carry out a dynamic analysis. This analysis reveals the effects of synchronization
and  blocking. 

Static analysis is often amenable to an analytical approach, but dynamic models only seldom
lead to a tractable analysis. Often dynamic models can be constructed only through monitoring
the actual execution of C on a particular parallel system.

2  Qualitative  Analysis  Of  Parallel  Computation  in  the  E/T  Model 

2.1  Basic  assumptions 

We propose a model for parallel computing based upon events and threads of control, the E/T
model. A parallel computation C with P threads of control and E events is described by its
characteristic function g defined by E = g(P). The model is based upon two assumptions:

(a) Conservation of work. Any work required by a computation C(1) with one thread of control
has to be performed by one of the threads of control of C(P), the parallel computation with
P threads of control.

(b) W(P), the work required by a parallel computation, is an increasing function of the number of
threads of control, P.

The first assumption needs little justification. It is an immediate consequence of the view that
a thread of control is an agent performing some work on behalf of C. To carry out a computation
with P threads of control simply means to redistribute in some fashion the work which otherwise
would be carried out by only one thread. Call this constant amount of work reflecting the work
conservation principle $W_{cons}$.

The second assumption is supported by the following arguments. An event is associated with
every communication and control act. Any thread of control needs to communicate with other
threads at least at the instant when it is initiated, when some work is assigned to it, and at the
termination time, when it has to communicate its results. It follows that g(P) is an increasing
function of P. Moreover, any event requires a small amount of additional work, say $\theta$, to be carried
out by the thread of control when the event occurs. Let $W_{cc}(P)$ denote the additional amount of
work required by C(P) for communication and control. The previous arguments show that $W_{cc}(P)$
satisfies

$$W_{cc}(P) \ge \theta \times E = \theta \times g(P), \tag{2.1}$$

where the lower bound is an increasing function of P. Thus, while $W_{cc}(P)$ might not increase monotonically, it is
plausible to assume that the variations from the trend are small and that $W_{cc}(P)$ is increasing.
But W(P), the work carried out by C(P), consists of at least two components: the first one, $W_{cons}$,
independent of P, and the second one, $W_{cc}(P)$, an increasing function of P:

$$W(P) = W_{cons} + W_{cc}(P). \tag{2.2}$$

A  parallel  computation  C  with  P  threads  of  control  is  considered  to  be  optimal  iff  E  =  O(P), 
the  number  of  events  in  C  is  linear  in  P. 

Some of the algorithms we have encountered exhibit a convex characteristic function g(P). We
show that if the characteristic function g(P) is convex then the speedup has a maximum for some
finite $P = P_{smax}$ and it is a concave function for $P > P_{smax}$.

2.2  Threads  of  control  and  events 

The  basic  idea  of  the  model  is  to  describe  a  parallel  computation  C  in  terms  of  threads  of  control  and 
events.  Important  properties  of  C  are  its  duration  T  and  work  intensity  w(t).  The  work  intensity  is 
the  actual  measure  of  work  performed  as  a  function  of  time,  e.g.,  operations  per  second.  The  work 
associated  with  C  is 


$$W = \int_0^T w(t)\,dt. \tag{2.3}$$

In view of the previous discussion the work intensity $w^i(t)$ associated with thread $\phi^i$ has two
components

$$w^i(t) = w^i_{cons}(t) + w^i_{cc}(t) \tag{2.4}$$

where $w^i_{cons}(t)$ is from the work assigned to the thread by virtue of the work conservation principle,
and the second one, $w^i_{cc}(t)$, represents the work intensity for communication and control. Note that
$w^i_{cons}(t)$ and $w^i_{cc}(t)$ cannot be non-zero simultaneously.

The duration T of C(P) is expected to depend upon the number P of threads of control of C(P).
The work performed by the ith thread, $\phi^i$, is

$$W^i = \int_0^T w^i(t)\,dt = \int_0^T w^i_{cons}(t)\,dt + \int_0^T w^i_{cc}(t)\,dt. \tag{2.5}$$

The total work required by C(P) is thus

$$W(P) = \sum_{i=1}^{P} W^i = \sum_{i=1}^{P} \int_0^T w^i(t)\,dt. \tag{2.6}$$

The thread $\phi^i$ can be in one of two states at time t: active if $w^i_{cons}(t) > 0$, and suspended if
$w^i_{cons}(t) = 0$. When the thread $\phi^i$ is suspended then it can be either communicating if $w^i_{cc}(t) > 0$,
or blocked if $w^i_{cc}(t) = 0$, as shown in Figure 1 (which is explained later).

A parallel computation C(P) may have several threads of control $\phi^i$ active at any given time
t. Call $\nu_{act}(t)$ the number of threads active, $\nu_{cc}(t)$ the number of threads communicating and $\nu(t)$
the number of threads non-blocked, either active or communicating, at time t. Note that $\nu(t)$ is
sometimes called the profile of the parallelism [14]. Clearly

$$\nu(t) = \nu_{act}(t) + \nu_{cc}(t) \tag{2.6a}$$

and

$$1 \le \nu(t) \le P \quad \text{for } 0 \le t \le T(P). \tag{2.7}$$

We say that the system changes its state at time t if $\nu_{act}(t - \epsilon) \ne \nu_{act}(t + \epsilon)$ for any sufficiently small positive $\epsilon$.
To mark the change of state, we say that an event e(t) has occurred at time t. If thread $\phi^i$ has
changed state at time t, we denote the event by $e^i(t)$. Note that we make the following convention:
an event is associated only with the transition from active to suspended state. The duration of an
event is equal to the time spent by the thread in the suspended state.
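
The definitions above can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that each thread is recorded as a list of contiguous (start, end, state) intervals; the names `Timeline`, `state_at`, `profile` and `events` are hypothetical and not part of the E/T model itself.

```python
from typing import List, Tuple

# One timeline per thread: contiguous (start, end, state) intervals, where
# state is "active", "communicating" or "blocked" (assumed representation).
Timeline = List[Tuple[float, float, str]]


def state_at(timeline: Timeline, t: float) -> str:
    """State of one thread at time t; intervals are half-open [start, end)."""
    for start, end, state in timeline:
        if start <= t < end:
            return state
    raise ValueError("t lies outside the timeline")


def profile(timelines: List[Timeline], t: float) -> Tuple[int, int, int]:
    """Return (nu_act, nu_cc, nu) at time t, with nu = nu_act + nu_cc as in (2.6a)."""
    states = [state_at(tl, t) for tl in timelines]
    nu_act = states.count("active")
    nu_cc = states.count("communicating")
    return nu_act, nu_cc, nu_act + nu_cc


def events(timeline: Timeline) -> List[Tuple[float, float]]:
    """List (time, duration) for each active -> suspended transition of a thread.

    The duration is the time spent suspended (communicating or blocked)
    before the thread becomes active again.
    """
    result = []
    i = 0
    while i < len(timeline):
        if timeline[i][2] == "active":
            j = i + 1
            # Skip over the suspended period that follows the active interval.
            while j < len(timeline) and timeline[j][2] != "active":
                j += 1
            if j > i + 1:
                t_suspend = timeline[i][1]
                t_reactivate = timeline[j - 1][1]
                result.append((t_suspend, t_reactivate - t_suspend))
            i = j
        else:
            i += 1
    return result
```

With this convention a thread that is initially suspended, waiting for work to be assigned, contributes no event until its first transition from active to suspended.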

For the sake of convenience we consider that all P threads of control are created at time t = 0
and exist until time t = T(P). In addition, we assume that there are two intervals of time when
only one thread of control is active, $\nu(t) = 1$ for $0 \le t \le t_s$ and for $T(P) - t_e \le t \le T(P)$. The
times $t_s$ and $t_e$ are called start parallel and end parallel times, respectively. At $t_s$ the thread of
control active initially, $\phi^1$, explicitly performs an action to assign a part of the work to a thread $\phi^2$,
which changes its state from suspended to active; $\phi^1$ is called a parent of $\phi^2$. This process has to
be repeated at least P times, such that each thread must become active at least once.

In  case  of  a  serial  computation,  only  one  thread  of  control  is  active  at  any  time  t.  Without  loss 
of generality, we assume that a serial computation, C(1), has only one thread of control active at
any  time  t. 

In a parallel computation C(P) changes of state occur due to the need for communication and
control. Such communication must take place at least once during the lifetime of $\phi^i$, otherwise
$\phi^i$ would not be able to coordinate its work with other threads. Communication between two
threads of control, $\phi^i$ and $\phi^j$, takes place as the sender, say $\phi^i$, performs an explicit action of
making available private information, and the receiver, say $\phi^j$, performs an explicit action to access
this information. The terms sender and receiver are considered in the sense of information theory
and the E/T model is not concerned with the mechanisms used for communication. Sending and
receiving may be performed in different ways, such as by message passing or by accessing shared
data.

Every time a thread $\phi^i$ performs an explicit action for communication or control, our model
assumes the behavior illustrated in Figure 1. Note that the workload intensities associated with
the thread $\phi^i$ exhibit the following behavior:

$$\begin{aligned}
w^i_{cons}(t) &> 0 && \text{for } t \le t_{suspend} \text{ and } t \ge t_{reactivate}, \\
w^i_{cons}(t) &= 0 && \text{for } t_{suspend} < t < t_{reactivate}, \\
w^i_{cc}(t) &> 0 && \text{for } t_{suspend} < t < t_{block} \text{ and } t_{resume} < t < t_{reactivate}.
\end{aligned}$$

Figure 1: The states of the thread of control $\phi^i$ when an event $e^i(t_{suspend})$ occurs.


The additional work for communication and control, $\theta$ in (2.1), reflects the work associated
with the periods when $w^i_{cc}(t)$ is non-zero. A blocking period may occur only for some events.
For  example,  in  a  message  passing  system,  an  asynchronous  write  operation  does  not  experience 
blocking,  while  a  synchronous  read  may  experience  blocking  if  the  data  has  not  been  received  yet. 
In  a  shared  memory  system,  both  reading  and  modifying  a  shared  data  element  may  experience 
blocking. 

It is difficult to predict the duration of a blocking period; therefore, knowing that an algorithm
for matrix multiplication requires, say, $O(n^2/p^{2/3})$ communication steps for two n x n matrices,
using p processors [1], does not translate easily into statements concerning communication time.


2.3  W,T  characterization  of  a  parallel  computation 

Statements about a computation C can be made when the amount of work, W, and the time
T required by C are known. To simplify the discussion, let us assume that the work intensity
associated with thread $\phi^i$ is constant when the thread is not blocked:

$$w^i(t) = \begin{cases} I & \text{if } \phi^i \text{ is active or communicating} \\ 0 & \text{if } \phi^i \text{ is blocked.} \end{cases} \tag{2.9}$$

In this case, if C is performed using a serial execution, i.e., as C(1), with only one thread as shown
in Figure 2a, then there is a clear relationship between $W(1) = W_{cons}$ and T(1), the execution time
with one thread only:

$$W(1) = T(1) \cdot I. \tag{2.10}$$

The relationship between W and T is less obvious in case of multiple threads of control, as shown
in Figures 2b and 2c, where two alternative computations $C^A(2)$ and $C^B(2)$ are used to perform C.
We observe that

$$W(1) < W^A(2) < W^B(2), \tag{2.11}$$

but

$$T^B(2) < T^A(2). \tag{2.12}$$

Even the simple question of which one of the two variations of C(2), $C^A(2)$ or $C^B(2)$, is better
cannot be answered unambiguously, as $C^B(2)$ requires less time, but more work than $C^A(2)$.

The relationship between W(P) and T(P) is explored next. Consider the case described by
equation (2.9). Then the work intensity can be expressed as

$$w(t) = w^i(t) \cdot \nu(t) = I \cdot \nu(t) \tag{2.13}$$
Figure 2: (a) Sequential computation $C(1) = \langle W(1), T(1) \rangle$. (b) Computation
reorganized to be $C^A(2) = \langle W^A(2), T^A(2) \rangle$ with two threads of control. (c)
Computation reorganized in a different way to be $C^B(2) = \langle W^B(2), T^B(2) \rangle$ with
two threads of control. Solid lines represent work periods and dotted lines blocked
periods.


with $\nu(t)$ the number of threads of control non-blocked at time t. The work W(P) associated with
C(P) can be expressed as

$$W(P) = \int_0^{T(P)} w(t)\,dt = I \int_0^{T(P)} \nu(t)\,dt. \tag{2.14}$$

Define the expected number of threads non-blocked (active or communicating) at time t as

$$\bar{\nu}(P) = \frac{1}{T(P)} \int_0^{T(P)} \nu(t)\,dt. \tag{2.15}$$

From (2.14) and (2.15) it follows that

$$T(P) = \frac{W(P)}{I} \cdot \frac{1}{\bar{\nu}(P)}. \tag{2.16}$$

Similarly

$$w_{cons}(t) = w^i_{cons}(t) \cdot \nu_{act}(t) = I \cdot \nu_{act}(t) \tag{2.17}$$

with $\nu_{act}(t)$ the number of threads of control active at time t.
The work $W_{cons}$ can be expressed as

$$W_{cons} = \int_0^{T(P)} w_{cons}(t)\,dt = I \int_0^{T(P)} \nu_{act}(t)\,dt. \tag{2.18}$$

Define the expected number of threads active at time t as

$$\bar{\nu}_{act}(P) = \frac{1}{T(P)} \int_0^{T(P)} \nu_{act}(t)\,dt. \tag{2.19}$$

Then we have

$$W_{cons} = I \, T(P) \, \bar{\nu}_{act}(P). \tag{2.20}$$

But $W_{cons} = W(1) = I \, T(1)$, hence

$$\frac{T(1)}{T(P)} = \bar{\nu}_{act}(P). \tag{2.21}$$
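
As a numerical illustration of the averages used in this section, the following Python sketch evaluates the time averages of piecewise-constant profiles and the quantities that follow from them under the constant-intensity assumption. The representation of a profile as (duration, value) pairs and the names `time_average` and `summarize` are assumptions made here for illustration only.

```python
def time_average(profile, total_time):
    """Time average of a piecewise-constant profile.

    `profile` is a list of (duration, value) pairs covering [0, T(P)];
    the result is (1/T(P)) times the integral of the profile.
    """
    covered = sum(duration for duration, _ in profile)
    if abs(covered - total_time) > 1e-9:
        raise ValueError("profile does not cover [0, T(P)]")
    return sum(duration * value for duration, value in profile) / total_time


def summarize(nu_profile, nu_act_profile, total_time, intensity=1.0):
    """Work, serial time and speedup implied by the two profiles."""
    nu_bar = time_average(nu_profile, total_time)          # non-blocked threads
    nu_act_bar = time_average(nu_act_profile, total_time)  # active threads
    return {
        "W(P)": intensity * total_time * nu_bar,        # total work
        "W_cons": intensity * total_time * nu_act_bar,  # conserved work
        "T(1)": total_time * nu_act_bar,                # serial execution time
        "speedup": nu_act_bar,                          # T(1) / T(P)
    }
```

For a hypothetical two-thread run of duration 10 in which the active count is 1, 2, 0, 1, 2 over intervals of length 1, 3, 1, 3, 2, the average number of active threads is 1.4, so the serial time would be 14 time units.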

2.4  The  expected  amount  of  work  per  thread  of  control 

To study the asymptotic behavior of a parallel computation C when P, the number of threads of
control, increases, we first investigate the behavior of the function

$$w(P) = \frac{W(P)}{P}. \tag{2.22}$$

Consider first computations C with E = O(P) where each thread of control $\phi^i$ experiences only a
few communication events, in addition to the events to initialize and terminate $\phi^i$. An example
of such a computation is a plotting computation where each thread operates in isolation upon its
private data to create its part of the plot and makes the results available at the end. In this case

$$w(P) = \frac{W_{cons}}{P} + O(1). \tag{2.23}$$

For  such  parallel  computations  the  expected  amount  of  work  per  thread  of  control  is  a  monotonically 
decreasing  function  of  the  number  of  threads  of  control  as  shown  in  Figure  3. 

Consider now parallel computations with $E = O(P^2)$, for example when each thread of control
communicates with every other thread of control during its lifetime. In this case the asymptotically
expected amount of work per thread of control is

$$w(P) = \frac{W(P)}{P} = \frac{W_{cons}}{P} + O(P). \tag{2.24}$$

The amount of work per thread of control exhibits a minimum for a certain $P_{opt}$ and it is a
monotonically increasing function of P when $P > P_{opt}$. Clearly, $P_{opt}$ increases as $W_{cons}$ increases.
For a given $\delta$ the range of P such that $w(P) - w(P_{opt}) < \delta$ is usually fairly large: w(P) is relatively
flat around its minimum. This case is illustrated in Figure 4.
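
The flatness of w(P) around its minimum can be checked with a short Python sketch. It assumes equality in (2.1), so that W(P) = W_cons + theta * g(P), and takes g(P) = c * P**2 as a representative quadratic characteristic function; the function names and the sample values are hypothetical.

```python
import math


def work_per_thread(p, w_cons, theta, g):
    """w(P) = (W_cons + theta * g(P)) / P, assuming W_cc(P) = theta * g(P)."""
    return (w_cons + theta * g(p)) / p


def p_opt_quadratic(w_cons, theta, c=1.0):
    """Minimizer of w(P) = W_cons / P + theta * c * P (P treated as real)."""
    return math.sqrt(w_cons / (theta * c))


def flat_range(w_cons, theta, g, p_opt, delta, p_max):
    """Integers P in [1, p_max] for which w(P) - w(p_opt) < delta."""
    w_min = work_per_thread(p_opt, w_cons, theta, g)
    return [p for p in range(1, p_max + 1)
            if work_per_thread(p, w_cons, theta, g) - w_min < delta]
```

With the illustrative values W_cons = 10^4 and theta = 1, the minimum lies at P = 100 with w = 200, and every P between 73 and 137 stays within delta = 10 of that minimum.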

If $g(P) = O(P^n)$ with $n \ge 3$ then w(P) increases rapidly with P and massive parallelism is
unlikely to be advantageous unless $W_{cons}$ is enormous.

In conclusion, w(P) provides a useful signature of C. This signature indicates that massive
parallelism is truly advantageous only when E = O(P). In this case w(P) is a monotonically
decreasing function of P so that if reasonable load balancing is achieved among the threads of control
then the processors are used efficiently. When $E = O(P^2)$ there exists an optimum number
of threads of control which minimizes the expected workload per thread, and w(P) is relatively flat
around that minimum. If the characteristic function E = g(P) is either a polynomial of degree
$n \ge 3$ or has a similar type of behavior, then w(P) exhibits a minimum for a lower value of $P_{opt}$ and
$w(P_{opt})$ is higher than in the previous case. The efficiency of computations in this class is rather
sensitive to the choice of P, the number of threads of control.




Figure 3: The expected work per thread of control, w(P), as a function of the number
of threads of control, P, for a parallel computation of a fixed problem size with
E = O(P) according to equation (2.23).


Figure 4: The expected work per thread of control, w(P), as a function of the number
of threads of control, P, for a parallel computation of fixed problem size with
$E = O(P^2)$ according to equation (2.24).


2.5  The  speedup 

The speedup S(P) is defined as the ratio of the computation time with one thread of control to
the computation time with P threads, P > 1, that is

$$S(P) = \frac{T(1)}{T(P)}. \tag{2.25}$$

First observe that, according to (2.21),

$$S(P) = \bar{\nu}_{act}(P). \tag{2.26}$$

Similar results have been reported, see for example [3], but in the framework of the E/T model,
the speedup is equal to the expected number of threads active, performing work assigned by virtue of
the conservation law. The speedup does not exceed $\bar{\nu}$, the expected number of threads running (active
or communicating). Since $\bar{\nu}_{act} \le \bar{\nu} \le P$, it follows that

$$S(P) \le P. \tag{2.27}$$

Consider now the asymptotic behavior of S(P). From (2.16) and (2.20) it follows that

$$S(P) = \bar{\nu}(P) \, \frac{W_{cons}}{W(P)}. \tag{2.28}$$


We introduce the efficiency, b(P), as the ratio between the expected amount of work per thread
of control using P threads, w(P) = W(P)/P, and the work $W(1) = W_{cons}$ using one thread
(sequential execution), that is

$$b(P) = \frac{W(P)}{P \, W(1)}. \tag{2.29}$$

Note that $W(P) \ge W(1)$. Hence

$$b(P) \ge 1/P. \tag{2.30}$$

The expected fraction a(P) of non-blocked threads in C(P) is given by

$$a(P) = \frac{\bar{\nu}(P)}{P}, \qquad 0 \le a(P) \le 1. \tag{2.31}$$

Then we have

$$S(P) = \frac{a(P)}{b(P)}. \tag{2.32}$$

The study of the asymptotic behavior of S(P) when P becomes very large is reduced to the problem
of the asymptotic behavior of a(P) and b(P). From the definitions of a(P), b(P) and W(P), the
following conclusions can be drawn:

(a) For parallel computations with g(P) = O(P), we have b(P) = 1/P + constant for large P and
hence $S(P) \le$ constant for a large number of threads of control.

(b) For parallel computations with $g(P) = O(P^n)$ with $n \ge 2$, b(P) is an increasing function of P
and hence S(P) tends to zero asymptotically.
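
Conclusions (a) and (b) can be checked numerically with a minimal Python sketch. It assumes equality in (2.1), so that W(P) = W(1) + theta * g(P), and takes a(P) = 1 (no blocking) as the most favorable case; the function names and sample values are hypothetical.

```python
def efficiency_b(p, w1, theta, g):
    """b(P) = W(P) / (P * W(1)), with W(P) = W(1) + theta * g(P) assumed."""
    return (w1 + theta * g(p)) / (p * w1)


def speedup(p, w1, theta, g, a=1.0):
    """S(P) = a(P) / b(P); a is the expected fraction of non-blocked threads."""
    return a / efficiency_b(p, w1, theta, g)
```

For g(P) = c * P the speedup stays below W(1) / (theta * c) however large P becomes, while for g(P) = P**2 it decays roughly like W(1) / (theta * P).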
Let us now consider the case of scaled execution [5] where the computation size increases linearly
with the number of processors (threads of control) used, namely

$$W(1) = O(P), \qquad T(1) = O(P). \tag{2.33}$$

Scaled speedup SS(P) is defined for scaled execution by equation (2.25). The quantities a(P) and
b(P) are analogously defined and relations (2.27) through (2.32) hold. The asymptotic behaviors
of b(P) and SS(P) in this case are as follows:

(sa) For parallel computations with g(P) = O(P), SS(P) is an
increasing function of P.

(sb) For parallel computations with $g(P) = O(P^2)$, $SS(P) \le$ constant
for large P.

(sc) For parallel computations with $g(P) = O(P^n)$ and with $n \ge 3$,
SS(P) tends to zero for large P.
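
The three regimes (sa), (sb) and (sc) can be reproduced with a small Python sketch. It assumes, for illustration only, that the scaled sequential work is exactly W(1) = kappa * P, that W(P) = W(1) + theta * g(P), and that a(P) = 1; `kappa` and `scaled_speedup` are hypothetical names.

```python
def scaled_speedup(p, kappa, theta, g, a=1.0):
    """SS(P) = a(P) / b(P) for scaled execution.

    Assumes W(1) = kappa * P (problem size grows linearly with P) and
    W(P) = W(1) + theta * g(P); a is the fraction of non-blocked threads.
    """
    w1 = kappa * p
    b = (w1 + theta * g(p)) / (p * w1)
    return a / b
```

Under these assumptions SS(P) equals kappa * P**2 / (kappa * P + theta * g(P)), which grows linearly for g(P) = c * P, approaches kappa / theta for g(P) = P**2, and decays for g(P) = P**3.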

It seems reasonable to question whether scaled execution and parallel computations with g(P) =
O(P) are compatible with one another. A computation is called embarrassingly parallel if

$$W(P) = W(1) + \text{constant}, \qquad \nu(t) = P \ \text{ for } t_0 \le t \le T(P) - t_0$$

and

$$E = \text{constant} \times P.$$

This terminology is especially appropriate if the constants involved are small. For these computations
we have $a(P) = 1 - 2t_0/T(P)$, which is asymptotically 1, and $b(P) = (W(1) + \text{constant})/(P \cdot W(1))$,
which is asymptotically 1/P. Thus for embarrassingly parallel computations, we have

$$SS(P) = P + O(1). \tag{2.34}$$

Such  computations  arise  when  the  work  can  be  partitioned  into  P  parts  at  the  beginning  and  then 
done  completely  independently  by  the  processors.  Thus  we  can  achieve  optimal  speedup  for  such 
computations. 

Divide and conquer algorithms may provide scaled speedup nearly as great. Let $P = 2^k$ and
assume conservatively that

1. The work after each division of the problem is the same as W(1).

2. The events take place only at dividing the computation up and recombining the results.

Then we see that $W(P) \le W(1) \log P$, E = O(P) and we compute that, asymptotically,

$$\begin{aligned}
b(P) &= \log P / P \\
T(P) &\le T(1) \times 2 \log P \\
\bar{\nu}(P) &\le \text{constant} \times P \\
SS(P) &= O(P / \log P).
\end{aligned} \tag{2.35}$$
In [3] the average parallelism was proposed as a high level characterization of software structure.
The  average  parallelism  is  defined  as  the  speedup,  given  an  unbounded  number  of  processors. 
The  previous  discussion  shows  that  there  are  parallel  algorithms,  such  that  S(P)  or  SS(P)  tend 
asymptotically  to  zero,  hence  the  average  parallelism  does  not  provide  a  useful  characterization  of 
such  applications. 

2.6  Analysis  when  E  =  g(P)  is  a  convex  function 

The  qualitative  analysis  continues  with  the  case  when  g(P)  is  a  convex  function.  Several  algorithms 
we  have  examined  suggest  that  g(P)  is  often  a  convex  function  of  P  as  well  as  increasing. 

Theorem 2.1 If E = g(P) is an increasing and convex function for $P \ge 1$ then W(P) is also convex.
Let $P_{smax}$ be the unique solution of

$$P = [a + g(P)]/g'(P), \tag{2.36}$$

where $a = W(1)/\theta$. Then S(P) is increasing for $P < P_{smax}$ and decreasing for $P > P_{smax}$.

Proof. We have $W(P) = W_{cons} + \theta \times g(P)$ so W(P) is convex if g(P) is. The speedup S(P)
may be expressed by

$$S(P) = \frac{T(1)}{T(P)} = \frac{W(1)/I}{(W(1) + \theta \times g(P))/(P I)} = \frac{P \, W(1)}{W(1) + \theta \times g(P)} = \frac{P}{1 + [\theta/W(1)]\, g(P)}. \tag{2.37}$$

Combine $\theta/W(1)$ into the constant 1/a, so that $S(P) = aP/(a + g(P))$, and differentiate this expression to obtain

$$S'(P) = a\,\frac{a + g(P) - P g'(P)}{(a + g(P))^2}. \tag{2.38}$$

Set g(P) = P h(P) with h'(P) > 0 by the convexity assumption. Then we have

$$S'(P) = a\,\frac{a + P h(P) - P\,(h(P) + P h'(P))}{(a + P h(P))^2} = a\,\frac{a - P^2 h'(P)}{(a + P h(P))^2}. \tag{2.39}$$

Since h'(P) is positive, (2.39) is zero exactly once. A manipulation of (2.38) allows one to obtain
(2.36) as asserted by the theorem.

We may use this result to provide estimates of maximum speedups and the corresponding
number of threads of control (processors) for a few cases as given in Table 1. Note that the
speedups given are maximums; other factors (e.g., lack of load balancing) can make them smaller.
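For the power-law case $g(P) = P^n$ with $n > 1$, condition (2.36) reduces to $(n - 1) P^n = a$, so
$P_{smax}$ and the maximum speedup both have closed forms. The following Python sketch evaluates
them for the cases listed in Table 1. It assumes the speedup expression $S(P) = P / (1 + g(P)/a)$
from the proof of Theorem 2.1; the function names are illustrative and do not come from the
original text.

```python
def p_smax(a: float, n: float) -> float:
    """Root of P = [a + g(P)] / g'(P) for g(P) = P**n, i.e. (n - 1) * P**n = a."""
    if n <= 1.0:
        raise ValueError("the closed form needs n > 1")
    return (a / (n - 1.0)) ** (1.0 / n)


def speedup(p: float, a: float, n: float) -> float:
    """S(P) = P / (1 + g(P) / a) with g(P) = P**n."""
    return p / (1.0 + p ** n / a)


def table_1() -> dict:
    """Map (n, a) to the pair (P_smax, maximum speedup)."""
    rows = {}
    for n in (1.5, 2.0, 2.5, 3.0):
        for a in (1e6, 1e4, 1e2):
            p = p_smax(a, n)
            rows[(n, a)] = (p, speedup(p, a, n))
    return rows


if __name__ == "__main__":
    for (n, a), (p, s) in table_1().items():
        print(f"n = {n:<3}  a = {a:>9.0f}  P_smax = {p:9.1f}  speedup = {s:8.1f}")
```

At the maximum the speedup equals $P_{smax}(n - 1)/n$, which is why a steeper growth of the
number of events (larger n) lowers both the best processor count and the best speedup.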


2.7  Additional  workload  due  to  algorithmic  effects 

To characterize the work required by a computation C with P threads of control, we have identified
two components so far: the intrinsic work, $W_{cons}$, assigned to the threads by virtue of the
conservation law, and $W_{cc}(P)$, the work for communication and control. However, the transformation
from a computation C(1) with only one thread of control to a computation C(P) with P threads of
control often introduces additional work, called in the following the algorithmic workload and
denoted by $W_{alg}(P)$, i.e., we have

$W(P) = W_{cons} + W_{cc}(P) + W_{alg}(P)$.  (2.40)

Table 1: Values of maximum speedups and corresponding $P_{smax}$ for
$a = W(1)/\theta = 10^6, 10^4, 10^2$ and $g(P) = P^n$ for n = 1.5, 2, 2.5, and 3.

g(P)        Quantity    a = 10^6    a = 10^4    a = 10^2

P^1.5       Speedup     5291        245         11
            P_smax      16000       736         34

P^2         Speedup     500         50          5
            P_smax      1000        100         10

P^2.5       Speedup     128         20          3.2
            P_smax      213         34          5

P^3         Speedup     53          11.4        2.4
            P_smax      79          17          4

We study alternative parallel ways to do the same work and exclude from consideration complete
changes of algorithm. However, we do allow the work to be transformed in various ways using
simple equivalences of operations. Specifically,

C(1) = 4 * 5 * 9 * 412

is equivalent to $C_A(2)$ defined by

Thread 1                    Thread 2

4 * 5 = 20                  9 * 412 = 3708

20 * 3708 = 74160

but is not equivalent to $C_B(2)$ defined by

Thread 1                    Thread 2

log 4 + log 5 = 1.301       log 9 + log 412 = 3.569

Blocked                     1.301 + 3.569 = 4.870

                            $10^{4.870} \approx 74160$

Similarly

C(1) = sort (list1), sort (list2), sort (list3), concatenate (list1, list2, list3)

is equivalent to $C_A(2)$ defined by

Thread 1                    Thread 2

sort (list1)                sort (list2)

sort (list3)                concatenate (list1, list2) = list4

concatenate (list4, list3)
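The multiplication example can be tried out directly. The sketch below is only an illustration
under stated assumptions: Python threads stand in for the two threads of control, the function
names are hypothetical, and the logarithms are rounded to three decimals to mimic the table
lookup implied by the example. It shows that regrouping the product reproduces C(1) exactly,
while the logarithmic route performs different operations and only approximates the result.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def c1() -> int:
    """Sequential computation C(1) = 4 * 5 * 9 * 412."""
    return math.prod((4, 5, 9, 412))


def c_a2() -> int:
    """C_A(2): each thread forms one partial product, then the results are combined."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        left = pool.submit(lambda: 4 * 5)       # thread 1
        right = pool.submit(lambda: 9 * 412)    # thread 2
        return left.result() * right.result()


def c_b2(digits: int = 3) -> float:
    """C_B(2): add base-10 logarithms rounded to `digits` places, then exponentiate."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        left = pool.submit(lambda: round(math.log10(4) + math.log10(5), digits))
        right = pool.submit(lambda: round(math.log10(9) + math.log10(412), digits))
        total = round(left.result() + right.result(), digits)
    return 10 ** total


if __name__ == "__main__":
    print(c1(), c_a2(), c_b2())
```

The gap between the last value and the first two is the reason the text treats $C_B(2)$ as a
change of algorithm rather than an equivalent rearrangement of the same work.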

The question of when two computations are equivalent is a subtle one, which we do not attempt
to make precise here because most parallelizations of algorithms introduce new work, called $W_{alg}(P)$
above. However, this concept is useful in heuristic discussions of the work of different algorithms
for the same task. The range of possible effects of parallelization of an algorithm is very large,
but there seem to be two common ones. Later we examine realistic algorithms in some detail, but
here we consider a simple algorithm.

(a) Algorithmic overhead associated with individual events. An algorithm may have internal
information that needs to be updated when new external information is received. For example, an
iteration on one domain receives values from an adjacent part of another domain. These values
affect the error estimates along the domain boundary, which, in turn, affect the estimate of
convergence ratio and relaxation factors. All these estimates and factors must be recomputed
when new information is received. A very high rate of events could distort the computation
until most of the work done is recomputing these estimates and factors, instead of carrying
out the iteration. This example of algorithmic overhead behaves like communication and
control work (indeed, the update computations are to control the numerical behavior of the
iterations).

It is plausible to merge this work into $W_{cc}$ even though it appears to be algorithmic work.
There seems to be no advantage in carrying along two independent sources of work proportional
to the number of events.

(b) Algorithmic overhead associated with (long) event free periods. The computation in one thread
might profitably use information from other threads to reduce its algorithmic work. For
example, a search party of 10 men covering an area will be forced to have all 10 men search
the entire area unless there is communication between them about which areas have already
been searched. Many analogs of this simple situation exist in computational search algorithms.
Another example occurs when long periods between events force one thread to save data for
future communications; buffers or queues can become full, requiring extra work to save data in
special ways, or the thread can even become idle (introducing an unnecessary event) waiting
to empty space for saving data. A more subtle example is a set of parallel iterations where long
periods of no communication mean that the iterations are running but not accomplishing
anything useful.

The average event interval is $T(P) \cdot P/E$ and this type of algorithmic overhead is modeled by

$W_{alg}(P) = O(T(P) \cdot P / E)$.  (2.42)

Thus, $W_{alg}(P)$ is proportional to the expected time between successive events. The larger
this interval, the more likely it is that a thread will perform unnecessary work, or that it will
duplicate work done by another thread.

This  informal  presentation  shows  that  the  avoidance  of  additional  work  for  the  algorithm  occurs 
only  in  embarrassingly  parallel  computations. 

Note that long event-free periods are usually associated with other types of undesirable effects
besides work duplication, namely effects related to the practical implementation of partial
computations on real machines with finite memory. Whenever the lifetime of partial results,
defined as the interval between the instant partial results are produced by thread $\phi^i$ and the
instant they are consumed by thread $\phi^j$, is large, the memory requirements for C become
substantial, as observed in [10].

The algorithmic workload and the lifetime of partial results discussed in this section are difficult
to capture and they will be largely ignored in this study. But it is conceivable that
a second conservation law of the type

$W_{cc}(P) + W_{alg}(P) = f(P)$  (2.43)

is valid and captures the effect that, for a given P, the sum of the work for communication and
control and the algorithmic work is constant, and any gain obtained by reducing the number of events
is compensated by an increase in the level of work duplication.


2.8  Extensions  of  the  model  to  non-SPMD  computations 

The SPMD parallel computations are homogeneous: all threads of control $\phi^i$, $1 \le i \le P$,
exhibit similar behavior. In the framework of the E/T model, this translates into the fact that the
characteristic functions $g^i(P)$ of all threads of control are identical,

$g^i(P) = \frac{1}{P} g(P)$, $1 \le i \le P$.  (2.44)

In the case of non-SPMD computations, the dissimilarities among different threads of control are
reflected by a partial ordering of threads, based upon the number of events associated with each
thread. Call $\phi^\ell$ the thread which has the largest number of events. In such a case, the E/T model
requires one to identify the thread $\phi^\ell$ and to study its characteristic function $g^\ell(P)$. Of course,
things can become even more complex when the thread with the largest number of events is data
dependent or its behavior changes widely with the data.


3  Quantitative  Analysis  of  SPMD  Parallel  Computations 

In this section we focus upon quantitative analysis of a parallel computation C and attempt to
estimate measures of performance such as speedup, execution time, processor utilization, etc. Such
an analysis is possible only if C exhibits only limited data dependencies. Dynamic computations
in which the actual sequence of events in every thread of control is data dependent lead to an
intractable analysis. Fortunately, most SPMD computations satisfy this condition: while the timing
of different events in a thread of control may change depending upon the data, the actual sequence
of computations and events occurring in a thread of control is invariant to input data.

First, we consider a static analysis, which is an analysis of the mapping from a directed acyclic
graph $\mathcal{D}$ to C. The computation and communication workloads can be analyzed at different levels
as discussed in the overview. The effects of synchronization and blocking are not captured by
the static analysis. The dynamic analysis is concerned with schedules which associate times with
events. At this stage a detailed knowledge of the hardware is necessary.

The transition from the static to the dynamic stage of the quantitative analysis is difficult.
A “superposition” property which would allow the extension of results obtained in stage one to
stage two would be desirable, but it seems that the occurrence of such a property is an exceptional
event. Some of the problems encountered in this area are due to the difficulty of estimating the
communication time between two processors, $\pi_i$ and $\pi_j$. This time depends upon factors such as

(a) The architecture of the system $\mathcal{H}$ and the number of processors in the system, n. For example,
if we call $\tau$ the communication delay for a message of unit length, then $\tau(n)$ is of the order of
$\sqrt{n}$ for a grid, $\log_2 n$ for a hypercube and n for a ring interconnection network.

(b) The communication software, the communication protocols, the routing strategy, etc.
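The orders of growth quoted in item (a) can be compared numerically. The sketch below is a
rough illustration only: it returns the leading term for each topology with all constant
factors set to one, and the function name and topology labels are assumptions made here, not
part of the model.

```python
import math


def unit_message_delay(topology: str, n: int) -> float:
    """Order-of-magnitude delay tau(n) for a unit-length message among n processors."""
    if n < 1:
        raise ValueError("n must be a positive processor count")
    if topology == "grid":
        return math.sqrt(n)      # sqrt(n) for a grid
    if topology == "hypercube":
        return math.log2(n)      # log2(n) for a hypercube
    if topology == "ring":
        return float(n)          # n for a ring
    raise ValueError(f"unknown topology: {topology!r}")


if __name__ == "__main__":
    for name in ("grid", "hypercube", "ring"):
        print(name, unit_message_delay(name, 64))
```

Even at n = 64 the three networks differ by roughly an order of magnitude, which is one reason
the dynamic stage needs detailed knowledge of the hardware.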

Effects unrelated to the parallel computation C but determined by the need to share resources in
a multiuser system can only be considered during the dynamic analysis. Such effects are
fragmentation in communication (the need to split a large message into a number of packets) and
processor sharing in the case of multiuser systems. These effects are extremely difficult to capture
in any model.

While static analysis is often susceptible to an analytical approach, the dynamic models only
rarely lead to a tractable analysis. Often dynamic models can be validated only through the
process of monitoring C. Static analysis may be useful to make decisions concerning scheduling in
a multiuser environment. For example, if the thread $\phi^i$ expects a large message from the thread
$\phi^j$, then the operating system of the node where $\phi^i$ runs may suspend it, run another process and
return to it after the message has been received. Performance measures such as the average degree of
parallelism [3] are clearly related to stage one in our approach and can be used successfully to
make scheduling decisions as pointed out in [14].

3.1  Static  analysis  -  Mapping  in  the  E/T  model 

Mapping  in  the  context  of  the  E/T  model  is  the  process  of  deciding  which  computations  and  which 
problem  data  are  assigned  to  every  thread  of  control  of  C.  The  two  aspects  of  mapping,  computation 
mapping  and  data  mapping  are  closely  related. 

Let us for the moment consider the general case when a parallel algorithm A is given as a directed
acyclic graph (DAG) $\mathcal{D} = (D, A)$ whose nodes $d_j \in \mathcal{D}$ are computational tasks with given workload
requirements and whose arcs $a_j \in A$ represent both temporal and functional dependencies. In
addition, the arcs have associated with them communication load requirements. The mapping of
the DAG $\mathcal{D}$ to a parallel computation C is a process of deciding how many threads of control
C should have and how different nodes of $\mathcal{D}$ will be assigned to the threads of control. The
computation mapping consists of the following steps

(a) Choose the number P of threads of control $\phi^i$, $1 \le i \le P$.

(b) Group together a number of nodes of $\mathcal{D}$ and assign them to $\phi^i$.

The data mapping is the process of assigning problem data to different threads of control. Data
mapping follows computation mapping and allows us to

(c) Define the sequence of actions performed by every thread $\phi^i$, $1 \le i \le P$.

(d) Determine the sequence of events associated with every thread $\phi^i$, $1 \le i \le P$.

Note that the SPMD execution corresponds to the case when the computation mapping assigns
all nodes of $\mathcal{D}$ to every thread of control $\phi^i$. The data mapping makes the execution of the P
threads of control different.

Formally, the computation mapping is described as follows. Denote by N the number of nodes
of $\mathcal{D} = (D, A)$, $N = |D|$. Then the work associated with the DAG $\mathcal{D}$ is characterized by an N × 1
vector $W_\mathcal{D}$,

$W_\mathcal{D} = [w_j]$  (3.1)

with $w_j$ the work associated with node $d_j \in \mathcal{D}$. The mapping from $\mathcal{D}$ to C is characterized by the
(P × N) computation mapping matrix $M_\mathcal{D}$,

$M_\mathcal{D} = [m_{ij}]$  (3.2)

with

$m_{ij} = 1$ if $d_j$ is mapped into $\phi^i$, $1 \le i \le P$ and $1 \le j \le N$,
$m_{ij} = 0$ otherwise.  (3.3)

The integer $m_i = \sum_{j=1}^{N} m_{ij}$ is called the grouping factor of $\phi^i$ and represents the number of
nodes of $\mathcal{D}$ assigned to $\phi^i$, with $1 \le i \le P$.

The data mapping associated with a domain B decomposed into K subdomains $b_j$ is characterized
by the (P × K) data mapping matrix $Q_B$,

$Q_B = [q_{ij}]$  (3.4)

with

$q_{ij} = 1$ if $b_j$ is mapped into $\phi^i$, $1 \le i \le P$ and $1 \le j \le K$,
$q_{ij} = 0$ otherwise.  (3.5)

The work performed by C(P) is characterized by a P × 1 vector $W_C$,

$W_C = [w^i]$  (3.6)

with $w^i$ the work assigned to $\phi^i$ by the mapping M. Clearly,

$W_C = M_\mathcal{D} W_\mathcal{D}$,  (3.7)

$w^i = \sum_{j=1}^{N} m_{ij} w_j$.  (3.8)
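The computation mapping amounts to a matrix-vector product: the work of each thread is the
sum of the work of the nodes assigned to it, and the grouping factor is a row sum of the mapping
matrix. A minimal Python sketch follows; the five-node DAG, its workloads and the function names
are hypothetical values chosen only for illustration.

```python
def thread_work(mapping, node_work):
    """Work per thread: row i of the 0/1 mapping matrix selects the nodes run by thread i."""
    return [sum(m_ij * w_j for m_ij, w_j in zip(row, node_work)) for row in mapping]


def grouping_factors(mapping):
    """Grouping factor of each thread: the number of nodes assigned to it."""
    return [sum(row) for row in mapping]


# Hypothetical DAG with N = 5 nodes mapped onto P = 2 threads of control.
NODE_WORK = [4.0, 2.0, 3.0, 1.0, 5.0]
MAPPING = [
    [1, 0, 1, 0, 0],   # thread 1 runs nodes 1 and 3
    [0, 1, 0, 1, 1],   # thread 2 runs nodes 2, 4 and 5
]

if __name__ == "__main__":
    print(thread_work(MAPPING, NODE_WORK))
    print(grouping_factors(MAPPING))
```

In this sketch every column of the mapping matrix holds exactly one 1, so each node is executed
by one thread. An SPMD mapping would instead fill the matrix with ones and rely on the data
mapping to differentiate the threads.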

We consider a nondeterministic model and assume that the work process of $\mathcal{D}$ is stationary
and the random variables $w_j$, $1 \le j \le N$, have mean $\mu_\mathcal{D}$ and variance $\sigma^2_\mathcal{D}$. The assumption of a
stationary process is common in the analysis of stochastic processes and it is necessary in order
to promote tractability of our model. In general, the $w_j$ are dependent random variables and their
dependence is characterized by the work covariance matrix of $\mathcal{D}$, $\sigma^2_\mathcal{D}$,

$\sigma^2_\mathcal{D} = [(\sigma^2_\mathcal{D})_{ij}]$  (3.9)

with

$(\sigma^2_\mathcal{D})_{ij} = \mathrm{Cov}(w_i, w_j)$.  (3.9')

If we assume that the work process associated with the DAG $\mathcal{D}$ is stationary, it follows immediately
that the work process associated with the mapping C is also stationary. The random variables
$w^i$ with $1 \le i \le P$ have a distribution with mean $\mu^i$ and variance $(\sigma^i)^2$ such that

$\mu^i = m_i \mu_\mathcal{D}$,  (3.10)

$(\sigma^i)^2 = m_i \sigma^2_\mathcal{D} + \sum_{j=1}^{m_i} \sum_{k=1, k \ne j}^{m_i} \mathrm{Cov}(w_j, w_k)$.  (3.11)

The work covariance matrix of C is $\sigma^2_C$, defined by

$\sigma^2_C = [(\sigma^2_C)_{ij}]$,  (3.12)

$(\sigma^2_C)_{ij} = \mathrm{Cov}(w^i, w^j)$.  (3.13)
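The variance of a thread's workload can be evaluated from the node covariance matrix. The sketch
below relies on the standard identity that the variance of a sum equals the sum of the variances
plus the sum of the covariances over all ordered pairs of distinct nodes; reading the double sum in
the text as running over distinct nodes is an assumption made here. The function name and the
sample matrix are hypothetical.

```python
def thread_work_variance(cov, nodes):
    """Variance of the total work of one thread.

    cov   -- square covariance matrix of the node workloads
    nodes -- indices of the nodes assigned to the thread
    """
    variance_terms = sum(cov[j][j] for j in nodes)
    covariance_terms = sum(cov[j][k] for j in nodes for k in nodes if j != k)
    return variance_terms + covariance_terms


# Hypothetical covariance matrix: nodes 0 and 1 are correlated, node 2 is independent.
COV = [
    [1.0, 0.5, 0.0],
    [0.5, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]

if __name__ == "__main__":
    print(thread_work_variance(COV, [0, 1]))   # correlated nodes grouped together
    print(thread_work_variance(COV, [0, 2]))   # independent nodes grouped together
```

Grouping positively correlated nodes in the same thread increases the spread of that thread's
workload, which matters for the load balance considerations that follow.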

An important aspect of mapping is related to load balance. Intuitively, it seems desirable to
assign to every thread of control an equal amount of work with the hope that such a load balanced
mapping will eventually lead to the shortest possible execution time and to the best possible
utilization of resources. An SPMD computation seems an ideal case from the load balance point
of view since in this case all threads of control have assigned to them identical workload. But data
mapping makes the execution different: the actual instruction execution sequence in each thread of
control is different due to data dependencies and a perfect load balance is unlikely to be achieved.
For this reason we consider a nondeterministic analysis and regard the workload associated with
any thread of control as a random variable.

In addition to such algorithmic load imbalance effects there are non-algorithmic causes such
as hardware failures and retries, message retransmission, etc. To characterize the load imbalance
associated with any given mapping we introduce a load imbalance factor, $\Delta$, defined in the following.
Denote by Y the workload associated with the most heavily loaded thread of control,

$Y = \max(w^1, w^2, \ldots, w^i, \ldots, w^P)$.  (3.14)

Call $\bar{Y}$ the expected value of the random variable Y and call $\bar{w}$ the mean value of $\mu^i$, defined as

$\bar{w} = \frac{1}{P} \sum_{i=1}^{P} \mu^i$.  (3.15)

Then $\Delta$ is defined implicitly as

$\bar{Y} = \bar{w}(1 + \Delta)$.  (3.16)

In this expression $\bar{w}$ is the expected workload per thread of control and $\bar{Y}$ is the expected workload
of the most heavily loaded thread of control.
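When per-thread workloads have been measured over several runs, the load imbalance factor can
be estimated by replacing the expectations with sample means. This is a sketch under that
assumption; the function name and the sample data are hypothetical.

```python
def load_imbalance(runs):
    """Estimate the load imbalance factor from observed workloads.

    runs -- list of runs; each run lists the workload of every thread of control.
    Returns (mean of the per-run maximum) / (mean workload per thread) - 1.
    """
    if not runs or not runs[0]:
        raise ValueError("need at least one run with at least one thread")
    threads = len(runs[0])
    expected_max = sum(max(run) for run in runs) / len(runs)
    expected_per_thread = sum(sum(run) for run in runs) / (len(runs) * threads)
    return expected_max / expected_per_thread - 1.0


if __name__ == "__main__":
    balanced = [[10.0, 10.0, 10.0, 10.0]]
    skewed = [[8.0, 12.0], [12.0, 8.0]]
    print(load_imbalance(balanced), load_imbalance(skewed))
```

The second data set shows that the factor can be positive even when every thread has the same
expected workload, because the maximum is taken run by run.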

To conclude this section we summarize the parameters which may be used to characterize
statically a parallel computation C, in addition to P, E and W(P), which have been discussed
previously.

$k^i$ - The number of events in the thread of control $\phi^i$, $1 \le i \le P$.

$\alpha^i_j$ - The amount of work performed by the thread $\phi^i$ between two
consecutive events $e^i_j$ and $e^i_{j+1}$, for $1 \le i \le P$, $1 \le j \le k^i$.

$\alpha^i$ - The expected amount of work associated with an event in
thread $\phi^i$, computed as the mean value of $\alpha^i_j$, $1 \le i \le P$,
$1 \le j \le k^i$.

$w^i$ - The total workload associated with the thread $\phi^i$, $1 \le i \le P$.

$\mu^i$ - The expected workload associated with the thread $\phi^i$, $1 \le i \le P$.

$\bar{w}$ - The expected workload per thread of control.

$\beta^i_j$ - The amount of data transferred when the event $e^i_j$ in thread
$\phi^i$ occurs, $1 \le i \le P$, $1 \le j \le k^i$.

$\beta^i$ - The expected amount of data transferred per event in thread
$\phi^i$, computed as the mean value of $\beta^i_j$, $1 \le j \le k^i$.

0'  ~  The  total  communication  load  associated  with  thread  <J>',  1< 

i  <  P. 

0  ~  The  expected  value  of  0'. 

$\Delta$ - The load imbalance factor.

Finally, we stress again that without the detailed information required to compute a schedule,
the performance of a parallel computation can only be estimated. Such estimates may be used
to compare different mappings $M_i$ of a given algorithm A, but no definite statements about the
actual execution time, the speedup and/or the efficiency can be made at this stage. Two examples
applying these ideas follow.


3.2  Examples 

3.2.1  A  parallel  algorithm  for  matrix  multiplication 

Consider a parallel algorithm for matrix multiplication with P threads of control. This algorithm
is described and analyzed in [1]. Let A and B be two (n × n) matrices. The algorithm requires
the partitioning of A and B into $q^2$ disjoint submatrices $A_{ij}$ and $B_{jk}$. Each submatrix is of size
(m × m). The values of q and m are given by

$q = P^{1/3}$, $m = n/q = n/P^{1/3}$.  (3.17)

Without loss of generality assume that $P = 2^{3a}$ and we have n = mq with a and m positive
integers. The algorithm proceeds as follows. For every triplet (i, j, k) compute

$\ell(i, j, k) = (i - 1)q^2 + (k - 1)q + j$.  (3.18)

Clearly $1 \le \ell(i, j, k) \le P$ when $i, j, k \in [1, q]$. There are two steps:

A. Let the thread $\phi^\ell$ perform the following actions

(A1) - Read the submatrix $A_{ij}$ with $m^2$ elements.

(A2) - Read the submatrix $B_{jk}$ with $m^2$ elements.

(A3) - Compute the submatrix $C_{ijk} = A_{ij} \times B_{jk}$. This requires
$O(m^3)$ operations.

B. Organize the threads $\phi^\ell$ with $1 \le \ell \le P$ as $q^2$ complete binary trees such that each tree
computes

$C_{ik} = \sum_{j=1}^{q} C_{ijk}$, $1 \le i \le q$, $1 \le k \le q$.  (3.19)

This addition is done in a pipelined manner. Each group of q threads with i and k
fixed and with $1 \le j \le q$ computes the corresponding $C_{ik}$.

An example with $P = 2^6$ and n = 12 is shown in Figure 5. Figure 6 shows the q threads of
control used to compute $C_{11}$ for this example.

For this algorithm the total number of events is

$E = 3P + \frac{P}{q}(2q - 1) = 3P + P^{2/3}(2P^{1/3} - 1)$.  (3.20)

Note the number of events per thread of control ranges from

$k^\ell = O(\log_2 q)$ when $\ell = \ell(i, 1, k)$ to
$k^\ell = O(1)$ when $\ell = \ell(i, q, k)$.  (3.21)

The expected active period between two consecutive events is the same for all threads and it is
equal to the workload required to multiply (add) two submatrices of size m × m,

$\alpha^\ell = O(m^3)$.  (3.22)

The expected duration of an event varies from

$\beta^\ell = \Theta(m^2)$ when $\ell = \ell(i, 1, k)$ to
$\beta^\ell = \Theta(m^2 q)$ when $\ell = \ell(i, q, k)$.  (3.23)

Figure 5: A parallel algorithm for matrix multiplication, with P threads of control.
($P = 64$, $n = 12$, $q = P^{1/3} = 4$, $m = n/q = 3$).

Figure 6: The pipelined algorithm for matrix multiplication. The time series of
events are shown for q = 4 threads of control which compute $C_{11}$.
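The thread numbering and the event count for this example can be checked with a few lines of
Python. The sketch assumes the index function of (3.18) and reads the event count of (3.20) as
3P + (P/q)(2q - 1); the function names are illustrative.

```python
def thread_index(i: int, j: int, k: int, q: int) -> int:
    """Thread number for the block triple (i, j, k), each index running from 1 to q."""
    return (i - 1) * q * q + (k - 1) * q + j


def matmul_total_events(q: int) -> int:
    """Total number of events for P = q**3 threads, read as 3P + (P/q)(2q - 1)."""
    p = q ** 3
    return 3 * p + (p // q) * (2 * q - 1)


if __name__ == "__main__":
    q = 4                                   # the example of Figure 5: P = 64
    indices = sorted(
        thread_index(i, j, k, q)
        for i in range(1, q + 1)
        for j in range(1, q + 1)
        for k in range(1, q + 1)
    )
    print(indices == list(range(1, q ** 3 + 1)))   # every thread is used exactly once
    print(matmul_total_events(q))
```

The first check confirms that the index function assigns each of the P block products to a
distinct thread, which is what allows step A to proceed without any coordination.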


3.2.2  A  parallel  algorithm  for  banded  matrix  LU  decomposition 

The  next  example  is  from  a  concurrent  algorithm  for  banded  matrix  LU  decomposition.  A  detailed 
discussion  of  this  algorithm  can  be  found  in  Chapter  20  of  [4]  where  its  implementation  on  a 
hypercube  is  analyzed.  The  solution  of  many  PDE’s  can  be  formulated  as  the  solution  of  the  linear 
systems  of  equations 


AX  =  B  (3.24) 

where A is a banded M × M matrix with bandwidth b and where X and B are M × n matrices of
n solutions and free terms, respectively. Only the case when

$1 \ll b \ll M$  (3.25)

is considered here. To solve (3.24) a window of size m × m (m = 2b + 1) is defined and an iterative
algorithm is used as follows. At iteration k the window covers the submatrix consisting of rows k
to k + m - 1 and columns k to k + m - 1. The process of solving (3.24) consists of three steps.

1. An LU decomposition of A is performed. During the k-th iteration, the following
transformations are made. Set

$L_{k,k} = 1$ and $L_{k+i,k} = A_{k+i,k} / A_{k,k}$, $1 \le i < m$,  (3.26)

$U_{k,k} = A_{k,k}$, $U_{k,k+j} = A_{k,k+j}$, $1 \le j < m$,

and

$U_{k+i,k+j} = A_{k+i,k+j} - L_{k+i,k} \cdot U_{k,k+j}$, $1 \le i \le m - 1$, $1 \le j \le m - 1$.

2. A forward reduction of B is performed by

$B_{k+i,j} = B_{k+i,j} - B_{k,j} L_{k+i,k}$ for $1 \le i < m$, $0 \le j \le n - 1$.  (3.27)

3. A back-substitution from last to first now generates the solution by

$X_{k,j} = B_{k,j} / U_{k,k}$, $0 \le j \le n - 1$,

$B_{k-i,j} = B_{k-i,j} - X_{k,j} U_{k-i,k}$.  (3.28)

In this presentation, only the LU decomposition (Step 1) is analyzed. The LU decomposition
takes a square matrix A and performs an in-place transformation. The L matrix overlaps the lower
triangular part and the U matrix overlaps the upper triangular part of A, including the diagonal.
The m × m window slides along the diagonal such that at step k the k-th row forms the upper side
of the window and the k-th column is the left hand side of the window. To simplify the presentation
of the algorithm, assume that the current iteration k satisfies the condition

$k \bmod m = 0$  (3.29)

and that the matrix element $A_{k+i,k+j}$ is assigned to the thread $\phi^{i_1 q + j_1}$, with

$i_1 = i \bmod q$, $j_1 = j \bmod q$,  (3.30)

where $q = \sqrt{P}$. This assignment of the matrix elements corresponds to a scatter decomposition as
described in [4]. Our analysis corresponds to the case when the size of the window m is a multiple
of q and when no pivoting is necessary. In this case $n = m^2/q^2 = m^2/P$ elements of A are handled
by every thread of control $\phi^i$.
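The scatter decomposition can be illustrated with a short Python sketch that assigns every
element of the window to a thread and counts how many elements each thread receives. It assumes
zero-based offsets i and j inside the window and threads numbered from 0 to P - 1; the function
names are hypothetical.

```python
def owner(i: int, j: int, q: int) -> int:
    """Thread owning window element (i, j) under the scatter decomposition."""
    return (i % q) * q + (j % q)


def elements_per_thread(m: int, q: int) -> list:
    """Count the window elements handled by each of the P = q * q threads."""
    counts = [0] * (q * q)
    for i in range(m):
        for j in range(m):
            counts[owner(i, j, q)] += 1
    return counts


if __name__ == "__main__":
    m, q = 6, 2                      # window size a multiple of q, P = 4 threads
    print(elements_per_thread(m, q))
```

When m is a multiple of q every thread receives the same number of elements, namely m*m/(q*q),
and neighbouring matrix elements land on different threads. That keeps the load balanced as the
window slides along the diagonal.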

The k-th iteration proceeds as follows:

Step 1. The main diagonal element is updated by $\phi^0$ as

$U_{k,k} = 1 / A_{k,k}$.  (3.31)

Step 2. The k-th row of A is updated by

$U_{k,k+i} = A_{k,k+i}$, $1 \le i \le m - 1$.  (3.32)

This requires no computation and no communication.

Step 3. The k-th row is transmitted. Each $\phi^i$ with $0 \le i \le q - 1$ multicasts $\sqrt{n}$ elements to
(q - 1) related threads $\phi^{i+jq}$ with $1 \le j \le q - 1$. This stage requires q(q - 1) events and no
computation.

Step 4. The k-th column of A is updated. The threads $\phi^{iq}$ with $1 \le i \le q - 1$ compute

$L_{k+j,k} = A_{k+j,k} \cdot U_{k,k}$ with $1 \le j \le m - 1$.  (3.33)

Here (m - 1) computations are performed. The total amount of work per iteration is
$a = 1 + (m - 1) + 2(m - 1)^2 = 2m^2 - 3m + 2$. The total number of events per iteration is

$k = 2q(q - 1) + q^2 = 3q^2 - 2q = 3P - 2\sqrt{P}$.  (3.34)

The expected amount of work per event is

$\alpha = \frac{2m^2 - 3m + 2}{3P - 2\sqrt{P}}$.  (3.35)

The expected amount of data transferred per event is $\beta$. Since M iterations are needed
and P events are involved in the start-up process, we have

$E = M(3P - 2\sqrt{P}) + P = P(3M + 1) - 2\sqrt{P} M$.  (3.36)
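The counts used in this example are easy to evaluate for concrete sizes. The sketch below
computes the work per iteration from the sum 1 + (m - 1) + 2(m - 1)^2 and the events per iteration
from 2q(q - 1) + q^2, then adds the P start-up events over M iterations. It assumes P is a perfect
square; the function names are illustrative.

```python
import math


def work_per_iteration(m: int) -> int:
    """One reciprocal, m - 1 column scalings and 2(m - 1)^2 update operations."""
    return 1 + (m - 1) + 2 * (m - 1) ** 2


def events_per_iteration(p: int) -> int:
    """Events in one iteration for P = q * q threads: 2q(q - 1) + q^2."""
    q = math.isqrt(p)
    if q * q != p:
        raise ValueError("P must be a perfect square")
    return 2 * q * (q - 1) + q * q


def lu_total_events(p: int, iterations: int) -> int:
    """All events over the given number of iterations plus P start-up events."""
    return iterations * events_per_iteration(p) + p


if __name__ == "__main__":
    p, m, iterations = 16, 8, 100
    work = work_per_iteration(m)
    events = events_per_iteration(p)
    print(work, events, work / events, lu_total_events(p, iterations))
```

The ratio printed third is the expected amount of work per event. It shrinks as P grows for a
fixed window size, which signals that communication and control will dominate.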


3.3  The  dynamic  analysis  -  Schedules  in  the  E/T  model 

During the previous stage of analysis only static estimates of the performance of the parallel
computation C can be obtained, since so far the model does not include the concept of time. To extend
the model we have to consider a parallel hardware $\mathcal{H}$ with n processors, $\pi_i$, $1 \le i \le n$, such that
$n \ge P$. A schedule in the sense of the E/T model is a mapping of every thread of control $\phi^i$ to a
processor $\pi_i$.

If detailed information concerning the architecture $\mathcal{H}$ is available then a schedule can be
constructed and accurate performance data can be obtained. Following [11], constructing a schedule
means deciding, for every node of the DAG associated with the computation C, the processor and the
time when the node could execute. In the context of the E/T model, creating a schedule means
determining when every event will occur and how long it will take. The information about the
processing speed will allow us to map the amount of work between two consecutive events in a thread
of control into the corresponding execution time. The information about communication speed will
allow us to map the volume of data to be transferred into a communication time. In this case the
time when every event $e^i_j$ occurs is well determined.

Note that at the time a schedule in the E/T sense is constructed, in addition to the algorithmic
events, the events required by the mapping process, new events may need to be considered. Among
the classes of new events we recognize

(a) Events related to the monitoring of C. The next section will discuss in detail this class of
events.

(b) Events related to different functions of the operating system. It is conceivable that in the
future more sophisticated operating systems for parallel machines will allow multiprocessing.
In this case events related to processor scheduling, memory allocation, etc., will affect the
performance of any computation C.

Consider a thread of control $\phi^i$ assigned to processor $\pi_i$ and define the following quantities
related to $\phi^i$, $1 \le i \le P$, and to C

$\alpha^i_j$ - The interval of time $\phi^i$ is active, following the jth event $e^i_j$ in
thread $\phi^i$.

$\alpha^i$ - The expected active time of $\phi^i$ between two consecutive
events.

$\alpha$ - The expected active time of C between two consecutive
events.

$\beta^i_j$ - The duration of the jth event $e^i_j$ in thread $\phi^i$.

$\beta^i$ - The expected duration of an event in $\phi^i$.

$\beta$ - The expected duration of an event in C.

$\gamma^i$ - The expected ratio, $\beta^i/\alpha^i$, of suspended time to active time in
thread $\phi^i$.

$\gamma$ - The expected ratio, $\beta/\alpha$, of suspended time to active time in C.


If k is the expected number of events per thread of control then the average fraction $\eta(P)$ of
the lifetime of a thread of C devoted to computation is given by

$\eta(P) = \frac{(k - 1)\alpha}{(k - 1)\alpha + k\beta} \approx \frac{1}{1 + \gamma(P)}$.  (3.37)

As expected $\eta(P) \to 1$ when $\gamma(P) \to 0$, i.e., when $\beta \ll \alpha$, or suspended time is much less than
active time. In the case of a one-to-one mapping from threads of control to processors $\eta$ represents
the average processor utilization for C.

The speedup is given by (2.25). We have $T(1) \le P(k - 1)\alpha$ with equality when $W_{alg}(P) = 0$.
Clearly we have

$T(P) = (k - 1)\alpha + k\beta \approx k(\alpha + \beta)$  (3.38)

and hence we have the bound

$S(P) = T(1)/T(P) \le P\eta(P)$.  (3.39)

The last inequality shows that a low processor utilization leads to low speedup, as expected.
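A small numerical sketch makes the bound concrete. It assumes that the expected number of events
k, the expected active time and the expected event duration are already known, for instance from
monitoring; the function names and sample values are hypothetical.

```python
def utilization(k: float, alpha: float, beta: float) -> float:
    """Fraction of a thread's lifetime devoted to computation."""
    active = (k - 1) * alpha
    return active / (active + k * beta)


def speedup_bound(p: int, k: float, alpha: float, beta: float) -> float:
    """Upper bound on the speedup: P times the utilization."""
    return p * utilization(k, alpha, beta)


if __name__ == "__main__":
    p, k = 64, 101
    for alpha, beta in ((9.0, 1.0), (1.0, 1.0), (1.0, 9.0)):
        eta = utilization(k, alpha, beta)
        approx = 1.0 / (1.0 + beta / alpha)     # large-k approximation 1 / (1 + gamma)
        print(alpha, beta, round(eta, 3), round(approx, 3), round(p * eta, 1))
```

For large k the exact fraction and the approximation 1/(1 + gamma) agree closely, and the bound
drops well below P as soon as the event duration is comparable to the active time between
events.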

3.4  Monitoring  parallel  computations 

A model of a physical system (process or phenomenon) is an abstraction which distinguishes between
essential  and  non-essential  aspects  of  the  system  being  modeled  and  attempts  to  predict  the  behavior 
of  the  system  using  a  small  subset  of  essential  parameters.  To  validate  a  model  means  to  compare 
predictions  of  the  system  performance  obtained  through  the  analysis  of  the  model  for  a  particular 
set  of  input  parameters,  with  actual  data  gathered  from  observations  of  the  real  system. 

In  this  section  some  of  the  issues  pertinent  to  the  validation  of  the  E/T  model  are  discussed  and 
it  is  argued  that  the  parameters  necessary  for  the  validation  of  the  model  of  a  parallel  computation 
C  can  be  obtained  easily  through  monitoring.  A  detailed  discussion  of  monitoring  as  well  as  a 
formal  model  for  the  monitoring  process  and  an  architectural  model  for  software  and  hardware 
tools  for  monitoring  parallel  and  distributed  software  is  presented  in  [8].  Here  we  discuss  only  the 
relationship  between  monitoring  and  validation  of  the  E/T  model. 

The E/T model characterizes a parallel computation C at two levels: at the thread of control level, when information about one thread of control is necessary for the model, and at the global level, when the entire collection of P threads of control is taken into account. At the first level, the total number $k^i$ of events in thread $\phi^i$, as well as the mean time $\alpha^i$ between two consecutive events and the mean duration $\beta^i$ of an event, are the parameters of the model. How $k^i$, $\alpha^i$ and $\beta^i$ can be obtained directly by monitoring the execution of $\phi^i$ is shown later.

At the global level data concerning all threads of control must be gathered. While the global level characterization of C is not qualitatively different from that at the thread of control level, the validation of the model through monitoring does require a much larger volume of data to be collected and analyzed. Hence this type of characterization becomes more difficult to validate for massively parallel computations, when a large number of threads of control must be monitored and analyzed. Note also that in the case of SPMD parallel computations, all threads of control exhibit quasi-identical behavior, and monitoring a sample of threads (or even one) can provide enough information to estimate accurately the global level characterization of C. For other types of parallel computations (non-SPMD) it is rather difficult to extrapolate the knowledge acquired through monitoring one thread of control to the global characterization of C.

Monitoring  a  parallel  computation  C  is  the  process  of  recording  the  events  of  interest  which 
occur  during  the  lifetime  of  processes  running  different  threads  of  control  of  C.  A  monitoring  event 
is  defined  [8]  as  a  change  of  state  of  a  process.  An  event  is  “of  interest”  depending  upon  the  goals 
of  the  monitoring  process.  Monitoring  the  execution  of  a  parallel  computation  C  is  necessary  for 
debugging  a  particular  implementation  of  C  and  for  performance  evaluation.  During  monitoring 
each  event  of  interest  generates  a  trace  record  which  contains  all  relevant  information  concerning 
the  thread,  e.g.  the  type  of  event,  the  time  of  the  event,  the  state  of  the  process,  etc. 

The definition of a monitoring event is more general than the one discussed so far in the context of the E/T model. For example, “I = 5” can be defined as an event of interest for monitoring. It is indeed a change of state of the corresponding process, since a new value is assigned to the variable I, but it is not an event in the sense of the E/T model, since the corresponding thread of control does not change its state as a result of this assignment. However, as soon as a change of state of a process is designated as an event of interest and it is decided to monitor it, this monitoring event also becomes an event in the sense of the E/T model, since monitoring means an interruption of the original flow of control in order to record the pertinent trace data. On the other hand, any event in the context of the E/T model corresponds to a change of state of the process which embeds the corresponding thread of control, hence it can be monitored.

A first important conclusion of this discussion is that there is a one-to-one mapping between monitoring events and the events defined in the context of the E/T model. In other words, the E/T model is “observable” through monitoring: all the parameters required by the E/T model can be obtained as part of the trace data gathered through monitoring. A second conclusion is that monitoring a parallel computation may affect the timing of events as well as the total number of events in the original computation. This is an undesirable effect associated with every process of measurement.

3.5  Monitoring  the  performance  of  iterative  methods  on  a  distributed  memory 
system 

An  experiment  to  study  the  performance  of  iterative  methods  on  a  distributed  memory  system 
is  described  in  detail  in  [9].  The  experiment  uses  the  parallel  ELLPACK,  PELLPACK  system 
developed  at  Purdue  [6],  running  on  a  128  processor  NCUBE.  The  TRIPLEX  tool  set  [7]  is  used 
to  monitor  the  execution  and  to  collect  trace  data. 

The purpose of the experiment was to collect detailed information concerning the execution of a particular SPMD application, to study how these data relate to the high level characterization of parallelism in the framework of the E/T model, and to investigate how similar or dissimilar the behavior of the threads of control of an SPMD computation is.

The  experiment  monitors  the  execution  of  the  code  implementing  a  Jacobi  iterative  algorithm 
for  solving  a  linear  system  of  equations,  an  important  component  of  a  parallel  PDE  solver.  To 
ensure  a  load  balanced  execution,  the  domain  decomposer,  part  of  the  PELLPACK  environment, 
attempts  to  assign  to  every  PE  an  equal  amount  of  computation.  A  careful  selection  of  the  interface 
points  of  the  neighboring  domains  is  also  necessary  in  order  to  achieve  a  balanced  communication. 
The experiment was conducted by taking a problem of a fixed size and repeating the execution with the number of PEs ranging from 2 to 128.


The detailed behavior of all threads of control was captured by recording all the events marking changes of state for every thread. For every event the TRIPLEX tool creates a trace record which contains the pertinent information about the event: type, time stamp, PE, amount of data transferred, etc. All the measurements reported are based upon a clock with a resolution of 0.1 msec. To minimize the volume of trace data, only events related to communication and control were recorded. Even so, the trace data collected during a single experiment with 128 PEs amounted to about 25 Mbytes.

The raw data were processed in several stages. First, the events outside the scope of Jacobi iterations were filtered out. Then a preprocessing step gathered the data required by the E/T model. The active time between events, the duration of an event (read/write) and the length of a blocking period were obtained by correlating local events, that is, events occurring in the same thread (on the same PE). The time for communication and control was computed as the difference between the duration of an event and the length of its blocking period. To compute the algorithmic blocking (defined as the interval from the instant a read is issued until the corresponding write takes place) it was necessary to correlate non-local events, that is, events involving more than one thread. Finally, a statistical processing was performed in order to obtain data as described in Section 3.3.
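
The local correlation step can be sketched as follows. This is only an illustration under stated assumptions: the trace of one thread is assumed to have already been reduced to a time-ordered list of (start, end) pairs, one per event, which is a hypothetical format and not the TRIPLEX trace record layout. The function name `thread_parameters` is likewise hypothetical.

```python
from statistics import mean


def thread_parameters(trace):
    """Estimate the per-thread E/T parameters from local events only.

    trace -- time-ordered list of (start, end) times, one pair per event of a
             single thread (assumed format); at least two events are required.
    Returns (k, alpha, beta): number of events, mean active time between two
    consecutive events, and mean duration of an event.
    """
    k = len(trace)
    durations = [end - start for start, end in trace]
    # Active period j runs from the end of event j to the start of event j + 1.
    active = [trace[j + 1][0] - trace[j][1] for j in range(k - 1)]
    return k, mean(active), mean(durations)


# Hypothetical trace with three events (times in arbitrary units).
trace = [(0.0, 1.0), (5.0, 6.5), (10.5, 11.0)]
k, alpha, beta = thread_parameters(trace)
gamma = beta / alpha  # ratio of suspended time to active time
print(k, alpha, beta, gamma)
```

Computing the algorithmic blocking would additionally require matching each read with the corresponding write on another thread, which this sketch does not attempt.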

Preliminary results indicate that, in spite of all the precautions to achieve a well balanced communication, the behavior of threads of control can be considerably different. The number of events, the active time, and the total read time per thread may differ by up to a factor of two from one thread to another, as shown in Figure 7 for the expected active time. Figure 8 represents the characteristic function $g(P)$ of the parallel computation, which indicates an $O(P^2)$ behavior. This happens because at the end of every iteration a global communication, implemented as broadcast-collapse, takes place in order to communicate values between threads of control. There is also a global exchange of information every few iterations to obtain information for convergence control. While this could be done by fan-in, fan-out communication in principle, the NCUBE system forces the use of broadcasting, which is another source of $O(P^2)$ events. One familiar with Jacobi iteration would not expect $g(P) = O(P^2)$. This behavior arises primarily because the NCUBE system does not provide adequate communication utilities: one must use a broadcast when one actually wants to do a multicast to just a handful of “nearby” processors. Using the system-provided “supposed” multicast facility actually increases the communication time, because it is implemented using a broadcast. The allocation of threads of control to actual processors can also affect $g(P)$. While the optimal allocation for this problem is NP-hard in general, there are heuristic algorithms which keep the distances between actual processors to a reasonable level. The expected active time per event decreases linearly (Figure 9), while the write time is essentially constant (Figure 10).

The read time per event experiences a sharp increase (Figure 11) and most of it is due to the algorithmic blocking (Figure 12). The active time fraction of the total non-blocked time decreases, due to the $O(P^2)$ growth of the number of events per thread (Figure 13).

These results, though preliminary, seem to indicate that the point of view taken by the E/T model, namely, that the work for communication and control is essential in understanding the behavior of parallel computation, is well motivated. The measured speedup of a parallel computation C may be disappointingly low, even when a high PE utilization is observed, simply due to the overhead associated with communication and control. This overhead is difficult to measure, but it can be estimated when $g(P)$ is known.

Further experiments are necessary to establish a sound relation between the algorithmic blocking and $g(P)$. The present data show a strong interdependence between the two.

Figure 7: The minimum (dashed), the average (solid) and the maximum (plus) active time for a thread of control. Irregular domain, 33 x 33 grid. Horizontal axis: number of threads of control, log2(P).

Figure 8: The expected number of events per thread of control (solid line) and a 95 percent confidence interval for it.

Figure 9: The expected time of an active period, between two consecutive events (solid line) and a 95 percent confidence interval for it.

Figure 10: The expected time for a single write operation (solid line) and a 95 percent confidence interval for it.

Figure 11: The expected time for a single read operation (solid line) and a 95 percent confidence interval for it. Vertical axis: read time per event. Horizontal axis: number of threads of control, log2(P).

4  Conclusions 

It is extremely difficult to provide a concise characterization of parallel computations, invariant to
problem  size  and  especially  to  the  architecture  of  parallel  systems.  The  E/T  model  of  parallel 
execution  is  best  suited  to  the  important  class  of  SPMD  applications  and  provides  little  or  no 
insight  for  dynamic  computations  where  data  dependencies  affect  the  sequence  and  possibly  the 
number  of  events  in  a  thread  of  control. 

The characteristic function E = g(P), defined by the E/T model, is less sensitive to the architecture of the parallel system and the problem size than other types of high level characterizations of
parallel  computations.  It  allows  a  uniform  treatment  for  both  message  passing  and  shared  memory 
paradigms.  The  average  workload  per  thread  of  control,  w(P)  provides  a  signature  of  a  parallel 
computation  and  allows  interesting  conclusions  concerning  the  asymptotic  behavior  of  different 
classes  of  parallel  computations. 

The E/T model allows quantitative characterizations of the model as well. The static characterization is based upon the computation and data mapping and it is insensitive to timing characteristics (instruction execution rate, communication speed) of the parallel machine, but it does not capture the effects of blocking and synchronization.

A  final  strength  of  the  E/T  model  is  its  closeness  to  the  experiment.  Monitoring  tools  typically 
provide  precisely  the  data  required  by  the  model.  The  measurements  reported  in  the  previous 
section  capture  the  detailed  behavior  of  all  threads  of  control  of  a  parallel  computation  in  a  PDE 
solver.  A  preliminary  analysis  of  the  measurements  shows  the  effects  of  the  additional  work  for 
communication and control and of blocking. For a problem of a given size, the ratio of the time a PE is active, working on $W_{comp}$, to its total running time, including work for communication and control, decreases when the number of PEs increases under these conditions. The actual shape of both functions seems closely related to $g(P)$. Further investigations of other parallel computations, and comparisons of results obtained from running the same computation on distributed memory and shared memory systems, are necessary to gain insight into the relationship between the characteristic function $g(P)$ and other measures of performance.

Figure 12: The expected algorithmic blocking time during a read operation (solid line) and a 95 percent confidence interval for it. The algorithmic blocking is defined as the interval from the instant a read is issued until the corresponding write is initiated.

Figure 13: The expected active time fraction of the non-blocked time per thread.


Acknowledgements 

The  authors  express  their  thanks  to  Mo  Mu  for  his  helpful  comments. 

5  Literature 

[1] A. Aggarwal, A.K. Chandra and M. Snir, “Communication complexity of PRAMs”, Research Report RC 14998, IBM Research, February 1989.

[2] A. Aggarwal, A.K. Chandra and M. Snir, “On communication latency in PRAM computations”, Research Report RC 14973, IBM Research, September 1989.

[3] D.L. Eager, J. Zahorjan and E.E. Lazowska, “Speedup versus efficiency in parallel systems”, IEEE Trans. on Computers, vol. 38, no. 3, pp. 408-423, March 1989.

[4] G. Fox et al., “Solving problems on concurrent processors”, Prentice Hall, 1988.

[5] J.L. Gustafson, “Reevaluating Amdahl’s law”, CACM 31, 5, pp. 532-533, May 1988.

[6] E.N. Houstis and J.R. Rice, “Parallel ELLPACK: An expert system for parallel processing of partial differential equations”, Math. Comp. Simulation, vol. 31, pp. 497-507, 1989.

[7] D.W. Krumme, A.L. Couch and B.L. House, “The TRIPLEX tool set for the NCUBE multiprocessors”, Technical Report, Tufts University, June 1989.

[8] D.C. Marinescu, J.E. Lumpp, T.L. Casavant and H.J. Siegel, “Models for monitoring and debugging tools for parallel and distributed software”, Journal of Parallel and Distributed Computing, June 1990 (to appear).

[9] D.C. Marinescu, J.R. Rice and E. Vavalis, “Performance of iteration methods for distributed memory processors”, CSD-TR 979, Computer Sciences Department, Purdue University, May 1990.

[10] Mo Mu and J.R. Rice, “The structure of parallel sparse matrix algorithms for solving partial differential equations on hypercubes”, CSD-TR 976, Computer Sciences Department, Purdue University, May 1990.

[11] C.H. Papadimitriou and M. Yannakakis, “Towards an architecture-independent analysis of parallel algorithms”, Proc. of 20th Annual ACM Symp. on Theory of Computing, pp. 510-513, May 1988.

[12] J.R. Rice and D.C. Marinescu, “Analysis of a two level asynchronous algorithm for PDEs”, in Aspects of Computations on Asynchronous Parallel Processors (M. Wright, ed.), North Holland, pp. 23-33, 1989.

[13] J.R. Rice and D.C. Marinescu, “Multi-Level asynchronous PDEs”, in Iterative Methods (D.R. Kincaid and L.J. Hayes, eds.), Academic Press, pp. 193-212, 1990.

[14] K.C. Sevcik, “Characterizations of parallelism in applications and their use in scheduling”, Proc. of Sigmetrics and Performance ’89, Berkeley, CA, May 1989.


Recursive  LLSP  in  Distributed-Memory  Multiprocessors 

Jaeyoung  Choi,  Adam  W.  Bojanczyk* 

School  of  Electrical  Engineering 
Cornell  University,  Ithaca,  N.Y.  14853-5401 


Abstract 

We discuss implementations of block algorithms for recursive linear least squares problems (LLSP) on a linear array of p processors. A recursive least squares problem is a composite task that involves triangularization of the data matrix and solution of the resulting triangular systems of linear equations. These two problems, when mapped onto a linear array of processors, have different communication requirements that may lead to a high communication overhead for this composite problem. In transforming the data matrix to a triangular form, we consider the sliding rectangular window approach to eliminate the influence of old data on the current solution vector. This approach involves both updating and downdating of the data matrix. The back-substitution process is normally used for solving the triangular system. However, as the back-substitution process is communication bound, we also consider updating and downdating of the inverse of the triangular factor directly, and hence replace the back-substitution process by direct matrix-vector multiplication. We discuss block and wrap mappings of the data matrix onto a linear array of processors. The effects of each mapping on the overall execution time are also discussed. We propose ways to decrease the execution time by modifying the algorithms for this problem. We present results of numerical tests on a linear array of transputers.


*This work was supported by the ARO grant number DAAL03-90-G-0092.


1.  Introduction 


Let A be a full rank $m \times n$ matrix, and let b be a real m-dimensional vector. In the linear least squares problem (LLSP for short), we want to find a real n-dimensional vector x such that

$$\|A x - b\|_2 = \min_{y} \|A y - b\|_2. \qquad (1.1)$$

The first step in solving LLSP is to reduce A to the upper triangular form R by applying an orthogonal transformation Q to the matrix A. Then the problem (1.1) is equivalent to the problem of finding x such that

$$\|R x - c\|_2 = \min_{y} \|R y - c\|_2, \qquad (1.2)$$

where $c = Q^T b$. Now (1.2) can be solved by the back-substitution process.


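Suppose now that the new data matrix $(X\ \ b_x)$ is to be added to the matrix $(A\ \ b)$. Let $(A_{new}\ \ b_{new}) = \begin{pmatrix} A & b \\ X & b_x \end{pmatrix}$. The new problem to be solved is the following:

$$\|A_{new} x_{new} - b_{new}\|_2 = \min_{y} \|A_{new} y - b_{new}\|_2. \qquad (1.3)$$

It is easy to see that $x_{new}$ satisfies the following equivalent relation:

$$\|A_{new} x_{new} - b_{new}\|_2 = \min_{y} \left\| \begin{pmatrix} R \\ X \end{pmatrix} y - \begin{pmatrix} c \\ b_x \end{pmatrix} \right\|_2. \qquad (1.4)$$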


Thus, while solving the new problem (1.4) we can exploit the fact that we already know R. What remains to be done is to transform $\begin{pmatrix} R \\ X \end{pmatrix}$ to upper triangular form. This is known as the updating problem.

Similarly, suppose that a subset of rows in the data matrix is no longer representative of the behavior of the system and should be removed from the data set. Formally, let $(A\ \ b)$ consist of the rows $(A_{current}\ \ b_{current})$ together with the rows to be removed.

Then we would like to solve the problem (1.3) where $A_{new}$ and $b_{new}$ are replaced by $A_{current}$ and $b_{current}$, respectively. Again, we would like to exploit the fact that we already know R. We are now looking for an orthogonal transformation P such that


■(?£  Me) 


(1.5)


where  R  is  upper  triangular.  This  is  known  as  downdating. 

In applications, we often want to remove an old data set, add a new data set and possibly repeat this process over and over again. In signal processing, this combined process of updating and downdating is referred to as a rectangular sliding window process. In the rectangular sliding window approach, at each recursive step we want to find an orthogonal matrix W such that

$$W \begin{pmatrix} R \\ X \end{pmatrix} = \begin{pmatrix} \tilde R \\ Y \end{pmatrix}, \qquad (1.6)$$

where $\tilde R$ is upper triangular. In (1.6), $R$ represents the old window and $\tilde R$ represents the new window in the current step.

Once the new factor $\tilde R$ is known, the corresponding triangular system of linear equations is solved. The back-substitution process is normally used for finding the solution of this system. On parallel architectures, back-substitution is communication bound. Thus, on systems where interprocessor communication is expensive relative to computation, back-substitution may increase the overall execution time. One way around this problem is to work directly with the inverse of the triangular factor R. Then the back-substitution process can be replaced by a matrix-vector multiplication, which is usually very efficient on any parallel architecture. It is known that the transformations which update and downdate the triangular factor R are closely related to those that update and downdate the inverse $R^{-1}$. We will consider two approaches to update and downdate $R^{-1}$. In the first approach, the transformations to update and downdate $R^{-1}$ are computed based on the factor R, the data to be added X, and the data to be deleted Y. In the second approach, the transformations are computed based on $R^{-1}$, X and Y. In this sliding window approach, we will compare these two approaches with respect to the execution time on a linear array of processors. The important issue of numerical behavior will be considered separately in a companion study.
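
The difference between the two ways of solving the triangular system can be illustrated with a small sequential sketch. It is not the parallel implementation discussed in this paper; the function names and the 2 x 2 example are assumptions made for illustration. In back-substitution each component of the solution needs all components computed before it, whereas with a known inverse every component is an independent dot product.

```python
def back_substitute(R, c):
    """Solve R x = c for an upper triangular R.

    x[i] depends on every x[j] with j > i, so the components must be
    produced one after another, from the last row upwards.
    """
    n = len(c)
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(R[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (c[i] - s) / R[i][i]
    return x


def matvec(M, v):
    """Matrix-vector product; each component is an independent dot product."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]


# Hypothetical 2 x 2 example with a hand-computed inverse of R.
R = [[2.0, 1.0], [0.0, 4.0]]
R_inv = [[0.5, -0.125], [0.0, 0.25]]
c = [4.0, 8.0]

x_back = back_substitute(R, c)
x_inv = matvec(R_inv, c)
print(x_back, x_inv)
```

Both calls return the same solution; the point of the sketch is only the dependency pattern, which is what makes back-substitution communication bound when the columns of R are distributed over several processors.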

We  consider  two  popular  mappings,  block  mapping  and  wrap  mapping.  In  block  mapping  the  matrix 
is partitioned into blocks of columns, with each block assigned to exactly one processor, and consecutive
blocks  mapped  onto  consecutive  processors  in  the  array.  In  wrap  mapping,  consecutive  columns  are  mapped 
to  consecutive  processors  modulo  the  number  of  processors.  In  general,  block  mappings  lead  to  smaller 
communication  requirements  and  wrap  mappings  exhibit  a  better  load  balance.  We  compare  various 
strategies  for  solving  recursive  LLSP  on  a  linear  array  of  8  transputers. 
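
The two mappings can be sketched as follows. The code assumes 0-based column and processor indices and, for the block mapping, blocks of ceil(n/p) columns; both are illustrative assumptions, as is the use of the number of nonzeros per column of an upper triangular matrix as a rough measure of load.

```python
def block_owner(col, n_cols, p):
    """Processor holding column `col` under block mapping (contiguous blocks)."""
    block = -(-n_cols // p)  # ceil(n_cols / p) columns per block (assumed)
    return col // block


def wrap_owner(col, p):
    """Processor holding column `col` under wrap mapping (columns modulo p)."""
    return col % p


def loads(owners, p):
    """Nonzeros of an upper triangular matrix held by each processor.

    Column j (0-based) of an upper triangular matrix has j + 1 nonzeros.
    """
    totals = [0] * p
    for col, owner in enumerate(owners):
        totals[owner] += col + 1
    return totals


n_cols, p = 8, 4
block = [block_owner(j, n_cols, p) for j in range(n_cols)]
wrap = [wrap_owner(j, p) for j in range(n_cols)]
print("block mapping:", block, "loads:", loads(block, p))
print("wrap mapping :", wrap, "loads:", loads(wrap, p))
```

With 8 columns on 4 processors the block mapping gives loads between 3 and 15 nonzeros per processor, while the wrap mapping gives loads between 6 and 12, which illustrates the better load balance of wrap mapping for triangular matrices.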


2. Up-Downdatings of Order k

One of the two major steps in solving recursive linear least squares problems is updating and downdating of the data matrix. We will consider this process where the data to be added to the matrix consists of k rows and the data to be deleted consists of another k rows ($k \ge 1$). Formally, given a nonsingular $n \times n$ upper triangular matrix R, a $k \times n$ matrix X and a $k \times n$ matrix Y, where rank(X) = rank(Y) = k, we want to find an orthogonal matrix W and an $n \times n$ upper triangular matrix $\tilde R$ such that

$$W^T \begin{pmatrix} R \\ X \end{pmatrix} = \begin{pmatrix} \tilde R \\ Y \end{pmatrix}. \qquad (2.1)$$

Note that (2.1) implies that

$$\tilde R^T \tilde R = R^T R + X^T X - Y^T Y. \qquad (2.2)$$

Thus as long as the right hand side of (2.2) is positive definite, the problem (2.1) is well-defined. The problem (2.2) is that of finding the Cholesky factor after two rank-k modifications of $R^T R$ (one rank-k addition of $X^T X$ and one rank-k deletion of $Y^T Y$). We will call the joint process of rank-k updating and rank-k downdating up-downdating of order k.

We can treat the problem (2.2) as a sequence of 2k rank-1 modification problems: k rank-1 updating problems and k rank-1 downdating problems. Each rank-1 modification could be realized by a sequence of n two-dimensional rotations [6]. Such an approach would require $2n^2 + 3n + O(1)$ flops per rank-1 modification, for a total of $2kn(2n + 3)$ flops. It is well-known that Householder transformations require fewer operations than the corresponding sequence of rotations. If we treat (2.2) as a two step process consisting of a rank-k updating followed by a rank-k downdating, then each step can be realized by a sequence of Householder transformations in $(k+1)n(n+1)$ flops, for a total of $2(k+1)n(n+1)$ flops, roughly a two-fold saving over the rotation based approach.
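
A rank-1 updating step realized by n two-dimensional rotations can be sketched as follows. This is a plain sequential illustration, not the implementation of [6] or of this paper; the function names `rank1_update` and `gram` and the 2 x 2 example are assumptions made for illustration.

```python
import math


def rank1_update(R, x):
    """Return upper triangular R_new with R_new^T R_new = R^T R + x^T x.

    Rotation i acts on row i of R and on the new data row x, and is chosen
    so that the i-th component of x becomes zero.
    """
    n = len(R)
    R = [row[:] for row in R]  # work on copies
    x = list(x)
    for i in range(n):
        r = math.hypot(R[i][i], x[i])
        if r == 0.0:
            continue  # nothing to rotate in this plane
        c, s = R[i][i] / r, x[i] / r
        for j in range(i, n):
            upper, lower = R[i][j], x[j]
            R[i][j] = c * upper + s * lower
            x[j] = -s * upper + c * lower
    return R


def gram(M):
    """Return M^T M for a matrix stored as a list of rows."""
    cols = len(M[0])
    return [[sum(row[a] * row[b] for row in M) for b in range(cols)]
            for a in range(cols)]


# Hypothetical example: one new data row added to a 2 x 2 factor.
R = [[3.0, 1.0], [0.0, 2.0]]
x = [4.0, 2.0]
R_new = rank1_update(R, x)
print(gram(R_new))     # R_new^T R_new
print(gram(R + [x]))   # R^T R + x^T x
```

The two printed matrices agree up to rounding. A rank-1 downdating step has the same structure but uses hyperbolic rotations in place of the plane rotations used here.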

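Recently hyperbolic Householder transformations have been proposed to realize the joint process (2.2) of updating and downdating. Let $\Phi = \mathrm{diag}(I_{n+k}, -I_k)$; then a transformation H is called hyperbolic with respect to $\Phi$ if and only if

$$\Phi = H^T \Phi H. \qquad (2.3)$$

Note that if H is hyperbolic with respect to $\Phi$ such that

$$H \begin{pmatrix} R \\ X \\ Y \end{pmatrix} = \begin{pmatrix} \tilde R \\ 0 \\ 0 \end{pmatrix},$$

where $\tilde R$ is upper triangular, then (2.2) holds, since by (2.3) the product $M^T \Phi M$ takes the same value for the matrices M on both sides, namely $R^T R + X^T X - Y^T Y$ on the left and $\tilde R^T \tilde R$ on the right. We will consider three algorithms for recursive LLSP based on hyperbolic transformations.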
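
The defining property (2.3) can be checked on the smallest possible case. The sketch below uses a 2 x 2 hyperbolic rotation rather than a hyperbolic Householder transformation, with the signature matrix diag(1, -1); this substitution, the helper names and the numerical values are assumptions made for illustration.

```python
import math


def hyperbolic_rotation(r, y):
    """2 x 2 matrix H with H (r, y)^T = (sqrt(r^2 - y^2), 0)^T.

    Requires |r| > |y|, which corresponds to the positive definiteness
    condition on the right hand side of (2.2).
    """
    rho = math.sqrt(r * r - y * y)
    c, s = r / rho, y / rho  # c^2 - s^2 = 1
    return [[c, -s], [-s, c]]


def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def transpose(A):
    return [list(col) for col in zip(*A)]


PHI = [[1.0, 0.0], [0.0, -1.0]]
H = hyperbolic_rotation(5.0, 3.0)

# H is hyperbolic with respect to PHI: H^T PHI H equals PHI.
print(matmul(transpose(H), matmul(PHI, H)))
# H annihilates the datum to be removed: 5^2 - 3^2 = 4^2.
print(matmul(H, [[5.0], [3.0]]))
```

Here the scalar "factor" 5 is downdated by the datum 3 to give 4, the scalar analogue of removing the rows Y from the factor R.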

3.  Algorithms  for  Recursive  LLSP  via  Hyperbolic  Transformations 

3.1  Algorithm  A 

The recursive algorithm for up-downdating of order k based on the hyperbolic Householder transformation is simple. The hyperbolic transformation H is constructed as a product of hyperbolic Householder transformations $H_i$ ($1 \le i \le n$), where $H_i$ operates on X, Y and the i-th row of R, i.e.,

$$H_n \cdots H_1 \begin{pmatrix} R & d \\ X & b_x \\ Y & b_y \end{pmatrix} = \begin{pmatrix} \tilde R & \tilde d \\ 0 & \tilde b_x \\ 0 & \tilde b_y \end{pmatrix} \qquad (3.1.1)$$

and $\tilde R$ is an $n \times n$ upper triangular matrix. The definition of a hyperbolic Householder transformation and the way it can be constructed and applied to a matrix are discussed in section 7.

Algorithm A requires $(2k+1)n(n+3)$ multiplications.

3.2  Algorithm  B 

Let us start with a procedure for updating the inverse of a triangular matrix. The algorithm for rank-1 updating of the inverse of the triangular factor R [6] can be extended to rank-k updating of the inverse. Let R be an upper triangular $n \times n$ matrix, X be a $k \times n$ matrix, and $H = H_n H_{n-1} \cdots H_1$ be a product of Householder transformations which transform the matrix $\begin{pmatrix} R \\ X \end{pmatrix}$ to the upper triangular form $\begin{pmatrix} \tilde R \\ 0 \end{pmatrix}$. The transformation H when applied to $\begin{pmatrix} R^{-T} \\ 0 \end{pmatrix}$ gives $\begin{pmatrix} \tilde R^{-T} \\ Z \end{pmatrix}$. To see this, let us consider the following identity:

$$I = (R^{-1}\ \ 0) \begin{pmatrix} R \\ X \end{pmatrix} = (R^{-1}\ \ 0)\, H^T H \begin{pmatrix} R \\ X \end{pmatrix} = (\tilde R^{-1}\ \ Z^T) \begin{pmatrix} \tilde R \\ 0 \end{pmatrix}.$$

Note that because of the form of H, the matrix $\tilde R^{-1}$ is upper triangular, hence it must be the inverse of $\tilde R$. It can be shown that Z is such that $Z^T Z = R^{-1} V^T (I + V V^T)^{-1} V R^{-T}$ and $V = X R^{-1}$.

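The same idea can be extended to up-downdate R and $R^{-T}$ simultaneously. Let $F = F_n F_{n-1} \cdots F_1$ be a product of hyperbolic matrices to up-downdate R with X and Y, respectively. The same F can be applied to up-downdate $R^{-T}$. Let

$$F \begin{pmatrix} R & d \\ X & b_x \\ Y & b_y \end{pmatrix} = \begin{pmatrix} \tilde R & \tilde d \\ 0 & \tilde b_x \\ 0 & \tilde b_y \end{pmatrix}.$$

Then

$$I = (\tilde R^{-1}\ \ Z^T\ \ W^T) \begin{pmatrix} \tilde R \\ 0 \\ 0 \end{pmatrix},$$

i.e.

$$F \begin{pmatrix} R^{-T} \\ 0 \\ 0 \end{pmatrix} = \begin{pmatrix} \tilde R^{-T} \\ Z \\ W \end{pmatrix},$$

where $W^T W = R^{-1} N^T (I - N N^T)^{-1} N R^{-T}$ and $N = Y R^{-1}$.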


This algorithm requires $2(2k+1)n(n+2)$ multiplications. The number of multiplications is almost twice as large as that of Algorithm A, hence Algorithm B is not recommended for a sequential processor, where the number of operations determines the cost. However, as the inverse $\tilde R^{-T}$ is known, back-substitution can now be replaced by matrix-vector multiplication. Matrix-vector multiplication, in general, requires less communication among processors than back-substitution, thus potentially leading to a faster execution time of the recursive sliding window approach.
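
The statement that Algorithm B needs almost twice as many multiplications as Algorithm A follows directly from the two counts quoted in the text. The short derivation below only divides one count by the other; the sample value n = 10 is an arbitrary choice for illustration.

```latex
\frac{2(2k+1)\,n(n+2)}{(2k+1)\,n(n+3)}
  = \frac{2(n+2)}{n+3}
  = 2 - \frac{2}{n+3},
\qquad\text{e.g. } n = 10:\ \frac{24}{13} \approx 1.85 .
```

The ratio does not depend on k and approaches 2 as n grows.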


3.3  Algorithm  C 

Note  that  Algorithm  B  requires  storing  both  matrix  R  and  its  inverse  R~ 1 .  It  is  possible  to  operate 
on  the  inverse  R“ 1  without  keeping  the  triangular  matrix  R.  Algorithm  C  is  based  on  the  algorithm  for 
rank-1  downdating  of  the  inverse  in  [6]. 

Let us rewrite the rank-$k$ updating problem as:

$$ R^T R + X^T X = \tilde R^T \tilde R. \qquad (3.3.1) $$

Assume that $X$ is a $1 \times n$ data matrix; then (3.3.1) is a rank-1 updating problem. Let us find a
sequence of plane rotations $G_{1,n+1}, \ldots, G_{n,n+1}$ such that, for $G = G_{n,n+1} G_{n-1,n+1} \cdots G_{1,n+1}$,

$$ G \begin{pmatrix} -R^{-T}X^T & R^{-T} \\ 1 & 0_{1 \times n} \end{pmatrix} = \begin{pmatrix} 0_{n \times 1} & \tilde R^{-T} \\ u & Z \end{pmatrix}. \qquad (3.3.2) $$

$G_{i,n+1}$ ($1 \le i \le n$) is the Givens rotation operating in plane $(i, n+1)$ which forces the $i$-th component of
the first column of the matrix on the left in (3.3.2) to zero. Then $\tilde R^{-T}$ is the transposed inverse of the desired factor
in (3.3.1). It can be shown that $Z^T Z = R^{-1} V^T (I + V V^T)^{-1} V R^{-T}$ and $V = X R^{-1}$.
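The rank-1 process (3.3.2) can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions: the function name `update_inverse_transpose` and the dense $(n+1) \times (n+1)$ work array are hypothetical choices made for readability, and they are not the storage scheme used on the transputer array.

```python
import numpy as np


def update_inverse_transpose(L, x):
    """Rank-1 update of L = R^{-T} (lower triangular) with a new data row x.

    Builds the bordered matrix of (3.3.2) and applies the plane rotations
    G_{1,n+1}, ..., G_{n,n+1}; the top-right n x n block of the result is
    the transposed inverse of the updated factor.
    """
    L = np.asarray(L, dtype=float)
    x = np.asarray(x, dtype=float)
    n = L.shape[0]
    M = np.zeros((n + 1, n + 1))
    M[:n, 0] = -L @ x          # first column: -R^{-T} x^T
    M[:n, 1:] = L              # right block: R^{-T}
    M[n, 0] = 1.0              # border row: (1, 0_{1 x n})
    for i in range(n):
        a, b = M[i, 0], M[n, 0]
        r = np.hypot(a, b)
        c, s = b / r, a / r
        # Rotation in plane (i, n+1) that zeroes M[i, 0].
        row_i = c * M[i] - s * M[n]
        row_n = s * M[i] + c * M[n]
        M[i], M[n] = row_i, row_n
    return M[:n, 1:]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    R = np.triu(rng.standard_normal((4, 4))) + 4.0 * np.eye(4)
    x = rng.standard_normal(4)
    L_new = update_inverse_transpose(np.linalg.inv(R).T, x)
    # L_new^T L_new should agree with (R^T R + x^T x)^{-1}.
    print(np.allclose(L_new.T @ L_new, np.linalg.inv(R.T @ R + np.outer(x, x))))
```

Because rotation $(i, n+1)$ only mixes row $i$ with the border row, the right block stays lower triangular throughout.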


We can generalize the process (3.3.2) of rank-1 updating to rank-$k$ updating by treating the rank-$k$
updating as a sequence of $k$ rank-1 updatings. Assume that $X$ is a $k \times n$ data matrix, and consider the
matrix

$$ B = \begin{pmatrix} -R^{-T}X^T & R^{-T} \\ I_k & 0_{k \times n} \end{pmatrix}. $$

Let $G^U_j$ be the product of Givens rotations as defined in (3.3.2) operating on the top $n$ rows and the $(n+j)$-th
row of $B$ such that the first $n$ components of the $j$-th column of $B$ are zero. Then for $G^U = G^U_k G^U_{k-1} \cdots G^U_1$,

$$ G^U \begin{pmatrix} -R^{-T}X^T & R^{-T} \\ I_k & 0_{k \times n} \end{pmatrix} = \begin{pmatrix} 0_{n \times k} & \tilde R^{-T} \\ U & Z \end{pmatrix}, \qquad (3.3.3) $$

where $\tilde R^{-1}$ is the desired updated factor.

Similarly, we extend the same idea to rank-$k$ downdatings. Let $Y$ be a $k \times n$ data matrix and consider
the following equation:

$$ R^T R - Y^T Y = \tilde R^T \tilde R, \qquad (3.3.4) $$

and the matrix

$$ C = \begin{pmatrix} R^{-T}Y^T & R^{-T} \\ I_k & 0_{k \times n} \end{pmatrix}. $$

Let $G^D_j$ be the product of hyperbolic Givens rotations operating on the top $n$ rows and the $(n+j)$-th row
of $C$ such that the first $n$ components of the $j$-th column of $C$ are zero. Then for $G^D = G^D_k G^D_{k-1} \cdots G^D_1$,

$$ G^D \begin{pmatrix} R^{-T}Y^T & R^{-T} \\ I_k & 0_{k \times n} \end{pmatrix} = \begin{pmatrix} 0_{n \times k} & \tilde R^{-T} \\ U & W \end{pmatrix}, \qquad (3.3.5) $$

where $\tilde R^{-T}$ is the desired lower triangular factor. It can also be shown that $W^T W = R^{-1} N^T (I - N N^T)^{-1} N R^{-T}$
and $N = Y R^{-1}$.
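For a single row ($k = 1$), the downdating process (3.3.5) can be sketched as follows. The function name `downdate_inverse_transpose`, the dense work array and the explicit check for loss of positive definiteness are assumptions added for illustration; they are not taken from the implementation described in this paper.

```python
import numpy as np


def downdate_inverse_transpose(L, y):
    """Rank-1 downdate of L = R^{-T} (lower triangular) by the data row y.

    Applies hyperbolic rotations in the planes (i, n+1), i = 1..n, to the
    bordered matrix of (3.3.5); the top-right block of the result is the
    transposed inverse of the downdated factor.
    """
    L = np.asarray(L, dtype=float)
    y = np.asarray(y, dtype=float)
    n = L.shape[0]
    M = np.zeros((n + 1, n + 1))
    M[:n, 0] = L @ y           # first column: R^{-T} y^T
    M[:n, 1:] = L              # right block: R^{-T}
    M[n, 0] = 1.0              # border row: (1, 0_{1 x n})
    for i in range(n):
        a, b = M[i, 0], M[n, 0]
        if abs(a) >= abs(b):
            raise ValueError("R^T R - y^T y is not positive definite")
        r = np.sqrt((b - a) * (b + a))
        c, s = b / r, a / r    # c^2 - s^2 = 1
        # Hyperbolic rotation in plane (i, n+1) that zeroes M[i, 0].
        row_i = c * M[i] - s * M[n]
        row_n = -s * M[i] + c * M[n]
        M[i], M[n] = row_i, row_n
    return M[:n, 1:]
```

A quick check is to build $R$ from the QR factorization of a data matrix, remove its first row with this routine, and compare $\tilde L^T \tilde L$ with $(R^T R - y^T y)^{-1}$.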

The combined process of (3.3.3) and (3.3.5) is an algorithm for up-downdating of order $k$ of the
triangular factor. Let

$$ R^T R + X^T X - Y^T Y = \tilde R^T \tilde R, $$

and consider the matrix

$$ D = \begin{pmatrix} -R^{-T}X^T & R^{-T}Y^T & R^{-T} \\ I_k & 0_k & 0_{k \times n} \\ 0_k & I_k & 0_{k \times n} \end{pmatrix}. $$

Let $S_j$, $1 \le j \le k$, be the product of Givens rotations operating on the top $n$ rows and the $(n+j)$-th
row of $D$ such that the first $n$ components of the $j$-th column of $D$ are zero. And let $S_j$, $k+1 \le j \le 2k$,
be the product of hyperbolic Givens rotations operating on the top $n$ rows and the $(n+j)$-th row of $D$
such that the first $n$ components of the $(j-k)$-th column of the block $R^{-T}Y^T$ (the $j$-th column of $D$) are zero. Then for $S = S_{2k} S_{2k-1} \cdots S_1$,


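$$ S \begin{pmatrix} -R^{-T}X^T & R^{-T}Y^T & R^{-T} & d \\ I_k & 0_k & 0_{k \times n} & b_x \\ 0_k & I_k & 0_{k \times n} & b_y \end{pmatrix} = \begin{pmatrix} 0_{n \times k} & 0_{n \times k} & \tilde R^{-T} & \tilde d \\ U_1 & P & Z & \tilde b_x \\ 0_k & U_2 & W & \tilde b_y \end{pmatrix}, \qquad (3.3.6) $$

where $P$ is a square matrix and $\tilde R^{-T}$ is the desired factor.

This algorithm is based on Givens/hyperbolic Givens transformations, so it requires $nk(8k + 5n + 17)$
multiplications.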


4.  Mapping 

We assume that we are given a ring of processors. The processors are numbered from 1 to $p$ with
the leftmost processor labeled $P_1$ and the rightmost processor labeled $P_p$. The data matrix is distributed
among the processors. The mapping of the matrix onto the processors affects the communication requirements,
the degree of concurrency and the load balance among the processors. The objectives of minimizing
communication, maximizing concurrency and uniformly balancing the load tend to conflict [7].

There are two widely used mappings: ‘block mapping’ and ‘wrap mapping’. In the block mapping,
contiguous blocks of $n/p$ columns are assigned to each processor, where $n$ is the size of the matrix and $p$
is the number of processors. The first processor $P_1$ has the first $n/p$ columns (column 1 to column $n/p$), the
second processor $P_2$ has the next $n/p$ columns (column $n/p + 1$ to column $2n/p$), and so on. In the wrap
mapping, consecutive columns are assigned to consecutive processors from the first processor $P_1$ to the last
processor $P_p$, then wrapping back to $P_1$ and continuing with further columns. In the wrap mapping, the
first processor $P_1$ contains all columns $jp + 1$ ($0 \le j < n/p$), the second processor $P_2$ has all columns
$jp + 2$ ($0 \le j < n/p$), and so on. The two mappings are illustrated in Fig. 4.1.
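The two column-to-processor assignments described above can be written down directly. The sketch below is illustrative only: the function names are hypothetical, columns and processors are numbered from 1 as in the text, $p$ is assumed to divide $n$, and the values $n = 16$, $p = 4$ in the demonstration are example values chosen here.

```python
def block_owner(col, n, p):
    """Processor (1-based) holding column `col` (1-based) under block mapping."""
    return (col - 1) // (n // p) + 1


def wrap_owner(col, p):
    """Processor (1-based) holding column `col` (1-based) under wrap mapping."""
    return (col - 1) % p + 1


def columns_by_processor(n, p, mapping):
    """Return {processor: [columns]} for mapping in {"block", "wrap"}."""
    if mapping == "block":
        owner = lambda c: block_owner(c, n, p)
    else:
        owner = lambda c: wrap_owner(c, p)
    table = {proc: [] for proc in range(1, p + 1)}
    for col in range(1, n + 1):
        table[owner(col)].append(col)
    return table


if __name__ == "__main__":
    for mapping in ("block", "wrap"):
        print(mapping, columns_by_processor(16, 4, mapping))
```

Under the block mapping the processor holding the early columns finishes its part of every step first, while under the wrap mapping every processor holds columns from all parts of the matrix.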


(a)  Block  Mapping 


(b)  Wrap  Mapping 


Figure 4.1. Two Mappings for n = If and p = 4

We will compare the two different mappings in implementing Algorithms A and B and their effects on
the overall execution time of up-downdating followed by solution of a triangular system of linear equations.
At the $i$-th step of up-downdating ($1 \le i \le n$), the processor which is assigned the $i$-th column of the
matrix computes the corresponding transformation and sends the parameters of the transformation to the
processors that are assigned the columns whose indices are greater than $i$. In the wrap mapping, the last
processor $P_p$ receives the transformation from $P_{p-1}$ and sends it to $P_1$ again if necessary. In the block
mapping, the last processor in the ring $P_p$ only receives and never sends transformations.

After  applying  transformations  to  its  assigned  columns,  each  processor  becomes  idle  for  the  remainder 
of  the  current  step.  The  block  mapping  causes  processors  containing  the  earlier  blocks  to  be  idle  much  of 
the  time,  whereas  the  wrap  mapping  keeps  all  processors  busy  as  long  as  possible.  Thus  the  wrap  mapping 
is  likely  to  yield  higher  concurrency  and  processor  utilization  than  the  block  mapping.  On  the  other  hand, 
the  block  mapping  has  potentially  smaller  communication  requirements,  since  transformations  generated 
by  a  processor  need  to  be  sent  only  to  higher  numbered  processors,  rather  than  all  processors  as  in  the 
wrap  mapping.  Once  the  data  matrix  is  triangularized,  the  resulting  triangular  system  of  linear  equations 
has  to  be  solved.  This  is  discussed  in  the  next  section. 


5.  Parallel  Triangular  Solver  and  Inverse  of  the  Triangular  Matrix 

5.1  Parallel  Triangular  Solver 

Back-substitution is used to solve the triangular system in Algorithm A. The serial algorithm to
compute the solution of $R \cdot x = b$ is as follows:

    For i = n : -1 : 1
        x_i = b_i / R_{i,i}
        b_{1:i-1} = b_{1:i-1} - R_{1:i-1,i} · x_i
    end

where we use MATLAB notation: $b_{1:i-1} = [b_1, b_2, \ldots, b_{i-1}]^T$, $R_{1:i-1,i} = [R_{1,i}, R_{2,i}, \ldots, R_{i-1,i}]^T$. $R_{:,i}$ is
the $i$-th column of $R$ and $R_{j,:}$ is the $j$-th row of $R$.
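The serial loop above translates almost line for line into NumPy. This is a minimal sketch; the function name `back_substitute` is hypothetical, and the routine works on a copy of $b$ so that the caller's vector is left unchanged.

```python
import numpy as np


def back_substitute(R, b):
    """Solve R x = b for upper triangular R by column-oriented back-substitution."""
    R = np.asarray(R, dtype=float)
    b = np.array(b, dtype=float)        # copy: b is overwritten below
    n = b.shape[0]
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):      # i = n : -1 : 1 in the 1-based notation
        x[i] = b[i] / R[i, i]
        b[:i] -= R[:i, i] * x[i]        # b_{1:i-1} = b_{1:i-1} - R_{1:i-1,i} x_i
    return x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    R = np.triu(rng.standard_normal((5, 5))) + 4.0 * np.eye(5)
    b = rng.standard_normal(5)
    print(np.allclose(R @ back_substitute(R, b), b))
```

Each step needs the component $x_i$ before the remaining right-hand side can be updated, which is the dependency that the parallel solvers discussed next try to overlap.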

Li & Coleman [4] and Heath & Romine [5] implemented algorithms for solving triangular systems
on a hypercube multiprocessor. We adopt the Li & Coleman algorithm for solving the triangular system
in the wrap mapping of Algorithm A. In a naive implementation of the back-substitution algorithm using the
wrap mapping, at the $i$-th step, $1 \le i \le n$, the processor $P_j$, $1 \le j \le p$, receives the vector $b_{1:i}$ from the
right processor $P_{j+1}$, computes one component $x_i$ of the solution, updates the vector $b_{1:i-1}$ and sends the
updated vector $b_{1:i-1}$ to the next processor $P_{j-1}$. The computations of the algorithm are distributed over the
processors, but are not executed simultaneously; there are no concurrent computations. Li & Coleman
have modified the algorithm so that operations can be overlapped. For details, see [4].

Heath & Romine developed a wavefront algorithm for the wrap mapping. In their algorithm, each
processor computes one component $x_i$ ($1 \le i \le n$) of the solution, then proceeds to compute the vector
$zsum_{1:i-1} = R_{1:i-1,i} \cdot x_i$. After computing the first $\sigma$ components (for some integer $\sigma$ satisfying $1 \le \sigma \le n$),
the processor $P_j$ ($1 \le j \le p$) sends them to the left neighbor $P_{j-1}$ so that the latter can compute the next
component. Meanwhile, the processor $P_j$ resumes the computation of the next $\sigma$ components of $zsum$.
We modified the algorithm for the block mapping. Each processor, when ready, computes $n/p$ components
of $x$. After the processor receives $n/p$ components of $zsum$ from the left neighbor, it updates them by
multiplying the components by the corresponding $n/p \times n/p$ submatrix of the triangular matrix $R$ and
sends them to the right neighbor, and so on. A snapshot of the wavefront algorithm for the wrap mapping
and a snapshot of the modified wavefront algorithm for the block mapping are illustrated in Figure 5.1.

(a) Wavefront Algorithm in Wrap Mapping    (b) Modified Wavefront Algorithm in Block Mapping

Figure 5.1. Snapshots of the Wavefront Algorithm


5.2  Parallel  Algorithm  for  Computing  the  Transposed  Inverse  of  a  Triangular  Matrix 


In  order  to  avoid  the  back-substitution  in  Algorithms  B  and  C,  the  triangular  matrix  is  inverted  during 
initialization  of  the  recursive  process  and  then  the  inverse  is  maintained.  The  cost  of  computing  the  inverse 
is  high  but  affects  initialization  only.  Here  we  propose  a  parallel  algorithm  for  computing  the  transposed 
inverse  of  a  triangular  matrix. 


Assume that $R$ is a nonsingular upper triangular $n \times n$ matrix. $R^{-T}$ is required in the initial step
of Algorithms B and C. Let $R \cdot T = B$. Then $B_{ij} = \sum_{k=1}^{n} R_{ik} T_{kj}$. Assume that $T = R^{-1}$ and $B = I$, so
$R \cdot T = T \cdot R = I$. $T$ is also a nonsingular upper triangular matrix. From $R \cdot T = I$,

$$ \begin{cases} R_{ii} T_{ii} = 1, & \text{if } i = j \\ \sum_{i \le k \le j} R_{ik} T_{kj} = 0, & \text{if } i < j \\ \sum_{j \le k \le i} R_{ik} T_{kj} = 0, & \text{if } i > j \end{cases} $$

The elements of $T$ are

$$ T_{ij} = \begin{cases} 1 / R_{ii}, & \text{if } i = j \\ -\Big( \sum_{i < k \le j} R_{ik} T_{kj} \Big) / R_{ii}, & \text{if } i < j \\ 0, & \text{if } i > j \end{cases} \qquad (5.1) $$

Figure 5.2. Order of Computing $L$ ($= R^{-T}$)

Let $L = T^T = R^{-T}$. The elements of $L$ are generated from the top row to the bottom row. The order of
computation is shown in Figure 5.2.

A sequential code to compute (5.1) could look as follows:

    For i = 1 : n
        L_{i,i} = 1 / R_{i,i}
        if i > 1
            temp_{1:i-1} = L_{i,i} · R_{1:i-1,i}
            For k = i-1 : -1 : 1
                L_{i,k} = -temp_k · L_{k,k}
                temp_{1:k-1} = temp_{1:k-1} + L_{i,k} · R_{1:k-1,k}    (if k > 1)
            end
        end
    end
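The sequential scheme for (5.1) can be checked with a short NumPy translation. This is a sketch under the assumption of a nonsingular upper triangular input; the function name `transposed_inverse` is hypothetical and indices are 0-based, unlike the pseudocode.

```python
import numpy as np


def transposed_inverse(R):
    """Return L = R^{-T} for a nonsingular upper triangular R, row by row."""
    R = np.asarray(R, dtype=float)
    n = R.shape[0]
    L = np.zeros((n, n))
    for i in range(n):
        L[i, i] = 1.0 / R[i, i]
        # temp[k] accumulates sum over m > k of R[k, m] * L[i, m]
        temp = L[i, i] * R[:i, i]
        for k in range(i - 1, -1, -1):
            L[i, k] = -temp[k] * L[k, k]
            temp[:k] += L[i, k] * R[:k, k]
    return L


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    R = np.triu(rng.standard_normal((5, 5))) + 4.0 * np.eye(5)
    print(np.allclose(transposed_inverse(R), np.linalg.inv(R).T))
```

Row $i$ of $L$ only needs column data of $R$ and the diagonal of $L$, which is what makes the column-per-processor pipeline possible.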

Assume that we are given $n$ linearly connected processors $P_i$ ($1 \le i \le n$) arranged from left to right.
Let the $i$-th processor $P_i$ store initially the $i$-th column of $R$ ($R_{1:i,i}$); at the end of the computations, it
will have the $i$-th column of $L$ ($L_{i:n,i}$). The parallel version of the algorithm might look as follows:

    L_{i,i} = 1 / R_{i,i}
    temp_{1:i-1} = L_{i,i} · R_{1:i-1,i}
    send temp_{1:i-1} to P_{i-1}    (if i > 1)
    For k = i+1 : n
        receive temp_{1:i} from P_{i+1}    (if i < n)
        L_{k,i} = -temp_i · L_{i,i}
        temp_{1:i-1} = temp_{1:i-1} + L_{k,i} · R_{1:i-1,i}
        send temp_{1:i-1} to P_{i-1}    (if i > 1)
    end

Processor $P_i$ computes $L_{i,i}$, calculates $i - 1$ components of the intermediate vector $temp$, $temp_{1:i-1} = L_{i,i} R_{1:i-1,i}$,
and sends them to the left neighbor $P_{i-1}$. Next $P_i$ receives $temp_{1:i}$ from the right neighbor
$P_{i+1}$, computes $L_{i+1,i}$ and updates $temp_{1:i-1}$, and so on. The order of computing $L_{i:n,i}$ in $P_i$ is shown in
Figure 5.3.

Figure 5.3. Order of Computing $L$ ($= R^{-T}$) in Parallel

The  transposed  inverse  L  is  lower  triangular  and  can  be  conveniently  stored  together  with  the  upper 
triangular  matrix  R,  forming  a  square  matrix.  The  parallel  algorithm  for  computing  the  transposed  inverse 
of  a  triangular  matrix  can  be  easily  implemented  in  both  the  block  mapping  and  the  wrap  mapping. 


6. Transputer and Occam

The INMOS transputer is a high-speed parallel processor which combines processing, memory and
interconnect in a single VLSI chip. It contains 4 inter-processor links which provide high-speed communication.
The INMOS T800/20 is a 32-bit transputer with a floating point unit that can perform
floating point operations concurrently with the processor, sustaining a rate of 1.5 Mflops [2].

A  link  between  two  transputers  is  implemented  by  connecting  link  interfaces  on  the  transputers  by 
two  one-directional  signal  wires,  along  which  data  is  transmitted  serially.  Messages  are  transmitted  as  a 
sequence  of  bytes,  each  of  which  must  be  acknowledged  before  the  next  is  transmitted. 

The test system has 8 transputers on two boards in a Mac IIx. Each board contains up to four
transputers and has one INMOS C004 programmable link switch designed to provide a full crossbar switch
between 32 input links and 32 output links. A message has to pass through the C004 switch once for
on-board communication, and twice for off-board communication. We configured the transputers as a
ring and measured the communication time for this configuration. We measured the time for sending a
64-bit message around the ring of 8 transputers 1000 times. Then the average time for the total of 8000
communications was recorded as $T_{comm}$. We also measured the execution time for the loop,

    For i = 1 : 8000
        y = x · y + z
    end

where all operations are double-precision (64-bit) floating point operations. In order to decrease the
overhead caused by the loop, the loop was modified by repeating 10 multiplications and additions within
the loop. The average computation time was recorded as $T_{comp}$. The ratio of the communication time to
the computation time of the system is

$$ \frac{T_{comm}}{T_{comp}} = \frac{8.944\ \mu\text{sec}}{2.120\ \mu\text{sec}} \approx 4.219. $$

Communication is thus about 4.219 times as expensive as computation in the
transputer system. The communication speed plays an important role in the execution time of parallel
algorithms. By reducing the communication requirements of the algorithms, even at the expense of
increasing the computational work load, we may reduce the total execution time of parallel algorithms.

The main programming language supported by transputers is Occam. All programs were written
in Occam. Occam is designed to support explicit hardware concurrency and it reflects the concurrency
found in the transputer [9]. Although implementations of most modern programming languages are
available for transputers, concurrency in transputer systems is most effectively supported by Occam.


7.  Implementation  of  Hyperbolic  Householder  Transformation 

In this section, we present a possible implementation of a hyperbolic Householder transformation. The variant
we consider is related to that suggested in [10].

We start with a definition of a hyperbolic Householder transformation. An $n \times n$ hyperbolic Householder
matrix has the form

$$ H = \Phi - \frac{2 v v^T}{v^T \Phi v} \qquad (7.1) $$

where $\Phi = \mathrm{diag}(\pm 1)$ and $v$ is any vector for which $v^T \Phi v \ne 0$.

In our recursive up-downdating $\Phi$ is of the form

$$ \Phi = \mathrm{diag}(1, I_p, -I_q) $$

where the middle $I_p$ corresponds to rank-$p$ updating and the last $-I_q$ corresponds to rank-$q$ downdating.
Let $c$ be a vector such that $\|c\|_\Phi^2 = c^T \Phi c$ is positive. Then the choice $v = c \mp \|c\|_\Phi e_1$ guarantees
$Hc = \pm \|c\|_\Phi e_1$, i.e., $H$ can compress the hyperbolic norm of the vector $c$ into a selected component of $c$.
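The compression property can be checked numerically for a matrix of the form (7.1). The sketch below is an illustration under an explicit assumption: it forms the reflection vector as $v = \Phi c + \mathrm{sign}(c_1) \|c\|_\Phi e_1$, for which a direct calculation gives $Hc = -\mathrm{sign}(c_1) \|c\|_\Phi e_1$ and $H \Phi H = \Phi$. The factor $\Phi$ on $c$ and the sign choice are assumptions of this sketch, and the function name is hypothetical.

```python
import numpy as np


def hyperbolic_householder(c, phi):
    """Build H = Phi - 2 v v^T / (v^T Phi v) so that H c = alpha * e_1.

    `phi` is the diagonal of Phi (entries +1 or -1, with phi[0] == +1).
    Returns (H, alpha) with |alpha| equal to the hyperbolic norm of c.
    """
    c = np.asarray(c, dtype=float)
    phi = np.asarray(phi, dtype=float)
    norm2 = c @ (phi * c)                  # c^T Phi c
    if phi[0] != 1.0 or norm2 <= 0.0:
        raise ValueError("need phi[0] == 1 and a positive hyperbolic norm")
    sigma = np.sqrt(norm2)
    s = 1.0 if c[0] >= 0.0 else -1.0       # sign chosen to avoid cancellation
    v = phi * c                            # assumption: v is built from Phi c
    v[0] += s * sigma
    H = np.diag(phi) - 2.0 * np.outer(v, v) / (v @ (phi * v))
    return H, -s * sigma


if __name__ == "__main__":
    phi = np.array([1.0, 1.0, 1.0, -1.0])  # one row of R, p = 2 updates, q = 1 downdate
    c = np.array([3.0, 1.0, 2.0, 1.5])
    H, alpha = hyperbolic_householder(c, phi)
    print(H @ c, alpha)
```

The same matrix applied to the remaining columns carries out one step of the combined up-downdating.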

Let  us  consider  the  1-th  column  of  Eq.(3.1.L)  in  the  fc-th  modification  step.  From  (7.1),  we  have  that 

/Hi.:  \  /R4,:\  (  Ri':  ~2Vl  Mkl  /V*‘V  \ 

X.  ’  J  =  Hyj  •  I  X:.:  =  X;.:-2  v2:p^MK/v*4v  (7.2) 

V  /  \  Y:.:  )  \  -Y:.:  +  2vp+2;p+^lAfi:/v*i  v  ) 

where  Mi:  =  vr  •  (  Rf  . ,  X*". ,  Y?>  f . 

Let

$$ S_{kk} = R_{k,k}^2 + \|X_{:,k}\|^2 - \|Y_{:,k}\|^2, \qquad u = \sqrt{S_{kk}}, $$

and

$$ S_{kj} = R_{k,k} R_{k,j} + \sum_{i=1}^{p} X_{i,k} X_{i,j} - \sum_{i=1}^{q} Y_{i,k} Y_{i,j}. $$

According to the definition of $v$, $v_1 = c_1 \pm \|c\|_\Phi = R_{k,k} \pm u$. $\tilde R_{k,:}$ can be represented as follows.

(7.3)


^  - 2 -2(R»  * 1‘^)  ■ 


Then,  using  the  above  result, 

X:;  =  X:>j  -  2v3:,+l 


">L  =  x,_Iif±l(R4,-Rl,) 


and 


V 

Mki 


Y: .  =  Y: .  -  2  v,+3:p+<+1^  =  Y:,  -  -ty  (H4t:  -  Rk.:) 
By  defining  V3.p+f+i  =  ▼3sf^t-*.i/vi,  (7.2)  can  be  represented  as  follows: 


Hi,:  \ 

f  =f  Su/'k  \ 

x„  = 

Xv>  -  »3:r+1  (Rk^-Rk,:) 

Y:^  / 

\  ~Y:t:  +  Vp+2.p+t_t.i(R*i:  -  Rk^)  / 

(7.4) 


$\tilde R_{k,:}$ is computed directly from (7.3). By forming the component $\tilde R_{k,:}$ first, the other components, $\tilde X_{:,:}$
and $\tilde Y_{:,:}$, can be formed in a straightforward way.


8.  Implementations 

We  implemented  the  three  algorithms  for  up-downdating  on  the  transputer  array.  Figures  8.1,  8.2  and 
8.3  show  how  the  data  is  distributed  in  the  transputer  array  for  Algorithms  A,  B  and  C,  respectively.  The 
figures  show  snapshots  for  the  cases  of  a  16  x  16  data  matrix,  4  processors  and  up-downdating  of  order  2. 
The  darkly  shaded  area  represents  data  for  which  corresponding  computations  are  finished  and  no  more 
changes  will  occur.  The  data  in  the  lightly  shaded  area  represents  active  data  which  is  operated  on.  The 
white  area  represents  data  that  will  be  operated  on. 

Figure 8.1a shows the typical block mapping of the algorithm. The computations start from the leftmost
node and propagate to the right. The other processors receive transformations, send them to the right
and update their own data according to (7.4). The rightmost processor only receives the transformations
from the left. The block mapping of Algorithm A has the simplest communication scheme. However, the
load balance of the block mapping among the nodes is not as good as that of the wrap mapping. The wrap
mapping of Algorithm A is shown in Figure 8.1b.

Algorithm  B  operates  on  the  triangular  matrix  and  its  inverse.  The  transposed  inverse  matrix  is  placed 
at  the  bottom  of  the  triangular  matrix.  The  computation  load  is  perfectly  balanced  among  processors. 
The block mapping of Algorithm B has a simpler communication strategy than the wrap mapping.

Each processor in Algorithm C computes its own partial products of $-R^{-T}X^T$ and $R^{-T}Y^T$ of (3.3.6)
based on its own partial data $R^{-T}$, $X$ and $Y$. All partial products in all processors have to be added to
construct the elements of $-R^{-T}X^T$ and $R^{-T}Y^T$. The nodes send their own partial products to the right
neighbors, receive the others’ from the left, and then update their own partial products with the received
partial products. Then the nodes send the received partial products to the right neighbor, receive new
partial products from the left and update their partial products again. After repeating $p - 1$ times, all of

a) Block Mapping    b) Wrap Mapping

Figure 8.2. Snapshots of Algorithm B (n = 8, p = 4 and k = 2)


a) Block Mapping    b) Wrap Mapping

Figure 8.3. Snapshots of Algorithm C (n = 8, p = 4 and k = 2)

Table 9.1. Timing Results of Algorithm A (msecs)
(? = unreadable digit, [illegible] = unreadable entry)

a) Block Mapping

Size of Data                  1 Proc        2 Proc        4 Proc        8 Proc
16 x 16   1 col up&dn          5.888         4.928         3.584         3.008
          2 col up&dn          8.960         7.168         5.184         4.096
          4 col up&dn         15.040        11.712         8.256         6.400
32 x 32   1 col up&dn         23.104        19.264        12.?52         8.?20
          2 col up&dn         35.008        28.?64        17.792        11.712
          4 col up&dn         58.816        47.552        28.800        18.560
48 x 48   1 col up&dn    [illegible]   [illegible]   [illegible]   [illegible]
          2 col up&dn         77.312        63.360        37.568        23.104
          4 col up&dn        130.176       105.088        61.248        37.120
64 x 64   1 col up&dn         90.816        75.392        47.488        27.008
          2 col up&dn        138.112       112.256        69.?68        38.720
          4 col up&dn        232.768       185.728       114.560        62.656
80 x 80   1 col up&dn        140.800       115.968        71.?60        40.384
          2 col up&dn        214.336       173.888   [illegible]        58.560
          4 col up&dn        361.728       288.192       174.?00        95.040

b) Wrap Mapping

Size of Data                  1 Proc        2 Proc        4 Proc        8 Proc
16 x 16   1 col up&dn          5.888         6.784         4.288         3.392
          2 col up&dn          8.960        10.240         6.208         4.736
          4 col up&dn         15.040        16.960        10.112         7.424
32 x 32   1 col up&dn         23.104        26.368        14.592         9.152
          2 col up&dn         35.008        40.064        21.696        13.184
          4 col up&dn         58.816        67.392        35.776        21.312
48 x 48   1 col up&dn    [illegible]   [illegible]        30.52?   [illegible]
          2 col up&dn         77.312        86.528        45.696        26.368
          4 col up&dn        130.176       146.240        75.904        43.008
64 x 64   1 col up&dn         90.816       101.312        54.592        30.144
          2 col up&dn        138.112       155.648        82.432        44.608
          4 col up&dn        232.768       256.920       137.984        73.472
80 x 80   1 col up&dn        140.800       156.160        81.920        44.928
          2 col up&dn        214.336       240.832       124.160        66.944
          4 col up&dn        361.728       410.176       208.512       111.040

the nodes collect products of $-R^{-T}X^T$ and $R^{-T}Y^T$. Once the elements of $-R^{-T}X^T$ and $R^{-T}Y^T$ are
computed, no more communication is required. Now, each processor computes Givens rotations based on
$-R^{-T}X^T$ and hyperbolic Givens rotations based on $R^{-T}Y^T$, and applies the rotations to $R^{-T}$ as well as to
$-R^{-T}X^T$ and $R^{-T}Y^T$. This part of the computation is purely sequential. The snapshots are illustrated in
Figure 8.3, where the leftmost box represents $-R^{-T}X^T$ and the next shows $R^{-T}Y^T$. The wrap mapping
of Algorithm C exhibits better load balancing than the block mapping.


9.  Results  and  Discussions 

We  implemented  the  three  algorithms  and  measured  the  execution  time  on  an  8  transputer  array. 
Tables  9.1,  9.2  and  9.3  show  the  time  results  of  Algorithms  A,  B  and  C,  respectively.  We  tested  the 
algorithms  for  several  sets  of  data  with  one,  two  and  four  column  up-downdatings. 

Table 9.2. Timing Results of Algorithm B (msecs)
(? = unreadable digit, [illegible] = unreadable entry)

a) Block Mapping

Size of Data                  1 Proc        2 Proc        4 Proc        8 Proc
16 x 16   1 col up&dn         10.176         6.784         4.288         ?.392
          2 col up&dn         15.616        10.240         6.208         4.736
          4 col up&dn         26.624        ?6.960        10.112         7.424
32 x 32   1 col up&dn         41.152        26.368        14.592         9.152
          2 col up&dn         63.808        40.064        21.696   [illegible]
          4 col up&dn    [illegible]        67.392        35.776        21.312
48 x 48   1 col up&dn    [illegible]   [illegible]   [illegible]   [illegible]
          2 col up&dn        143.232        86.528        45.696        26.368
          4 col up&dn        245.504       146.240        75.904        43.008
64 x 64   1 col up&dn        165.568       101.312        54.592   [illegible]
          2 col up&dn        257.920       155.648        82.432   [illegible]
          4 col up&dn        442.560       256.920       137.984   [illegible]
80 x 80   1 col up&dn        257.472       156.160        81.920        44.928
          2 col up&dn        402.048       240.832       124.160        66.944
          4 col up&dn        691.136       410.176       208.512       111.040

b) Wrap Mapping

Size of Data                  1 Proc        2 Proc        4 Proc        8 Proc
16 x 16   1 col up&dn         10.176         7.104         5.056         4.160
          2 col up&dn         15.616        10.688         7.360         5.696
          4 col up&dn         26.624        17.856        11.904         8.768
32 x 32   1 col up&dn         41.152        26.688        16.064   [illegible]
          2 col up&dn         63.808        40.896        23.808        16.064
          4 col up&dn    [illegible]        69.184        39.104   [illegible]
48 x 48   1 col up&dn    [illegible]   [illegible]        32.576        21.5?8
          2 col up&dn        143.232        88.192        48.832        30.784
          4 col up&dn        245.504       149.888        81.408        49.472
64 x 64   1 col up&dn        165.568        99.712        56.512        34.560
          2 col up&dn        257.920       154.240        85.760        50.432
          4 col up&dn        442.560   [illegible]       144.192        82.304
80 x 80   1 col up&dn        257.472       153.024        84.992   [illegible]
          2 col up&dn        402.048       237.568       129.728   [illegible]
          4 col up&dn        691.136       406.848       219.264       121.152

The timing results of the algorithms are compared in Figures 9.1, 9.2 and 9.3 for 32 x 32 and 64 x 64
data matrices. The results show that Algorithm A is usually the fastest since it requires the minimum
number of operations. The block mapping of Algorithm A is better for small data matrices, small numbers
of up-downdating columns and large numbers of processors. On the other hand, wrap mapping is better
for large data matrices, large numbers of up-downdating columns and small numbers of processors. For
example, if each node has a light computation load, block mapping is preferable. In the case of a heavy
load, however, wrap mapping is preferable.
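Speedup and parallel efficiency are convenient summaries of such timings. The sketch below computes them for one legible row of Table 9.1 (Algorithm A, 64 x 64 data, 1 column up-downdating); the helper name `speedup_and_efficiency` is hypothetical, and speedup is taken relative to the 1-processor time in the same row.

```python
def speedup_and_efficiency(times, procs):
    """Return [(p, speedup, efficiency)] relative to the first (1-processor) time."""
    t1 = times[0]
    return [(p, t1 / t, t1 / (t * p)) for p, t in zip(procs, times)]


if __name__ == "__main__":
    procs = [1, 2, 4, 8]
    # Times in msecs read from Table 9.1, 64 x 64, 1 col up&dn.
    rows = {
        "block": [90.816, 75.392, 47.488, 27.008],
        "wrap": [90.816, 101.312, 54.592, 30.144],
    }
    for name, times in rows.items():
        for p, s, e in speedup_and_efficiency(times, procs):
            print(f"{name:5s} p={p}: speedup {s:.2f}, efficiency {e:.2f}")
```

The same helper can be applied to any other complete row of Tables 9.1 to 9.3.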

Algorithm  B  operates  on  the  triangular  matrix  and  its  transposed  inverse.  It  uses  direct  matrix-vector 
multiplications  to  obtain  the  solution  of  the  triangular  system.  The  number  of  multiplications  in  Algorithm 
B  is  twice  that  of  Algorithm  A,  so  it  is  usually  slower  than  Algorithm  A.  The  block  mapping  of  Algorithm 
B  always  has  better  performance  than  wrap  mapping,  since  the  computation  load  is  perfectly  balanced 
among  processors  for  both  mappings,  and  the  former  has  a  simpler  communications  scheme. 

Table 9.3. Timing Results of Algorithm C (msecs)
(? = unreadable digit, [illegible] = unreadable entry)

a) Block Mapping

Size of Data                  1 Proc        2 Proc        4 Proc        8 Proc
16 x 16   1 col up&dn          7.872         6.734         ?.056         4.160
          2 col up&dn         15.616        13.696        10.368         8.768
          4 col up&dn         33.600        29.504        23.680        20.544
32 x 32   1 col up&dn         27.200   [illegible]        ?6.384        11.776
          2 col up&dn         56.576   [illegible]        32.704        24.128
          4 col up&dn        116.480   [illegible]        70.784        54.208
48 x 48   1 col up&dn    [illegible]   [illegible]   [illegible]   [illegible]
          2 col up&dn        122.112       101.744        67.008        45.3?2
          4 col up&dn        245.504       213.248       139.648        98.944
64 x 64   1 col up&dn    [illegible]        94.080        58.624        37.?84
          2 col up&dn        214.080       171.688       113.856        73.408
          4 col up&dn        428.806       361.280       237.504       156.416
80 x 80   1 col up&dn        172.992   [illegible]   [illegible]   [illegible]
          2 col up&dn        329.984   [illegible]   [illegible]   [illegible]
          4 col up&dn        657.152   [illegible]   [illegible]   [illegible]

b) Wrap Mapping

Size of Data                  1 Proc        2 Proc        4 Proc        8 Proc
16 x 16   1 col up&dn          7.872         5.760         4.672         4.544
          2 col up&dn         15.616   [illegible]         9.664         9.280
          4 col up&dn         33.600        25.728        22.272        21.504
32 x 32   1 col up&dn         27.200        19.392        13.184        11.264
          2 col up&dn         56.576        38.208        26.560        22.912
          4 col up&dn        116.480        80.832        59.008        51.776
48 x 48   1 col up&dn    [illegible]   [illegible]   [illegible]   [illegible]
          2 col up&dn        122.112        77.568        49.600        39.680
          4 col up&dn        245.504       158.208       107.456        87.936
64 x 64   1 col up&dn    [illegible]        66.944        41.408        30.272
          2 col up&dn        214.080       129.024        81.152        60.096
          4 col up&dn        428.806       268.416       174.336       130.944
80 x 80   1 col up&dn        172.992   [illegible]        60.544        42.304
          2 col up&dn        329.984       194.624       117.952        83.648
          4 col up&dn        657.152       400.896       247.168       180.672

The construction of $-R^{-T}X^T$ and $R^{-T}Y^T$ in Algorithm C imposes a large communication overhead
since each processor needs its own copy of $-R^{-T}X^T$ and $R^{-T}Y^T$, and this requires additional transfer
of data among processors. Each processor needs $p - 1$ communication and addition steps to acquire its own
copy of the products. In addition, the algorithm uses Givens and hyperbolic Givens rotations instead
of hyperbolic Householder reflections. Therefore, it requires more multiplications than the other two
algorithms. Householder transformations have almost the same number of multiplications as additions,
and the number of additions in Givens rotations is half the number of multiplications. Algorithm C needs
slightly more multiplications than Algorithm B. But the number of additions required by Algorithm C
is almost half of the number of multiplications, and much less than the number of additions required by
Algorithm B. Thus, Algorithm C is faster than Algorithm B for small numbers of up-downdating columns
though more multiplications are required. The wrap mapping of the algorithm is even faster than that of
Algorithm A for 1 column up-downdatings with 2 processors. The algorithm performs poorly for
large numbers of up-downdating columns since each node has to modify $-R^{-T}X^T$ and $R^{-T}Y^T$ as well as
portions of the matrix inverse.

A buffer between two processors is necessary to smooth communication among nodes. The transputer
system does not provide hardware buffers. However, a buffer can be simulated in software. A
software buffer differs from a hardware buffer in that communication between a software buffer and
a process is accomplished through a software channel, which is a memory-read or memory-write
operation. One software buffer is inserted between every two processors. Figure 9.4 shows the performance of
the algorithms without buffers. Note that processors in Algorithm C do not need buffers since they
do not have to communicate to modify their own data. Clearly, the buffered implementations have better
performance than the unbuffered ones, especially in the wrap mapping of Algorithm A.

Time (msecs)    Time (msecs)

a) n = 32    b) n = 64

Figure 9.1. Comparison of Algorithms for 1 column up-downdatings

a) n = 32    b) n = 64

Figure 9.2. Comparison of Algorithms for 2 column up-downdatings

The communication speed can be altered in the transputer array. The algorithms were initially run
with a 20 Mbits/sec transfer rate on each channel. In addition, they were run with a 10 Mbits/sec transfer
rate. When the link speed is 10 Mbits/sec, the ratio of the communication time to the computation time
becomes

$$ \frac{T_{comm}}{T_{comp}} = \frac{14.380\ \mu\text{sec}}{2.120\ \mu\text{sec}} \approx 6.783. $$

Time (msecs)

a) n = 32    b) n = 64

Figure 9.3. Comparison of Algorithms for 4 column up-downdatings

Figure 9.4. 2 column up-downdatings without buffer for n = 32


Comparisons of elapsed times for the algorithms when the link speed is 10 Mbits/sec are presented
in Figure 9.5, which is compared with Figure 9.3b. The time difference between the wrap mappings of
Algorithms A and C using 2 processors becomes slightly larger than that of Figure 9.3b. The block
mapping of Algorithm A has better performance than the wrap mapping using 8 processors, since the wrap
mapping requires more communication.

Number of Processors

Figure 9.5. 1 column up-downdatings with 10 Mbits/sec for n = 64
10.  Future  Work 

We implemented the three algorithms, measured their elapsed times, and compared the performance
results on the transputer array. Time analyses of the algorithms will be presented
at a later date. The propagation and truncation errors resulting from the different algorithms will be
monitored and analyzed. Finally, the algorithms will be implemented on a two-dimensional transputer
array, and a comparison of the timing results obtained with the different algorithms will be presented.

Abstract

A new generalization of the singular value decomposition (SVD), the hyperbolic SVD,
is advanced, and its existence established under mild restrictions. The hyperbolic SVD
accurately and efficiently finds the eigenstructure of any matrix that is expressed as the
difference of two matrix outer products. Signal processing applications where this task
arises include the covariance differencing algorithm for bearing estimation in sensor arrays,
sliding rectangular windowing, and array calibration. Two algorithms for effecting
this decomposition are detailed. One is sequential and follows a similar pattern to the
sequential bidiagonal based SVD algorithm. The other is for parallel implementation and
mimics Hestenes' SVD algorithm. Numerical examples demonstrate that, like its conventional
counterpart, the hyperbolic SVD exhibits superior numerical behavior relative to
explicit formation and solution of the normal equations. Furthermore, the hyperbolic SVD
applies in problems where the conventional SVD cannot be employed.

1.  INTRODUCTION 

The singular value decomposition (SVD) is a tool of both practical and theoretical importance
in digital signal processing. The SVD of an $n \times m$ complex valued matrix $A$ is
given by [8]:

$$A = U S V^\dagger$$

where $S$ is an $n \times m$ diagonal matrix with non-negative diagonal (with the entries ordered
from largest to smallest), $U$ is an $n \times n$ unitary matrix², $V$ is an $m \times m$ unitary matrix
and $\dagger$ denotes Hermitian conjugation in case of a complex valued matrix and simple
transposition in case of a real valued matrix.

1 A more comprehensive description of the algorithmic aspects of this decomposition can be found in
"The Hyperbolic Singular Value Decomposition and Applications", to be published in the IEEE ASSP.

2 A square matrix $U$ is unitary if it satisfies $UU^\dagger = U^\dagger U = I$, the identity matrix.

The SVD provides a solution to the following problem:

P1 Given a matrix $A$, find the eigenvalues and eigenvectors of $AA^\dagger$.

The SVD can thus be interpreted as follows: the eigenvalues of $AA^\dagger$ are the squares of
the entries of $S$ (note that all the eigenvalues of $AA^\dagger$ are real and non-negative), and
the columns of $U$ are the corresponding eigenvectors. We get the eigenvectors of $A^\dagger A$
"for free" in the columns of $V$. For numerical reasons it proves more accurate not to
explicitly form the product $AA^\dagger$, but rather to perform an algorithm on the data matrix
$A$ directly³ [8]. The outer product $AA^\dagger$ arises in the normal equations encountered (among
other places) in various estimation and adaptive filtering problems. Further details on
the normal equations and the role of the SVD in their analysis and solution are found in
[11], and in [8].
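
As a quick numerical illustration of P1, the following minimal sketch (using NumPy; the random test matrix and all variable names are assumptions made for illustration, not taken from the paper) compares the squared singular values of a real matrix with the eigenvalues of its explicitly formed outer product:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 5))          # hypothetical 3 x 5 real data matrix

# SVD computed on the data matrix directly
U, s, Vh = np.linalg.svd(A)

# Eigenvalues of the explicitly formed outer product A A^T (ascending order)
lam = np.linalg.eigvalsh(A @ A.T)

# The squared singular values are the eigenvalues of A A^T ...
squares_match = np.allclose(np.sort(s**2), lam)
# ... and the columns of U are the corresponding eigenvectors
eigvecs_match = np.allclose((A @ A.T) @ U, U * s**2)
```

For a well conditioned matrix such as this one the two routes agree; the footnoted example shows how they can part ways in limited precision.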

Consider  now  the  following  (related)  problem: 


P2 Given two $n \times m$ matrices $A_1$, $A_2$, find the eigenvectors and eigenvalues of
$A_1 A_1^\dagger - A_2 A_2^\dagger$.

We would like to solve P2 without forming the outer products and subtracting them
explicitly. To this end we introduce a new generalization of the ordinary SVD which we
call the hyperbolic singular value decomposition or HSVD. Just as the ordinary SVD was
initially "designed" to allow efficient solution of P1, our HSVD is designed to efficiently
solve P2. Again the primary motivation is numerical accuracy. In addition, as we shall
see, the HSVD can be implemented on a parallel machine. The HSVD actually solves the
following slightly more general version of P2:

P3 Given a matrix $A$ and a matrix $\Phi$ that is diagonal with ±1 on the diagonal,
find the eigenvalues and the eigenvectors of $A \Phi A^\dagger$.

P2 is indeed a special case of P3, which is easily made evident by setting:

$$A = [\, A_1 \;\; A_2 \,], \qquad \Phi = \begin{bmatrix} I & 0 \\ 0 & -I \end{bmatrix}.$$
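
The reduction of P2 to P3 can be checked numerically. The sketch below is illustrative only: the matrix sizes, the random data, and the variable names are assumptions. It stacks two matrices side by side, uses a signature with +1 for the first block and -1 for the second, and compares the result with the explicit covariance difference.

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 3, 4
A1 = rng.standard_normal((n, m))
A2 = rng.standard_normal((n, m))

A = np.hstack([A1, A2])                            # A = [A1 A2]
phi = np.concatenate([np.ones(m), -np.ones(m)])    # diagonal of the signature matrix

lhs = (A * phi) @ A.T                              # A Phi A^T (phi scales the columns of A)
rhs = A1 @ A1.T - A2 @ A2.T                        # explicit difference of outer products
same = np.allclose(lhs, rhs)
```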


3 It is not advisable to form outer products of matrices explicitly. Suppose we are working in fixed
point, with an accuracy of three digits after the decimal point, and

$$A = \begin{bmatrix} 5.015 & -4.995 \\ -4.995 & 5.015 \end{bmatrix}.$$

Then the eigenvalues of $A$ are {10.01 and .02} and $A$ is not singular. Square $A$ to get:

$$AA^\dagger = \begin{bmatrix} 50.1002 & -50.0998 \\ -50.0998 & 50.1002 \end{bmatrix}.$$

But with the limited (three digit) accuracy:

$$AA^\dagger = \begin{bmatrix} 50.100 & -50.100 \\ -50.100 & 50.100 \end{bmatrix}.$$

Thus the computed $AA^\dagger$ is singular, while the unlimited accuracy one isn't. The singular values, computed
as the square roots of the eigenvalues of $AA^\dagger$, will of course be, not {10.01 and .02}, but {10.01 and 0}.
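
The footnoted example can be replayed in a few lines. This is a sketch under one stated assumption: fixed-point storage with three digits after the decimal point is imitated by rounding the formed product with `np.round(..., 3)`. The variable names are hypothetical.

```python
import numpy as np

A = np.array([[5.015, -4.995],
              [-4.995, 5.015]])

# Working on A directly: singular values 10.01 and 0.02
sv_direct = np.linalg.svd(A, compute_uv=False)

# "Power domain": form A A^T, then keep only three digits after the decimal point
G3 = np.round(A @ A.T, 3)                  # every entry becomes +/-50.100
lam = np.linalg.eigvalsh(G3)
sv_power = np.sqrt(np.clip(lam, 0.0, None))  # clip guards against tiny negative round-off
```

The small singular value survives when `A` is processed directly, and collapses to (numerically) zero once the rounded product is used.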




Note also that P1 is likewise a special case of P3 associated with $\Phi = I$. The problem
P2  comes  up  in  at  least  three  distinct  physical  scenarios.  One  is  the  downdating  problem, 
another  (the  one  that  initially  caught  our  interest)  is  the  so-called  covariance  differencing 
problem,  and  a  third  is  array  calibration.  Note  that  the  data  in  these  problems  can  be 
complex  [11],  and  therefore  that  the  solution  must  work  for  both  real  and  complex  data. 

In downdating, an estimation or filtering problem (the solution of which involves the
SVD) is solved for a string of data. Anticipating subtle changes in the data and in
the solution, the older data is expected to be outdated, and should be discarded⁴. The
columns of $A_2$ in P2 are then a subset of the columns of $A_1$, and the resulting difference
is actually non-negative definite. This simplifies the problem somewhat, and indeed there
are two efficient algorithms for solving the problem by serial computation. One approach
is to form the Cholesky factor of the covariance matrix using the hyperbolic Householder
scheme [20] and then proceed using the conventional SVD. The second approach is the
one described in [13], which is tailored to single column downdating. In contrast, in the
event that a parallel computer is used, the HSVD algorithm is the only one we know of
for this problem.

The covariance differencing problem arises in high resolution bearing estimation in unknown
noise fields [1], [14], [16], [18], [19], [24], [25], [26], [27]. Bearing estimation in
unknown coherent noise is a topic which has received a great deal of attention recently, as
evidenced by the large number of references cited above. The failure of standard eigenstructure
schemes (such as MUSIC, ESPRIT, etc.) in the presence of coherent noise fields
was demonstrated by Bienvenue in 1979 [1]. A solution, based on covariance differencing,
was presented by Paulraj and Kailath [16] in 1986, and independently by Tuteur and
Rockah [24]. Since then, variants have been proposed [14], [18], and order determination
and performance analysis issues addressed [25], [26], [27]. A general overview of the
bearing estimation problem is available in [15].

The key idea behind this bearing estimation scheme is as follows. Suppose that the noise
remains the same (in statistical average) between $A_1$ and $A_2$, but that the signals of
interest change. Then the covariance difference will contain only a residual amount of
noise, but the signals will still be present. Details as to how the two snapshot matrices
$A_1$, $A_2$ are formed, and how parameter estimates are found from the eigenstructure of the
difference covariance, are found in the above references.

Despite  the  interest  in  covariance  differencing  for  coherent  noise  bearing  estimation,  no 
(prior)  published  work  has  addressed  computational  issues. 

A  related  problem  is  array  calibration.  Suppose  we  have  an  array  which  (due  to  component 
errors,  or  other  physical  considerations)  has  an  unknown  array  response.  We  are  hampered 
in  our  efforts  to  measure  the  response  by  ambient  noise.  An  (offline)  solution,  proposed  in 
[23],  is  as  follows.  First  form  a  covariance  matrix  from  the  array  with  no  sources  present. 
Then turn on a calibration source at bearing angle $\theta$ and frequency $\omega$, and form the new
array covariance. Next find the principal eigenvector of the covariance difference. This
eigenvector should be an accurate estimate of the array response at the specified $\theta$, $\omega$
co-ordinates.

4 In the adaptive filtering community this procedure is often called sliding rectangular windowing [11].

The structure of this paper is as follows. The hyperbolic decomposition is introduced in
§2. We also establish that the HSVD is canonic in the sense that it exists subject only to
mild restrictions on the attendant $A$ and $\Phi$ matrices. In §3 and §4 we describe
both sequential and parallel algorithms for implementing the HSVD. A simple numerical
example of the sequential algorithm is detailed in §5, where we also explore the numerical
behavior of the HSVD using numerical experiments. We use simulations from a realistic
covariance differencing based bearing estimation application as well as a more artificial
example selected for its analytic tractability. Both cases demonstrate the anticipated
superior numerical behavior of both sequential and parallel HSVD as contrasted to direct
outer product formation.

2.  THE  HYPERBOLIC  SVD 

We  now  state  and  prove  the  fundamental  existence  theorem  for  the  hyperbolic  SVD. 

Theorem: Let $\Phi$ be an $m \times m$ diagonal matrix with entries ±1, and let $A$
be an $n \times m$ matrix such that $A \Phi A^\dagger$ is non-degenerate in the sense that
$\mathrm{rank}(A \Phi A^\dagger) = \min\{n, m\}$⁵. Then there exists an $n \times n$ unitary matrix $U$,
and an $m \times m$ matrix $V$ with

$$V^\dagger \Phi V = \hat{\Phi} \qquad (1)$$

where $\hat{\Phi}$ is a diagonal matrix with entries ±1 (possibly different from $\Phi$), and
an $n \times m$ diagonal matrix $D$ with positive real diagonal entries, such that

$$A = U D V^\dagger. \qquad (2)$$

Following the nomenclature in [20] we call a matrix $W$ satisfying $W^\dagger \Phi W = \Phi$ a hypernormal
matrix. A matrix satisfying the more general condition in (1) will be called a
hyperexchange matrix. Note that a hyperexchange matrix and a hypernormal matrix are
related by the equation $WP = V$ where $P$ is a permutation matrix such that $P \hat{\Phi} P^\dagger = \Phi$.
Note also that "hyperexchangivity" is partially preserved under multiplication; in particular,
if $V^{(1)}$ is hyperexchange with respect to $\Phi^{(1)}$ such that $V^{(1)\dagger} \Phi^{(1)} V^{(1)} = \Phi^{(2)}$,
and if $V^{(2)}$ is hyperexchange with respect to $\Phi^{(2)}$, such that $V^{(2)\dagger} \Phi^{(2)} V^{(2)} = \Phi^{(3)}$, then
$V^{(2)\dagger} V^{(1)\dagger} \Phi^{(1)} V^{(1)} V^{(2)} = \Phi^{(3)}$, i.e., $V^{(1)} V^{(2)}$ is obviously hyperexchange with respect to $\Phi^{(1)}$.

As mentioned above we call (2) the hyperbolic singular value decomposition of $A$. As a
special case when $\Phi = I$ we obtain the ordinary SVD. For real data the proof remains
valid, and the Hermitian operator $\dagger$ reverts to the ordinary transpose operator.

The columns of the matrix $U$ of the hyperbolic SVD of $A$ are the eigenvectors we seek,
and the diagonal entries of $D \hat{\Phi} D^\dagger$ are the eigenvalues we seek.

5 This result can be extended for rank deficient matrices. However, though this extension is not trivial,
it is mostly of theoretical value, as "real world" matrices are not rank deficient (due to noise at least).
For details see [22].

It is worth noting at this point some of the properties of the hyperexchange
matrices.

• The hyperexchange matrix always has an inverse. This inverse bears an interesting
relation to the matrix's Hermitian transpose. To see this consider the hyperexchange
matrix $V$ in (1), and multiply both sides of the equation by the matrix $\hat{\Phi}$, to obtain,

$$\hat{\Phi} V^\dagger \Phi V = I.$$

It is directly apparent that the inverse of $V$ (which is unique) is given by,

$$V^{-1} = \hat{\Phi} V^\dagger \Phi \qquad (3)$$

and that it always exists.

• The singular values of a hyperexchange matrix have an interesting structure. The
singular values come in pairs of reciprocals, i.e., if $\sigma$ is a singular value of a hyperexchange
matrix $V$, then so is $1/\sigma$. This is a direct consequence of (3) because
while the singular values of reciprocal matrices are reciprocals of each other, those of
transpose matrices are identical (to within the number of zeros, which is irrelevant
here), and by (3) it can be seen that $V^{-1}$ and $V^\dagger$ have the same singular values.

• The eigenvalues of a hyperexchange matrix don't in general have any structure, but
if it is a hypernormal matrix (recall that this means that $\Phi$ and $\hat{\Phi}$ in (1) are equal), then
the eigen-structure is similar to the singular-structure. More precisely, if $\lambda$ is an
eigenvalue of $V$, then so is $1/\lambda^*$. This property was already discussed in [20], where
it was also proven, but it can also be easily seen from (3), for if $\Phi$ and $\hat{\Phi}$ are equal,
then $V^\dagger$ and $V^{-1}$ are orthogonally similar, and have the same eigenvalues.
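
These properties are easy to confirm on the simplest hyperexchange matrix, a 2 x 2 hyperbolic rotation. The sketch below is only an illustration; the angle `t` and the variable names are arbitrary choices, not values from the paper.

```python
import numpy as np

t = 0.7                                      # arbitrary hyperbolic angle
V = np.array([[np.cosh(t), np.sinh(t)],
              [np.sinh(t), np.cosh(t)]])
Phi = np.diag([1.0, -1.0])

Phi_hat = V.T @ Phi @ V                      # equals Phi here, so V is hypernormal
V_inv = Phi_hat @ V.T @ Phi                  # inverse according to equation (3)

sing = np.linalg.svd(V, compute_uv=False)    # singular values e^t and e^-t
eigs = np.linalg.eigvals(V)                  # eigenvalues, also e^t and e^-t for this V
```

Both the singular values and the eigenvalues form a reciprocal pair, and `V_inv @ V` reproduces the identity without any explicit matrix inversion.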

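An existence proof of the hyperbolic singular value decomposition follows. It is not
a straightforward extension of the existence proof of the SVD. Rather it has its own
interesting features.

Proof of the Hyperbolic SVD Existence Theorem: For (2) to hold $U$ must satisfy:

$$A \Phi A^\dagger = U D \hat{\Phi} D^\dagger U^\dagger. \qquad (4)$$

We can find such a $U$ by diagonalizing the left hand side of (4), i.e. $U$ is a
unitary matrix such that:

$$U^\dagger (A \Phi A^\dagger) U = \Lambda \qquad (5)$$

where $\Lambda$ is a diagonal matrix whose entries are eigenvalues of the Hermitian
matrix $A \Phi A^\dagger$. $U$ always exists since Hermitian⁶ matrices are always diagonalizable
by unitary transforms [8]. Let the elements of $\Lambda$ be ordered,

$$|\lambda_1| \ge |\lambda_2| \ge \cdots \ge |\lambda_n|.$$

If $m < n$, $|\lambda_{m+1}| = \cdots = |\lambda_n| = 0$. Let

$$D = \mathrm{diag}(|\lambda_1|^{1/2}, |\lambda_2|^{1/2}, \ldots, |\lambda_m|^{1/2})$$

and

$$\hat{\Phi} = \mathrm{diag}(\mathrm{sgn}(\lambda_1), \mathrm{sgn}(\lambda_2), \ldots, \mathrm{sgn}(\lambda_m)),$$

where $\mathrm{sgn}(x) = \pm 1$ is the sign of $x$. Then if $V = A^\dagger U D^{-1}$, $A = U D V^\dagger$ and

$$V^\dagger \Phi V = D^{-1} U^\dagger A \Phi A^\dagger U D^{-1} = \hat{\Phi}.$$

This completes the proof for the case $m < n$.

If $m > n$ let

$$\bar{D} = \mathrm{diag}(|\lambda_1|^{1/2}, \ldots, |\lambda_n|^{1/2})$$

and

$$\bar{\Phi} = \mathrm{diag}(\mathrm{sgn}(\lambda_1), \ldots, \mathrm{sgn}(\lambda_n))$$

so $\Lambda = \bar{D} \bar{\Phi} \bar{D}$. Let

$$D = [\, \bar{D} \;\; 0 \,]. \qquad (6)$$

6 A matrix $M$ is Hermitian if it satisfies $M^\dagger = M$; for a real matrix that is equivalent to $M$ being
symmetric.

To establish (2) it suffices to show that for some hyperexchange matrix $V$,

$$A^\dagger A = V D^\dagger U^\dagger U D V^\dagger = V \begin{bmatrix} \bar{D}^2 & 0 \\ 0 & 0 \end{bmatrix} V^\dagger. \qquad (7)$$

Let us partition $V = [\, \bar{V} \;\; \tilde{V} \,]$, and set

$$\bar{V} = A^\dagger U \bar{D}^{-1}, \qquad (8)$$

then (7) is satisfied.

Eq (1) can be restated in the following manner:

$$(a)\ \bar{V}^\dagger \Phi \bar{V} = \bar{\Phi}, \qquad (b)\ \bar{V}^\dagger \Phi \tilde{V} = 0, \qquad (c)\ \tilde{V}^\dagger \Phi \tilde{V} = \tilde{\Phi}, \qquad (9)$$

where $\hat{\Phi}$, $\bar{\Phi}$ and $\tilde{\Phi}$ are related by:

$$\hat{\Phi} = \begin{bmatrix} \bar{\Phi} & 0 \\ 0 & \tilde{\Phi} \end{bmatrix}.$$

(9)(a) follows directly from (4), (6) and (8).

Note that since by assumption $A \Phi A^\dagger$ is full rank (when $m > n$), so is $A$. By
the way in which $\bar{V}$ is defined in (8) both $\bar{V}$ and $\Phi \bar{V}$ are full rank. If $\tilde{V}$ is
chosen to lie in the exactly $m - n$ dimensional subspace orthogonal to $\Phi \bar{V}$, i.e.
$\tilde{V} \in (\Phi \bar{V})^\perp$, then (9)(b) is satisfied.

Let $Q$ be an orthogonal matrix that spans $(\Phi \bar{V})^\perp$, then $\tilde{V}$ can be decomposed
as $\tilde{V} = QX$ with $X$ some $(m-n) \times (m-n)$ matrix. To assure satisfaction of (9)(c), all
we must show is that a matrix $X$ exists, so that $X^\dagger Q^\dagger \Phi Q X = \tilde{\Phi}$. This rests
on the fact that $Q^\dagger \Phi Q$ is non-singular, which we prove by contradiction⁷.

Suppose that $Q^\dagger \Phi Q$ were singular, then a vector $y \ne 0$ would exist such that
$Q^\dagger \Phi Q y = 0$. This implies that $\Phi Q y$ lies in the space orthogonal to $Q$, i.e., in
the space spanned by $\Phi \bar{V}$. Thus there exists a vector $z \ne 0$ so that

$$\Phi Q y = \Phi \bar{V} z.$$

Multiply both sides on the left by $\bar{V}^\dagger$ to get,

$$\bar{V}^\dagger \Phi Q y = \bar{V}^\dagger \Phi \bar{V} z. \qquad (10)$$

The left hand side of (10) is clearly zero, which implies that $z = 0$ as well,
which is a contradiction.

7 An alternate proof can be found in [7] where $\Phi$ norms are studied as a particular case of indefinite
scalar products.

Having established that $Q^\dagger \Phi Q$ is nonsingular, we can diagonalize it by an
orthogonal matrix $Y$:

$$Y^\dagger Q^\dagger \Phi Q Y = \Delta.$$

$\Delta$ of course has no zero diagonal entries, and we can therefore let

$$\tilde{D} = \mathrm{diag}(|\delta_1|^{1/2}, \ldots, |\delta_{m-n}|^{1/2})$$

and

$$\tilde{\Phi} = \mathrm{diag}(\mathrm{sgn}(\delta_1), \mathrm{sgn}(\delta_2), \ldots, \mathrm{sgn}(\delta_{m-n}))$$

and $\Delta = \tilde{D} \tilde{\Phi} \tilde{D}$. We choose $X = Y \tilde{D}^{-1}$, and $\tilde{V} = QX$. Then,

$$\tilde{V}^\dagger \Phi \tilde{V} = \tilde{D}^{-1} Y^\dagger Q^\dagger \Phi Q Y \tilde{D}^{-1} = \tilde{\Phi},$$

which completes the proof of the HSVD existence theorem.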
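
The proof is constructive, so it can be turned into a small numerical sketch for real matrices with m > n. This is not one of the algorithms of §3 or §4: it forms the outer product explicitly, which the paper advises against for accuracy, and serves only to check the statement of the theorem. The function name `hsvd_by_construction`, the random test data, and the use of an SVD to obtain the basis Q are assumptions made for the illustration.

```python
import numpy as np

def hsvd_by_construction(A, phi):
    """Real HSVD of an n x m matrix A (m > n) w.r.t. diag(phi), following the proof."""
    n, m = A.shape
    Phi = np.diag(phi)

    # (5): diagonalize A Phi A^T, eigenvalues ordered by decreasing magnitude
    lam, U = np.linalg.eigh(A @ Phi @ A.T)
    order = np.argsort(-np.abs(lam))
    lam, U = lam[order], U[:, order]
    d_bar, phi_bar = np.sqrt(np.abs(lam)), np.sign(lam)

    # (8): V_bar = A^T U D_bar^{-1} (column-wise scaling)
    V_bar = (A.T @ U) / d_bar

    # Q: orthonormal basis of the orthogonal complement of span(Phi V_bar)
    W, _, _ = np.linalg.svd(Phi @ V_bar, full_matrices=True)
    Q = W[:, n:]

    # Diagonalize Q^T Phi Q = Y Delta Y^T, then V_tilde = Q Y D_tilde^{-1}
    delta, Y = np.linalg.eigh(Q.T @ Phi @ Q)
    V_tilde = (Q @ Y) / np.sqrt(np.abs(delta))

    V = np.hstack([V_bar, V_tilde])
    D = np.hstack([np.diag(d_bar), np.zeros((n, m - n))])
    phi_hat = np.concatenate([phi_bar, np.sign(delta)])
    return U, D, V, phi_hat

rng = np.random.default_rng(2)
A = rng.standard_normal((3, 5))                     # hypothetical test matrix
phi = np.array([1.0, 1.0, 1.0, -1.0, -1.0])
U, D, V, phi_hat = hsvd_by_construction(A, phi)
```

With these outputs, `U @ D @ V.T` reproduces `A`, and `V.T @ np.diag(phi) @ V` is the diagonal signature `np.diag(phi_hat)`, as (2) and (1) require.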


3.  A  SEQUENTIAL  ALGORITHM  FOR  HYPERBOLIC  SVD 

The serial hyperbolic SVD algorithm, outlined below, mimics the classic SVD procedure
[8]. The main difference is in the use of the hyperbolic Householder transform described
in [20] and the hyperbolic Givens rotation, described in [3]. Both are here further perfected
for the matter at hand. The main difference between the transforms and rotations
described previously and the ones used (and described) here, is in the concept of the exchange.
Thus if (for reasons that become apparent later) a transform or a rotation cannot
be completed as expected, the elements of the matrices involved are exchanged in a way
that preserves pertinent properties, but enables the algorithm to proceed.

3.1. The Hyperbolic Householder Transform and the Hyperbolic Givens Rotation

The original Householder transformation [10] of a (column) vector $v$ involves finding an
orthonormal matrix $M$ so that

$$Mv = \pm \|v\| e_1, \qquad (11)$$

where $\|v\| = \sqrt{v^\dagger v}$ and $e_1$ is the unit vector with the first element one and all the rest
zeros. This can be viewed as compressing all the vector's energy into the first entry. It
is easily verified that a matrix $M$ given by $M = I - 2bb^\dagger/b^\dagger b$, where $b = v \mp \|v\| e_1$,
satisfies (11). The hyperbolic Householder transform will take on a similar form, and a
signature matrix $\Phi$ has to be specified as well as the vector $v$. We will then expect $H$ to
be hypernormal (see definition after equation (2)), and $Hv = \pm \|v\|_\Phi e_1$ where,

$$\|v\|_\Phi = \mathrm{sgn}(v^\dagger \Phi v) \sqrt{|v^\dagger \Phi v|}. \qquad (12)$$

Note that despite the notation $\|v\|_\Phi$ is not a norm, because norms are always non-negative.

The natural thing to try would be to let $b = v \mp \|v\|_\Phi e_1$ and $H = \Phi - 2bb^\dagger/b^\dagger \Phi b$. This was
done in [20] and $H$ was defined only for $v$'s and $\Phi$'s for which $v^\dagger \Phi v > 0$ and thus $\|v\|_\Phi$
is well defined. But in order to be useful for HSVD a matrix $H$ should be obtainable for
any pair $\Phi$ and $v$.

In order to achieve this generality, look more carefully into what actually happens when
a vector is transformed. If $v$ denotes the original vector and $\tilde{v}$ denotes the transformed
vector, then we expect two things:

• $\tilde{v}^\dagger \Phi \tilde{v} = v^\dagger \Phi v$, and

• $\tilde{v}$ has the form $\pm \|v\|_\Phi e_1$; this can be viewed as compressing all its hyperbolic "energy"
into its first entry.

It turns out that the two conditions cannot generally be met simultaneously, because the
sign of
$(\|v\|_\Phi e_1)^\dagger \Phi (\|v\|_\Phi e_1) = \|v\|_\Phi^2 \Phi(1,1)$ is determined by $\mathrm{sgn}(\Phi(1,1))$ and is independent of
the sign of $\|v\|_\Phi$. Suppose for the moment that $\|v\|_\Phi \ne 0$, then a solution springs to mind:
why insist on $e_1$? Why not choose another unit vector $e_k$, or in other words, why not
try to compress the hyperbolic "energy" into the $k$th entry, where $\Phi(k,k)$ is the "right"
sign, i.e., it equals $-1$ if $\|v\|_\Phi < 0$ and $+1$ otherwise. Alternately, permute $\Phi$ and $v$,
exchanging the $k$th and the 1st entries, in the manner described in the pseudo computer
code below:

if $\mathrm{sgn}(\|v\|_\Phi) \ne \Phi(1,1)$
then
    find a $k$ so that $\mathrm{sgn}(\|v\|_\Phi) = \Phi(k,k)$
    permute entries 1 and $k$ in $v$, and entries (1,1) and $(k,k)$ in $\Phi$
end if
$b = v \mp \|v\|_\Phi e_1$

The resulting matrix $H$ is a hyperexchange matrix with respect to $\Phi$. This hyperbolic
Householder scheme is used mainly in the introductory part of bidiagonalizing the data
matrix⁸, see subsection 3.2.
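
The exchange step can be sketched for real vectors as follows. Two points are assumptions of this sketch rather than statements of the paper: the reflector is applied in the form G = I - 2 b bᵀΦ / (bᵀΦb), i.e. the H of the text multiplied on the right by Φ, which makes both G v = α e₁ and GᵀΦG = Φ directly checkable; and the sign of α is chosen opposite to that of the first entry to avoid cancellation. The function name `hyperbolic_householder` is hypothetical.

```python
import numpy as np

def hyperbolic_householder(v, phi):
    """Return (G, v_perm, phi_perm) with G @ v_perm = alpha * e_1 and
    G.T @ diag(phi_perm) @ G = diag(phi_perm), after the exchange step."""
    v = np.asarray(v, dtype=float).copy()
    phi = np.asarray(phi, dtype=float).copy()
    q = v @ (phi * v)                        # v^T Phi v, the signed squared Phi-norm
    if q == 0.0:
        raise ValueError("zero Phi-norm: the transform is undefined")
    if np.sign(q) != phi[0]:
        # exchange: bring an entry whose signature matches sgn(q) to position 1
        k = int(np.flatnonzero(phi == np.sign(q))[0])
        v[[0, k]] = v[[k, 0]]
        phi[[0, k]] = phi[[k, 0]]
    r = np.sqrt(abs(q))
    alpha = -r if v[0] >= 0 else r           # sign chosen to avoid cancellation in b
    b = v.copy()
    b[0] -= alpha                            # b = v - alpha * e_1
    beta = b @ (phi * b)                     # b^T Phi b (nonzero by the choice of alpha)
    G = np.eye(len(v)) - 2.0 * np.outer(b, phi * b) / beta
    return G, v, phi

G, v_perm, phi_perm = hyperbolic_householder([1.0, 2.0, 3.0], [1.0, 1.0, -1.0])
```

In this example vᵀΦv = -4 is negative while Φ(1,1) = +1, so entries 1 and 3 are exchanged before the reflector is built; `G @ v_perm` then has all of its hyperbolic "energy" in the first entry.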

8 This extension of the hyperbolic Householder transform for nonpositive normed vectors was also
developed (independently) by Cybenko [5] in a different context.

For use in the main part of the serial HSVD scheme another hyperbolic extension of a
classic algorithm will be introduced: the hyperbolic Givens rotation. The original
Givens rotation [8] is a two by two matrix that is similar to the Householder transformation
for a length two vector. Let $v$ be a length two real column vector, $v^\dagger = [v_1\ v_2]$, and
let

$$G = \begin{bmatrix} c & s \\ -s & c \end{bmatrix} \qquad (13)$$

where $s = v_2/\sqrt{v_1^2 + v_2^2}$ and $c = v_1/\sqrt{v_1^2 + v_2^2}$. Then $Gv = \|v\| [1\ 0]^\dagger$, and $G^\dagger G = I$, or
otherwise phrased $(Gv)^\dagger (Gv) = v^\dagger v$ and $Gv$ is proportional to $e_1$. Now suppose we are
interested in finding a matrix $H$ so that $Hv$ is proportional to $e_1$, and $(Hv)^\dagger \hat{\Phi} (Hv) = v^\dagger \Phi v$,
where $\Phi$ and $\hat{\Phi}$ equal $\pm \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}$. Again, as in the hyperbolic Householder transform, we
have to allow $\Phi \ne \hat{\Phi}$. In order to achieve that we note that $s$ and $c$ in (13) are the sine
and cosine of $\tan^{-1}(v_2/v_1)$. The hyperbolic Givens rotation will naturally use the hyperbolic
trigonometric functions sinh and cosh. Thus, given $v^\dagger = [v_1\ v_2]$ and $\Phi = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}$, and
supposing $\|v\|_\Phi \ne 0$, the required matrix will be given by:

$$H = \begin{bmatrix} c_h & -\sigma s_h \\ -\sigma s_h & c_h \end{bmatrix} \qquad (14)$$

with

$$s_h = v_2/\sqrt{v_1^2 - v_2^2}, \quad c_h = v_1/\sqrt{v_1^2 - v_2^2}, \quad \text{and} \quad \sigma = \mathrm{sgn}(\|v\|_\Phi\, \Phi(1,1)).$$

Note that $s_h$ and $c_h$ are indeed the hyperbolic sine and cosine of $\tanh^{-1}(v_2/v_1)$. Note
also that although all the references above were to column vectors, and that the matrices
were employed from the left, the above is trivially translated to row vectors with matrices
employed on the right.
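
For comparison, here is a minimal sketch of both rotations for real length-two vectors. The hyperbolic version covers only the case |v₁| > |v₂| with Φ = diag(1, -1), where no exchange is needed and Φ̂ = Φ; this restriction, and the function names `givens` and `hyperbolic_givens`, are assumptions of the sketch.

```python
import numpy as np

def givens(v):
    """Orthogonal rotation G with G @ v = [||v||, 0]."""
    r = np.hypot(v[0], v[1])
    c, s = v[0] / r, v[1] / r
    return np.array([[c, s], [-s, c]])

def hyperbolic_givens(v):
    """Hyperbolic rotation H with H @ v = [sqrt(v1^2 - v2^2), 0], for |v1| > |v2|."""
    v1, v2 = v
    if abs(v1) <= abs(v2):
        raise ValueError("this sketch only handles |v1| > |v2| (no exchange)")
    r = np.sqrt(v1 * v1 - v2 * v2)
    ch, sh = v1 / r, v2 / r                  # cosh and sinh of atanh(v2 / v1)
    return np.array([[ch, -sh], [-sh, ch]])

Phi = np.diag([1.0, -1.0])
G = givens(np.array([3.0, 4.0]))
H = hyperbolic_givens(np.array([5.0, 3.0]))
```

`G` preserves the Euclidean length, while `H` preserves the indefinite form: `H.T @ Phi @ H` equals `Phi`.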


At this point a problem that is common both to the hyperbolic Householder and to the
hyperbolic Givens algorithms should be addressed: what happens when $\|v\|_\Phi = 0$? The
answer is that both procedures per se fail (see [5] for some implications of this problem).
But when put in the context of the whole HSVD algorithm a mechanism for recovery
exists, and will be described subsequently.


What we rely upon in recovering from a situation of $\|v\|_\Phi = 0$ is that the hyperbolic
Householder and the hyperbolic Givens are applied to whole matrices, not merely to isolated
column or row vectors, and that while hyperbolic transformations are applied to the
columns, orthogonal transformations may be performed on the rows. To demonstrate the
recovery suppose we attempt to transform $A$ into a bidiagonal matrix using an orthogonal
matrix on the left and hyperbolic Householder transformations on the right (the overall
HSVD procedure is composed of just such tasks). We start by attempting to "squeeze"
all of $a_1$ into its first element. We employ the hyperbolic Householder procedure, but to
our chagrin discover that $\|a_1\|_\Phi = 0$. A solution can be found by using $a_2$ in the following
way:

If we choose an angle $\theta$ and let $c = \cos\theta$ and $s = \sin\theta$ then the block diagonal matrix

$$Q = \begin{bmatrix} c & -s & \\ s & c & \\ & & I \end{bmatrix}$$

is orthogonal. Its effect on $A$ is to replace $a_1$ and $a_2$ by rotated combinations of the two,
which we denote $\tilde{a}_1$ and $\tilde{a}_2$.

Now calculate the $\Phi$-norm squared of $\tilde{a}_1$:

$$|\, \|\tilde{a}_1\|_\Phi \,|^2 = |\tilde{a}_1^\dagger \Phi \tilde{a}_1| = |s^2 a_2^\dagger \Phi a_2 + 2cs\, \mathrm{Re}(a_1^\dagger \Phi a_2)|.$$

Since $a_2^\dagger \Phi a_2$ and $\mathrm{Re}(a_1^\dagger \Phi a_2)$ are just real numbers, we can find $\theta$ such that
$\|\tilde{a}_1\|_\Phi = \mathrm{sgn}(\tilde{a}_1^\dagger \Phi \tilde{a}_1) \sqrt{|\tilde{a}_1^\dagger \Phi \tilde{a}_1|} \ne 0$, thereby recovering from the initial "trap" of the vanishing
norm. The choice of $\theta$ is rather unlimited (so long as the new norm isn't zero as well),
but we might for instance choose to maximize the resulting norm.
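
The recovery can be tried on a small real example. The sketch below assumes the sign convention ã₁ = c a₁ + s a₂, chosen so that it matches the sign of the cross term in the expression above; the vectors, the signature, and the brute-force grid search over θ are illustrative assumptions, not the authors' procedure.

```python
import numpy as np

phi = np.array([1.0, -1.0, 1.0, 1.0])       # diagonal of the signature matrix
a1 = np.array([1.0, 1.0, 0.0, 0.0])         # a1^T Phi a1 = 0: the "trap"
a2 = np.array([2.0, 0.0, 1.0, 0.0])

p = a2 @ (phi * a2)                          # a2^T Phi a2
r = a1 @ (phi * a2)                          # a1^T Phi a2

def phi_norm_sq(theta):
    """Signed squared Phi-norm of c*a1 + s*a2, valid because a1 has zero Phi-norm."""
    c, s = np.cos(theta), np.sin(theta)
    return s * s * p + 2.0 * c * s * r

# Pick the angle that maximizes the magnitude of the new norm on a coarse grid
thetas = np.linspace(0.0, np.pi, 1801)
best = thetas[np.argmax(np.abs(phi_norm_sq(thetas)))]

c, s = np.cos(best), np.sin(best)
a1_new = c * a1 + s * a2
direct = a1_new @ (phi * a1_new)             # same quantity, computed directly
```

After the rotation the first vector has a clearly nonzero Φ-norm, so the hyperbolic Householder step can proceed.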


3.2.  The  Serial  HSVD  Algorithm 

The best available serial algorithm for the full SVD is the bidiagonalization based scheme
due to Golub and Kahan [8]. Our serial HSVD algorithm borrows heavily from Golub and
Kahan's original algorithm. We assume that the reader is familiar with this algorithm
(to which we will refer as the "classical" algorithm), and will just note where ours differs.

The first step is to bidiagonalize the matrix $A$, i.e., find unitary matrices $U$ and $V$ so that
$U A V^\dagger$ is bidiagonal. This is classically achieved through the use of Householder matrices.
Our algorithm requires a unitary matrix on the left, but a hyperexchange matrix on the
right; we use Householder and hyperbolic Householder matrices respectively to build these
matrices. We will call the bidiagonal matrix "$B$". If the matrix $A$ was not square, $B$ will
have columns of zeros. If only the eigenvalues and eigenvectors of $A \Phi A^\dagger$ are of interest,
not the HSVD of $A$ per se⁹, $A \Phi A^\dagger$ can be summarized as $\bar{B} \bar{\Phi} \bar{B}^\dagger$ where $\bar{B}$ is the matrix
composed of the first n rows of $B$, and $\bar{\Phi}$ is the $n \times n$ top left hand corner of $\hat{\Phi}$.

We note that the matrix $B$ (or $\bar{B}$) can now be made into a real matrix with diagonal
unitary (and thus also hypernormal) matrices on the right and on the left. Once this is
achieved we proceed with a hyperbolic version of the QR algorithm, which we call the
hyperbolic QR algorithm. It differs again from the classical version by using unitary
matrices on the left and hyperexchange matrices on the right.

An important point is that the proofs of the theorems used, the implicit Q theorem and
the convergence of the QR algorithm, do not depend on the positive definiteness of the
composite matrix ($AA^\dagger$), nor on how it was formed. Thus they are applicable for $A \Phi A^\dagger$
as well.

9 As is true for all applications considered by us.

4. A SYSTOLIC ALGORITHM FOR HYPERBOLIC SVD


Our parallel algorithm for the hyperbolic SVD is an adaptation of a well known biorthogonalization
technique developed by Hestenes [9] for computing the singular value decomposition
of an $n \times m$ matrix $A$. The method is known as a one-sided Jacobi method.

The one-sided (Hestenes) method can be modified to carry out the singular value decomposition
of a matrix, $A = U S V^\dagger$. The technique finds a unitary matrix $U$ such that
$A^\dagger U = V S^\dagger$ has orthogonal columns, i.e., $U^\dagger A$ has orthogonal rows.

If we insist that $U^\dagger A$ be hypernormal with respect to the matrix $\Phi$, then $(U^\dagger A) \Phi (U^\dagger A)^\dagger$
will give the eigenvalues of $A \Phi A^\dagger$, the precise quantities we were interested in in the first
place. The process of finding $U$ is iterative and proceeds by constructing a sequence of
matrices $A^{(k)}$, $k = 0, 1, \ldots$, with $A^{(k+1)} = J^{(k)} A^{(k)}$; here $A^{(0)} = A$ and $J^{(k)}$, $k = 0, 1, \ldots$, are
plane rotations operating on pairs of rows of $A$. Angles of rotations are chosen in such
a way that, for a single rotation, the resulting rows become hypernormal. By applying
rotations to all different pairs of rows in a sweep, and iterating the sweeps, the limit
matrix $\lim_{k \to \infty} A^{(k)}$ becomes hypernormal.
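
To make the sweep structure concrete, the following sketch implements the conventional one-sided (Hestenes) Jacobi iteration on rows, i.e. the special case Φ = I with ordinary orthogonal rotations. It is not the hyperbolic variant described in the paper; the function name `hestenes_rows`, the tolerance, the sweep limit, and the test matrix are assumptions of the sketch.

```python
import numpy as np

def hestenes_rows(A, tol=1e-12, max_sweeps=30):
    """Orthogonalize the rows of a real matrix by cyclic plane rotations."""
    B = np.array(A, dtype=float)
    n = B.shape[0]
    for _ in range(max_sweeps):
        rotated = False
        for i in range(n - 1):
            for j in range(i + 1, n):
                alpha, beta, gamma = B[i] @ B[i], B[j] @ B[j], B[i] @ B[j]
                if abs(gamma) <= tol * np.sqrt(alpha * beta):
                    continue                      # rows i and j already orthogonal
                rotated = True
                zeta = (beta - alpha) / (2.0 * gamma)
                sign = 1.0 if zeta >= 0 else -1.0
                t = sign / (abs(zeta) + np.sqrt(1.0 + zeta * zeta))  # tangent of the angle
                c = 1.0 / np.sqrt(1.0 + t * t)
                s = c * t
                bi, bj = B[i].copy(), B[j].copy()
                B[i] = c * bi - s * bj            # rotation in the (i, j) plane
                B[j] = s * bi + c * bj
        if not rotated:
            break                                  # a full sweep without any rotation
    return B

rng = np.random.default_rng(3)
A = rng.standard_normal((4, 6))
B = hestenes_rows(A)
sv_jacobi = np.sort(np.linalg.norm(B, axis=1))[::-1]   # row norms = singular values
```

At convergence the rows of `B` are mutually orthogonal and their norms are the singular values of `A`; in the hyperbolic method the rotations are chosen instead so that the rows become orthogonal with respect to Φ.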

Parallel implementations of the algorithm are based on earlier works on one-sided Jacobi
methods [2]. The key observation is that as long as rotations operate in different planes,
they are independent and can be executed by different processing units all at the same
time; such a simultaneous transformation is called a parallel rotation. The success of
this approach heavily depends on the ordering of rotations in a sweep. There are various
strategies for choosing the order of rotations, or pivot rows, in the sweep. The choice
is influenced by the communication geometry directly supported by the target computer
architecture. In several studies it has been shown that there exist strategies which are
well suited for linear array, ring and hypercube topologies. It turns out that one can
essentially mimic the Hestenes sweep selection process for hyperbolic rotations.
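
One common way to organize a sweep into parallel rotations is the round-robin ("circle") ordering, in which each step pairs up all rows into disjoint planes. The sketch below shows that ordering for an even number of rows; it is offered as an example of such a strategy, not as the specific ordering used by the authors, and the function name `round_robin_sweep` is hypothetical.

```python
def round_robin_sweep(n):
    """Return n-1 steps, each a list of n/2 disjoint row pairs, covering every pair once.

    Assumes n is even.
    """
    idx = list(range(n))
    steps = []
    for _ in range(n - 1):
        # pair the first with the last, the second with the second to last, ...
        steps.append([tuple(sorted((idx[i], idx[n - 1 - i]))) for i in range(n // 2)])
        # keep position 0 fixed and rotate the remaining indices by one place
        idx = [idx[0]] + [idx[-1]] + idx[1:-1]
    return steps

steps = round_robin_sweep(8)
```

Each of the 7 steps for 8 rows contains 4 rotations in different planes, which can be assigned to 4 processing units; together the steps visit all 28 row pairs exactly once.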

5.  SIMULATION  RESULTS 

We coded both the sequential and the linear systolic array versions of the hyperbolic SVD
scheme and applied them to the square root covariance differencing task. The primary
purpose of the simulations is to explore the numerical accuracy of our new algorithms, as
compared to direct, explicit formation of covariance matrices followed by differencing,
and finally eigenanalysis. We shall refer to this latter approach as the "power domain"
scheme, since it demands the explicit formation, storage, and processing of squared-data,
that is power-like, quantities.

We conducted two numerical experiments. The first of these involved the example furnished
in the original paper on covariance differencing by Paulraj and Kailath [16]. There
is little point in detailing this example here; the interested reader can consult [16] for
specifics. We used 8 sensors, 3 signals at -30°, -12°, and -15°, and 50° of rotation.
We retained the same noise field as in [16]. However, to better differentiate numerical
errors from statistical fluctuations, we increased the number of snapshots from 300 to one
thousand, giving matrices of size 8 x 1000. (Spot checks for 300 snapshots assured us that
this had no effect on the relative stability of either method.) When the signal power levels
used were as per [16] the problem is well conditioned, and both the systolic and the power
domain schemes work well with a 64-bit floating point number system¹⁰. To produce
ill conditioning for the rather extended precision of our computer language we increased
the dynamic range of the signals to 80 dB, by increasing the first signal's amplitude, and
leaving the rest alone. This led to a breakdown in the power domain method, but not in
either of the two new methods based on the hyperbolic SVD. More specifically, all three
methods accurately identified the first signal. However the next two signals were identified
correctly using the HSVD method (with an error of 1° and 4° respectively) while the
power domain gave estimates off by 5° and 41°. An error of a few degrees (≈ 1 to 4) is
within the range of statistical fluctuations for this estimation task, while an error of 41°
intimates numerical collapse.

In  the  Paulraj/Kailath  example  exact  assessment  of  numerical  errors  is  impossible  since 
the  “exact”  answer,  furnished  by  mapping  the  computations  onto  an  infinite  precision 
computer,  is  unknown.  To  avoid  this  shortcoming  the  second  experiment  we  conducted 
involved  a  covariance  difference  whose  eigenvalues  we  knew  explicitly.  This  allowed  us 
to determine exactly how much numerical error each algorithm produced. The drawback
here, in contrast to example one, is that the covariance difference no longer has a
nice  physical  interpretation,  and  does  not  (as  far  as  we  know)  correspond  to  any  known 
practical  application. 

To begin we form the n by m matrix $\Psi = [\mathrm{diag}(\lambda_1, \ldots, \lambda_n) \mid 0]$. If we define the signature
matrix $\Phi$ via $\Phi = \mathrm{diag}(\{(-1)^i, i = 0, \ldots, m-1\})$ then the eigenvalues of $\Psi \Phi \Psi^*$ are quite
clearly $\lambda_1^2, -\lambda_2^2, \ldots, (-1)^{n-1} \lambda_n^2$. Now suppose we multiply $\Psi$ on the left by any n x n unitary
matrix U and on the right by any m x m matrix V hypernormal with respect to $\Phi$ to
form the matrix A. Then $A \Phi A^* = U \Psi V \Phi V^* \Psi^* U^* = U \Psi \Phi \Psi^* U^*$ has the same eigenvalues
as $\Psi \Phi \Psi^*$ but is now a full matrix. We can readily generate random choices for U and
V as products of, respectively, Householder and hyperbolic Householder transformations.
By selecting the $\lambda_i$, we can influence the condition number of the eigenanalysis problem. As
in the first example, both hyperbolic SVD schemes lead to better numerical behavior for
a given condition number and a given wordlength. Some examples are given below.
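
The construction of such a test matrix can be sketched in a few lines of NumPy. This is only an illustrative sketch under stated assumptions: real arithmetic is used, the unitary factor comes from a QR factorization of a random matrix, and the hypernormal factor is built from elementary hyperbolic rotations on adjacent (opposite-sign) coordinates rather than from the hyperbolic Householder transformations used in the text. All function names are hypothetical.

```python
import numpy as np


def signature(m):
    """Signature matrix diag((-1)^i), i = 0, ..., m-1."""
    return np.diag([(-1.0) ** i for i in range(m)])


def random_orthogonal(n, rng):
    """Random real orthogonal matrix (Q factor of a Gaussian matrix)."""
    q, _ = np.linalg.qr(rng.standard_normal((n, n)))
    return q


def random_hypernormal(m, n_factors, rng, max_angle=0.5):
    """Product of hyperbolic rotations; satisfies V @ Phi @ V.T == Phi.

    Each factor acts on coordinates (i, i+1), whose signature entries
    always have opposite signs for Phi = diag((-1)^i).
    """
    v = np.eye(m)
    for _ in range(n_factors):
        i = rng.integers(0, m - 1)          # i in 0 .. m-2
        t = rng.uniform(-max_angle, max_angle)
        h = np.eye(m)
        h[i, i] = h[i + 1, i + 1] = np.cosh(t)
        h[i, i + 1] = h[i + 1, i] = np.sinh(t)
        v = v @ h
    return v


def build_test_matrix(lams, m, n_factors, rng):
    """Return (A, Phi) with A = U [diag(lams) | 0] V."""
    n = len(lams)
    psi = np.hstack([np.diag(lams), np.zeros((n, m - n))])
    phi = signature(m)
    u = random_orthogonal(n, rng)
    v = random_hypernormal(m, n_factors, rng)
    return u @ psi @ v, phi


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a, phi = build_test_matrix([3.0, 2.0, 1.0], m=5, n_factors=4, rng=rng)
    print(np.sort(np.linalg.eigvalsh(a @ phi @ a.T)))
```

With well-scaled values such as 3, 2, 1 the eigenvalues of the explicitly formed product come out close to 9, -4 and 1; spreading the diagonal entries over many orders of magnitude is what makes the explicitly formed product ill conditioned.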

The  preliminary  simulations  were  conducted  using  MATLAB™  for  which  relative  pre¬ 
cision  €  is  2'4*.  For  a  given  data  matrix  A  =  £/' tVk  we  constructed  the  corresponding 
“covariance” matrix $A \Phi A^*$. We chose $\Psi = \mathrm{diag}(10^8, 10^4, 1)$, and generated the hypernormal
matrix $V_k$ as a product of k, k = 1, 2, 3, 4, 6, random hyperbolic Householder matrices.
Note that the condition number of $A \Phi A^*$ is $10^{16}$, which is comparable to the reciprocal of
the relative precision used in the computations. We computed the eigenvalues of $A \Phi A^*$
via the hyperbolic Hestenes method, on the original data matrix A. Next we repeated the
computation via the two-sided Jacobi method, which operated on $A \Phi A^*$.

Let us denote the exact eigenvalues of $A \Phi A^*$ as $\lambda_i$, the computed eigenvalues by the Hestenes
method as $\lambda_i^H$, and by the Jacobi method as $\lambda_i^J$.

10 The simulations were conducted on an IBM-PC(TM) using MATLAB(TM). (All data types are 64 bits
in MATLAB.) The power domain scheme used tridiagonalisation followed by the QR method [8] to
excise the eigenstructure of the covariance difference.
Let  X  = 


and  7,-  = 


H  - 


We observed, see Table 1 below, that the hyperbolic Hestenes method always gave a better
approximation of the eigenvalues than the Jacobi method. However, the accuracy of
the hyperbolic Hestenes method was influenced by the number of terms in the product $V_k$ and
varied from simulation to simulation. This can be explained by the loss of accuracy while
computing the hyperbolic norm.


Table  1 


k 

A 

Y 

1 

Ai 

10"4 

10"9 

2 

10-4 

10-8 

3 

10-3 

10“8 

4 

Hr3 

10-8 

6 

10° 

10-4 

1 

A3 

lO"14 

10-13 

6 

10-“ 

10-n 

We also ran the QR eigenanalysis method [8] on $A \Phi A^*$ and the serial hyperbolic SVD on
A. The results did not differ markedly from the ones indicated above, so we decided not
to include them in the table above.

6.  CONCLUSIONS 

We  presented  an  extension  of  the  SVD,  which  we  called  the  hyperbolic  SVD.  Its  existence 
was  established  under  mild  restrictions.  We  derived  two  algorithms  for  effecting  this 
decomposition,  one  sequential  and  the  other  parallel. 

The  HSVD  is  indicated  whenever  one  seeks  to  evaluate  the  eigenstructure  of  a  covariance 
(outer  product)  matrix  provided: 

(i)  The  covariance  involves  a  difference  of  outer  products, 

(ii) all (or most) of the eigenvectors and eigenvalues are required,

(iii)  numerical  ill-conditioning  is  anticipated  (using  the  available  wordlength  for  a 
given  computer),  and 

(iv)  either  a  sign  indefinite  matrix  arises  or  a  parallel  implementation  is  required. 

We mentioned three signal processing applications of this new canonic decomposition,
which satisfy the above conditions, namely bearing estimation, sliding rectangular windowing,
and array calibration.

Numerical experiments demonstrated the enhanced numerical accuracy available using the
HSVD  in  contrast  to  competing  schemes.  Theoretical  backing  for  this  improved  accuracy 
remains  a  topic  for  future  investigation. 

We  feel  confident  that  there  are  many  more  applications  within  and  beyond  digital  signal 
processing  where  the  HSVD  will  be  useful  for  its  numerical  stability,  fast  computational 
characteristics,  and  as  a  theoretical  structure. 

Taylor series and the overall properties of composites¹

Abstract. We deal with the effective electrical conductivity and the effective elastic
moduli $L^*$ of multiphase, ordered or disordered composite materials. The problem
of estimation of $L^*$ can be approached through perturbation expansions around the
uniform case. Interestingly, knowledge of truncated expansions for $L^*$ leads to rigorous
bounds on this quantity. We describe a general construction which permits one to find
a hierarchy of bounds from truncated expansions for $L^*$. Our approach is motivated
by  previous  work  of  Beran  and  others.  These  bounds  give  good  estimations  for  weakly 
heterogeneous  materials,  i.e.,  composites  in  which  the  departure  from  homogeneity 
is  not  too  strong.  For  strongly  heterogeneous  materials,  such  as  conductor-insulator 
mixtures,  these  bounds  lead  to  broad  intervals  of  uncertainty.  We  thus  address  this 
problem,  and  show  that  tight  bounds  can  be  found  for  the  effective  conductivity  of 
two-phase  ordered  or  disordered  composites,  in  which  material  of  very  large  or  very 
low  conductivity  is  arranged  in  the  form  of  particles  inside  a  conducting  matrix.  These 
bounds  depend  on  two  parameters  A  and  B  which  are  related  to  the  particle  shapes  and 
the  interparticle  distances.  For  instance,  if  the  particles  are  assumed  to  be  spherical, 
the  bounds  depend  only  on  a  parameter  q  which  is  related  to  the  minimum  interparticle 
distance,  and  they  give  excellent  estimations  for  a  wide  range  of  values  of  the  parameter 
q.  In  many  cases,  they  improve  substantially  over  previous  estimations  for  the  effective 
conductivity  of  this  type  of  composites. 

Introduction.  We  deal  with  a  physical  property,  elasticity  or  conductivity,  whose 
constitutive  linear  relation  is 

f=Le.  (1) 

In the case of conductivity, L is a symmetric tensor of second order which represents
the local conductivity, and f and e are the current and electric field respectively. In
the case of elasticity, L is the Hookean fourth order tensor, and f and e are the tensors
of stress and strain respectively. In this case, products are understood as follows:

$(Le)_{ij} = L_{ijkl} e_{kl}$,


where  the  usual  summation  convention  has  been  used. 

An r-phase composite material is usually modeled by a spatially varying tensor L,
which is locally constant and assumes r different values. Similarly, in a polycrystal,

¹ Supported by the U.S. Army Research Office under grant DAAL-03-88-K-0110
the tensor L is locally constant and its values vary among all rotations of a given
fixed tensor $L_0$. The quantity of interest is then the effective tensor $L^*$ defined by (see
e.g. [2,13])

$\langle f \rangle = L^* \langle e \rangle$ (2)

where $\langle h \rangle$ stands for the average of the quantity h, and f and e are fields of stress
and strain, or current and electric field, that are compatible with the uniform applied
field $\langle e \rangle = e_0$. The averages above should be understood to be ensemble averages
if  one  deals  with  random  materials,  or  averages  over  the  unit  cell  if  one  deals  with 
periodic  materials.  Of  course,  all  the  quantities  involved  are  assumed  to  be  statistically 
stationary,  so  that  ensemble  averages  do  not  depend  on  the  point  in  space  at  which 
they  are  performed. 

The minimum energy principle states (see [2,3]) that the true field e in the composite
that is compatible with a uniform applied field $e_0$ is the one that minimizes the energy

$\langle e^* L e^* \rangle$ (3)

among all curl free trial fields $e^*$ verifying $\langle e^* \rangle = e_0$. Furthermore, the true energy in the
material due to the applied field $e_0$ is given by

$\frac{1}{2} \langle e L e \rangle = \frac{1}{2} e_0 L^* e_0$. (4)

The complementary principle of minimum energy states (see [2,3]) that the true
field f in the composite that is compatible with a uniform applied field $f_0$ is the one
that minimizes the expression

$\langle f^* L^{-1} f^* \rangle$ (5)

among all divergence free trial fields $f^*$ such that $\langle f^* \rangle = f_0$. The true energy in the
body due to the applied field is

$\frac{1}{2} \langle f L^{-1} f \rangle = \frac{1}{2} f_0 (L^*)^{-1} f_0$. (6)



Series expansions. Series expansions are central in the theory of bounds for
effective moduli. Typically, the fields e and f and/or the effective modulus $L^*$ are
expanded in series of the form

$e = \sum_{i=0}^{\infty} e_i$, (7)

$f = \sum_{i=0}^{\infty} f_i$, (8)

$L^* = \sum_{i=0}^{\infty} L_i$. (9)

The order of smallness of the quantities $e_i$, $f_i$ and $L_i$ is, roughly speaking, the i-th
power of the order of smallness of the heterogeneity. One simple way to obtain such
expansions is to consider Taylor series. Other kinds of expansions have been considered,
which lead to consideration of the n-point correlation function of the microstructure.
We will assume, however, that the expansions above are simply the Taylor series of
the various quantities. For instance, in the case of conductivity of a mixture of two
components of conductivities z and w, we take w as fixed, and consider expansions in
powers of (z - w). In the case of elastic moduli of a mixture of two isotropic components
of bulk moduli $\kappa_1$ and $\kappa_2$ and shear moduli $\mu_1$ and $\mu_2$, we take $\kappa_1$ and $\mu_1$ as fixed, and
consider expansions in powers of ($\mu_2 - \mu_1$) and ($\kappa_2 - \kappa_1$).

We note that for all i, $e_i$ is a curl free field and $f_i$ is a divergence free field, and
that, for $i \ge 1$, the mean values of $e_i$ and $f_i$ are zero. It follows (see also [8]) that the
true fields in a composite verify the equations

$\langle e_m L e \rangle = 0$ (10)

and

$\langle f_m L^{-1} f \rangle = 0$ (11)

for all $m \ge 1$.

Bounds for the effective moduli. To obtain upper bounds on a given effective
modulus, we follow Beran (see [2]) in using a truncated expansion together with variational
parameters. We choose to consider a Taylor series expansion (7). We substitute
the trial field

$e^* = e_0 + \sum_{i=1}^{k} \alpha_i e_i$ (12)

in the energy expression (3). The $\alpha_i$'s are real constants to be determined. Since (12)
is an admissible trial field, we have the upper bound

$e_0 L^* e_0 \le \sum_{i,j=0}^{k} \alpha_i \alpha_j \langle e_i L e_j \rangle$, (13)

where we have put $\alpha_0 = 1$. This is an upper bound on $L^*$ for any choice of the
parameters $\alpha_i$, $i \ge 1$. Finally, minimizing this expression over the parameters $\alpha_i$ ($i \ge 1$)
yields the Beran type upper bound for $L^*$.
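
The minimization over the variational parameters is an ordinary quadratic problem. As a minimal sketch, assume the quantities $\langle e_i L e_j \rangle$ (i, j = 0, ..., k) have been collected in a symmetric matrix whose lower-right k x k block is positive definite; the function name and the matrix layout are illustrative assumptions, not notation from the text.

```python
import numpy as np


def beran_type_bound(gram):
    """Minimize sum_{i,j} alpha_i alpha_j gram[i, j] subject to alpha_0 = 1.

    gram[i, j] is assumed to hold <e_i L e_j>. Writing the quadratic form as
    g00 + 2 b.alpha + alpha.C.alpha, the minimizer solves C alpha = -b and
    the minimum value is g00 + b.alpha.
    """
    g = np.asarray(gram, dtype=float)
    g00 = g[0, 0]
    b = g[1:, 0]
    c = g[1:, 1:]
    alpha = -np.linalg.solve(c, b)
    return g00 + b @ alpha, alpha


if __name__ == "__main__":
    bound, alpha = beran_type_bound([[2.0, -0.3], [-0.3, 0.4]])
    print(bound, alpha)
```

For k = 1 this reduces to the familiar expression g00 - b^2 / c for a single parameter.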

Of course, this Beran type bound can only be made explicit if the quantities $\langle e_i L e_j \rangle$
are known. Quantities of this type can be computed as integrals of certain statistical
functions: the n-point correlation functions (see, for example, [2] in the case of conductivity
of mixtures, [3] in the case of elastic properties of mixtures, [24] in the case
of conductivity of polycrystals). These functions are, in general, extremely difficult
to determine. For low orders, however, the quantities needed to evaluate the bounds
have been computed for cell like mixtures as introduced by Miller [20] (see, for instance
[20,11,19]).

The point we wish to establish here is that, because of equations (9), (10) and (11),
the quantities $\langle e_i L e_j \rangle$ ($1 \le i,j \le k$) can be simply read off from the first 2k + 1 terms
in the Taylor expansion of $L^*$.

To illustrate the procedure, we derive the first upper bound in the hierarchy. We
substitute a trial field of the form

$e^* = e_0 + \alpha_1 e_1$ (14)

in the variational principle (3), and obtain

$e_0 L^* e_0 \le e_0 \langle L \rangle e_0 + 2 \alpha_1 \langle e_0 L' e_1 \rangle + \alpha_1^2 ( \langle e_1 L' e_1 \rangle + \langle e_1 L_0 e_1 \rangle )$, (15)

where $L_0$ is a constant isotropic tensor, and $L' = L - L_0$ represents the oscillation of
L about $L_0$. (Note that $\langle e_0 L_0 e_1 \rangle = 0$ since $\langle e_1 \rangle = 0$.) The choice of $\alpha_1$ that minimizes
the right hand side is

$\alpha_1 = - \frac{\langle e_0 L' e_1 \rangle}{\langle e_1 L' e_1 \rangle + \langle e_1 L_0 e_1 \rangle}$. (16)

Because of equations (9), (10) and (11), one can show that

$\langle e_1 L' e_1 \rangle = e_0 L_3 e_0$, $\langle e_1 L_0 e_1 \rangle = - e_0 L_2 e_0$, (17)

and

$\langle e_0 L' e_1 \rangle = - \langle e_1 L_0 e_1 \rangle = e_0 L_2 e_0$. (18)

Therefore, the upper bound reads

$e_0 L^* e_0 \le e_0 \langle L \rangle e_0 - \frac{(e_0 L_2 e_0)^2}{e_0 L_3 e_0 - e_0 L_2 e_0}$, (19)

where $L_i$ denotes the term of order i in the Taylor expansion of $L^*$.

An explicit upper bound for different kinds of materials and physical properties
can now be obtained simply by substitution into equation (19) of the results for the
truncated expansions given in (or easily obtainable from) [1,7,15,24,11]. Lower bounds,
and bounds of higher order can be obtained in an entirely analogous manner.
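
For a scalar (isotropic) property the first bound of the hierarchy, $\langle L \rangle - L_2^2 / (L_3 - L_2)$, is easy to evaluate once the second and third Taylor coefficients are known. The sketch below treats the mean of L and the coefficients $L_2$, $L_3$ as plain numbers supplied by the user; the function names and the sample values are hypothetical and are not data from the references.

```python
def first_order_upper_bound(mean_l, l2, l3):
    """Closed-form first bound: <L> - L2^2 / (L3 - L2).

    L3 - L2 plays the role of <e_1 L e_1> and must be positive.
    """
    denom = l3 - l2
    if denom <= 0.0:
        raise ValueError("L3 - L2 must be positive")
    return mean_l - l2 ** 2 / denom


def trial_energy(alpha, mean_l, l2, l3):
    """Energy of the one-parameter trial field for a given alpha."""
    return mean_l + 2.0 * alpha * l2 + alpha ** 2 * (l3 - l2)


if __name__ == "__main__":
    mean_l, l2, l3 = 2.0, -0.3, 0.1      # illustrative numbers only
    best_alpha = -l2 / (l3 - l2)
    print(first_order_upper_bound(mean_l, l2, l3))
    print(trial_energy(best_alpha, mean_l, l2, l3))
```

Both printed values coincide, and any other choice of alpha gives a larger (weaker) upper bound.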

Strongly heterogeneous composites. Here we discuss the problem of determining
the effective electrical conductivity of two-phase, isotropic, strongly heterogeneous
mixtures. Because the constitutive relation (Ohm’s law) is linear, we can assume that
the  conductivity  of  one  of  the  components  equals  one.  The  conductivity  of  the  second 
component  will  be  denoted  by  z,  and  the  effective  conductivity  will  now  be  denoted  by 
m  =  m(z). 

It  is  a  well  known  fact  that  the  function  m  can  be  extended  as  an  analytic  function 
to  complex  values  of  z  outside  the  negative  real  axis  [4,13].  There  are  examples  of 
materials  for  which  the  function  m  is  singular  in  the  whole  negative  real  axis  (see  [12]). 

For ordered or disordered materials which consist of a matrix containing separated
inclusions, the zeroes and singularities of the function m can accumulate neither at
z = 0 nor at z = ∞. More than that, an explicit interval in the negative real axis can
be  found  depending  on  the  distribution  and  shapes  of  the  particles,  outside  which  the 
function  m  is  regular  and  nonzero  [9].  This  allows  us  to  find,  by  means  of  the  complex 
variable  method  of  Bergman  [4,21,13],  new  bounds  on  the  effective  conductivity  of 
mixtures  with  separated  inclusions.  (The  problem  of  describing  the  singularities  of  the 
function  m  is  also  connected  to  the  properties  of  absorptance  of  solar  energy  exhibited 
by  some  ceramic-metal  mixtures  [10].  Numerical  calculations  of  the  singularities  for 
periodic  arrays  have  been  conducted  [5,6,17,18].  Our  results  are  in  agreement  with 
these  numerical  calculations.) 

Consider a microgeometry which consists of a matrix containing a number of randomly
placed, separated inclusions. Associated with any such material, there are two
constants A and B which determine a region in the negative real axis where the singularities
and zeroes of m can occur [9]. The constants A and B depend on the shapes
of  the  inclusions  and  on  the  interparticle  distances.  Assuming,  for  simplicity,  equally 
sized  randomly  placed  spherical  particles,  the  constants  A  and  B  depend  on  a  single 
parameter  q,  0  <  q  <  1,  equal  to  the  ratio  of  the  particle  diameter  to  the  minimum 
distance  between  two  centers.  Values  of  A  and  B  for  arrays  of  spheres  are  given  in 
table  1,  for  the  whole  range  of  values  of  the  parameter  q. 

The bounds on m depend on the constants A and B. Explicitly, calling $p_1$ the
volume fraction of material of conductivity z, $p_2 = 1 - p_1$, and defining


$s_M = \frac{A}{1 + A}$, (20)

$s_m = \frac{1}{1 + B}$, (21)

8  —  s  «sm , 

(22) 

1  -  ZTZU 

(23) 

l  —  z  d 
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(24) 


and 


_  1 -(!-*)««-  _ (l-W _ 

'-('-K  (i  -  +  f*<?  - s") 


our  bounds  read  (see  [9]) 


W(z)  <  m(z)  <  V(z)  for  0  <  z  <  1  (25) 

and 

V(z)  <  m(z )  <  W(z)  for  z  >  1.  (26) 

The  upper  bound  in  the  case  0  <  z  <  1  and  the  lower  bound  in  the  case  z  >  1 
coincide  with  the  corresponding  bounds  given  by  Hashin  and  Shtrikman  [14].  The  two 
remaining  bounds  are,  in  many  cases,  much  tighter  than  the  corresponding  bounds  of 
Hashin  and  Shtrikman,  reflecting  our  assumption  that  the  particles  cannot  touch. 
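
For orientation, the Hashin-Shtrikman bound that coincides with one side of (25) and (26) can be evaluated directly. The sketch below uses the standard three-dimensional Hashin-Shtrikman expression for a matrix of conductivity 1 containing a volume fraction $p_1$ of material of conductivity z; it is not taken from this paper, and the function name is hypothetical. The relation $p_1 = 0.6 q^3$ used in the example is an assumption that matches the $p_1$ column of Tables 2.a and 2.b.

```python
import math


def hs_matrix_bound(z, p1):
    """Hashin-Shtrikman bound with the matrix (conductivity 1) as reference.

    It is an upper bound on m(z) for 0 <= z < 1 and a lower bound for z > 1.
    z may be math.inf to represent perfectly conducting inclusions.
    """
    if z == 1.0:
        return 1.0
    p2 = 1.0 - p1
    inv_contrast = 0.0 if math.isinf(z) else 1.0 / (z - 1.0)
    return 1.0 + p1 / (inv_contrast + p2 / 3.0)


if __name__ == "__main__":
    q = 0.5
    p1 = 0.6 * q ** 3                      # assumed core volume fraction
    print(hs_matrix_bound(0.0, p1))        # insulating inclusions
    print(hs_matrix_bound(math.inf, p1))   # perfectly conducting inclusions
```

For q = 0.50 these two values agree, to the digits printed there, with the entries 0.891566 (Table 2.a) and 1.24324 (Table 2.b).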

To illustrate our results numerically, we consider a microgeometry formed by spheres
of conductivity z coated by a corona of material of conductivity 1 immersed, in a random
manner, in a matrix of conductivity 1. This is a random array of closely packed
coated spheres. The coated spheres fill about 60 percent of the volume [22]. In figures
1.a-3.b we plot our bounds for this microgeometry. In tables 2.a-2.b we give the values
of the bounds V and W for the complete range of values of the parameter q. For
comparison, we include the values $B_l$ and $B_u$ of the lower and upper bounds obtained,
for the same microgeometry, by the method of security spheres (see [16,23]). We thank
S. Torquato and J. Rubinstein for communicating their preliminary results to us.

Table 1

 q      A          z_m = -B      z_M        s_m       s_M
 .00    1.00000     -2.00000    -1.00000   0.33333   0.50000
 .10    1.00000     -2.00300    -1.00000   0.33300   0.50000
 .20    1.00000     -2.02419    -1.00000   0.33067   0.50000
 .30    1.00000     -2.08325    -1.00000   0.32433   0.50000
 .40    1.00000     -2.20513    -1.00000   0.31200   0.50000
 .50    1.00000     -2.42857    -1.00000   0.29167   0.50000
 .60    1.00000     -2.82653    -1.00000   0.26133   0.50000
 .70    1.28311     -3.56621    -0.77936   0.21900   0.56200
 .80    2.07377     -5.14754    -0.48221   0.16267   0.67467
 .90    4.53506    -10.07011    -0.22050   0.09033   0.81933
 .91    5.08694    -11.17389    -0.19658   0.08214   0.83571
 .92    5.77776    -12.55552    -0.17308   0.07377   0.85246
 .93    6.66702    -14.33404    -0.14999   0.06521   0.86957
 .94    7.85395    -16.70789    -0.12732   0.05647   0.88706
 .95    9.51709    -20.03418    -0.10507   0.04754   0.90492
 .96   12.01360    -25.02719    -0.08324   0.03842   0.92316
 .97   16.17681    -33.35361    -0.06182   0.02911   0.94178
 .98   24.50669    -50.01339    -0.04081   0.01960   0.96079
 .99   49.50305   -100.00631    -0.02020   0.00990   0.98020
 1      ∞            -∞          0          0         1

Table 1: values of the different constants related to the bounds (25) and (26)
for an arbitrary array of spheres, as functions of the parameter q. The interval
$z_m < z < z_M$ contains all the zeroes and singularities of m in the z plane.


Figures 1.a-3.b: the bounds (25) and (26) in the case of a random closely packed
array of coated spheres. The coated spheres are assumed to fill 60 percent of the
volume of the sample. Figures 1.a-1.b: case q = .5. Figures 2.a-2.b: case q = .8.
Figures 3.a-3.b: case q = .99. Other quantities of interest can be found in tables 2.a
and 2.b.

Table 2.a
Case z = 0

 q      p_1        V          W          B_l        B_u
 0.10   0.000600   0.999100   0.999100   0.999100   0.999100
 0.20   0.004800   0.992813   0.992817   0.992794   0.992829
 0.30   0.016200   0.975851   0.975895   0.975634   0.976024
 0.40   0.038400   0.943238   0.943485   0.942029   0.944186
 0.50   0.075000   0.890653   0.891566   0.886076   0.894118
 0.60   0.129600   0.815090   0.817430   0.801308   0.824549
 0.70   0.205800   0.712703   0.720102   0.680336   0.736492
 0.80   0.307200   0.574567   0.600555   0.514334   0.633121
 0.90   0.437400   0.376004   0.461639   0.292309   0.519165
 0.93   0.482614   0.293637   0.416807   0.212756   0.483717
 0.96   0.530842   0.191347   0.370753   0.126452   0.447948
 0.99   0.582179   0.056292   0.323619   0.032892   0.411999

Tables 2.a, 2.b: the bounds (25), (26) and the corresponding security sphere
bounds $B_l$ and $B_u$. Table 2.a: a closely packed array of coated spheres, z = 0.
Table 2.b: a closely packed array of coated spheres, z = ∞.

Table 2.b
Case z = ∞

 q      p_1        W         V         B_l       B_u
 0.10   0.000600   1.00180   1.00180   1.00180   1.00180
 0.20   0.004800   1.01447   1.01449   1.01438   1.01452
 0.30   0.016200   1.04940   1.04959   1.04834   1.04995
 0.40   0.038400   1.11980   1.12097   1.11374   1.12308
 0.50   0.075000   1.24324   1.24841   1.21951   1.25714
 0.60   0.129600   1.44669   1.46525   1.37270   1.49592
 0.70   0.205800   1.77739   1.85262   1.57777   1.93973
 0.80   0.307200   2.33025   2.66324   1.83599   2.88852
 0.90   0.437400   3.33239   5.14329   2.14523   5.84207
 0.93   0.482614   3.79838   7.28052   2.24720   8.40043
 0.96   0.530842   4.39443   12.6322   2.35310   14.8163
 0.99   0.582179   5.18012   50.1269   2.46269   59.8044

DYNAMIC SHEAR BAND DEVELOPMENT IN PLANE STRAIN COMPRESSION

OF  A  BIMETALLIC  BODY 


R.  C.  Batra  and  Z.  G.  Zhu 

Department  of  Mechanical  and  Aerospace  Engineering 
and  Engineering  Mechanics 
University  of  Missouri-Rolla 
Rolla,  MO  65401-0249,  USA 

ABSTRACT. We study plane strain thermomechanical deformations of a prismatic
viscoplastic body of square cross-section deformed at a nominal strain-rate of 5000
sec$^{-1}$. The body has two thin layers placed symmetrically about the horizontal centroidal
axis. The layer material differs from that of the body in only the value of the yield stress in
a quasistatic simple compression test. The yield stress for the layer material is taken to be
either one-fifth or five times that of the matrix material. Three cases, namely, when there is
an elliptical void with its major axis aligned along the horizontal centroidal axis or the vertical
centroidal axis, but the void center coincides with the center of the cross-section, and
when there are two elliptical voids with major axes aligned along the vertical centroidal axis
and a void tip abuts the layer/matrix interface, are studied. The deformations are assumed
to be symmetrical about the vertical and horizontal centroidal axes.

It is found that in each case shear bands initiate from points on the traction free edges
where the matrix/layer interfaces intersect them and propagate into the softer material. For
the soft layer these bands initially merge into one and propagate horizontally.
Subsequently, each of these bands bifurcates into two which propagate into the matrix
material along the direction of the maximum shear stress. There is minimal interaction
between these bands and those initiating from points near the void tips.

INTRODUCTION.  Adiabatic  shear  bands  are  narrow  regions  of  intense  plastic 
deformation  that  form  during  high  strain-rate  processes,  such  as  shock  loading,  ballistic 
penetration,  metal  forming,  and  machining.  As  these  bands  generally  precede  material 
fracture, a knowledge of factors that inhibit or enhance their growth is essential to the production
of durable materials and more efficient manufacturing processes. These bands
form  in  both  ferrous  and  nonferrous  alloys. 

Johnson  (1987)  has  recently  pointed  out  that  Tresca  (1878)  and  Massey  (1921) 
observed  hot  lines,  now  referred  to  as  adiabatic  shear  bands,  during  the  forging  of  platinum. 
Both Tresca and Massey stated that these were the lines of greatest sliding, and also therefore
the zones of greatest development of heat. Wulf (1978) has reported experimental
observations of adiabatic shear bands in high strain rate (2000 to 25000 sec$^{-1}$) compression
of 7039 aluminum armour. He found that the cross-section of the cylindrical specimens
changed from circular to elliptical after the compression test, and adiabatic shear bands
formed in the specimens which subsequently failed by crack propagation along the dominant
band. Further references to the analytical, numerical and experimental work on shear
banding  may  be  found  in  two  recent  books  (Dodd  and  Bai  (1987),  Semiatin  and  Jonas 
(1984)). 

Recently,  LeMonds  and  Needleman  (1986),  Needleman  (1989),  Anand  et  al.  (1988), 
Zbib  and  Aifantis  (1988),  Batra  and  Liu  (1989,1990),  Zhu  and  Batra  (1990),  Batra  and  Zhu 
(1990),  and  Batra  and  Zhang  (1990)  have  studied  the  phenomenon  of  shear  banding  in 
plane  strain  deformations  of  a  viscoplastic  solid.  Whereas  Needleman  studied  a  purely 
mechanical  problem,  other  works  have  treated  a  coupled  thermomechanical  problem. 
LeMonds and Needleman, Zbib and Aifantis, and Anand et al. neglected the effect of inertia
forces on the ensuing deformations of the body. These investigations have employed

* Supported by the U. S. Army Research Office Contract DAAL03-88-K-0184 to the University
of Missouri - Rolla.
different constitutive relations, different techniques to integrate the stiff set of governing
partial differential equations, and have generally assumed that the entire body or the portion
of the body whose deformations were analyzed had only one defect in it. The prismatic
body whose plane strain thermomechanical deformations are studied herein is of a square
cross-section and has two thin layers made of a viscoplastic material different from that of
the body and placed symmetrically about and parallel to the horizontal centroidal axis.
These horizontal layers may be thought of as representing planes of chemical inhomogeneity.
Also, as stated above in the abstract, there is either an elliptical void at the center of
the cross-section or two ellipsoidal voids with major axes along the vertical centroidal axis
and tips touching the layer/matrix interfaces. The voids can form during manufacturing.
However, the symmetrical situation considered herein is to simplify the problem. The vertices
of the ellipsoidal void on its major axis and the points on the free edges where the thin
layer and the matrix materials meet act as nuclei for the initiation of shear bands. It thus
becomes an interesting exercise to study the initiation and growth of various bands and the
interaction, if any, amongst them. We account for the effect of inertia forces, strain-rate
sensitivity of the materials, thermal softening effects, heat conduction, and the heat generated
due to plastic working.

FORMULATION OF THE PROBLEM. Figure 1 depicts the cross-section of the
prismatic body, and the relative dimensions of the ellipsoidal void and the two thin layers.
For this case, the centers of the void and the cross-section coincide and the major axis of the
void coincides with the vertical centroidal axis of the cross-section. It is assumed that the
body is loaded along the vertical axis, a plane strain state of deformation prevails, and that the
deformations are symmetrical about the two centroidal axes. Thus the deformations of the
material in the first quadrant are analyzed. We use a fixed set of rectangular cartesian coordinate
axes and the referential description of motion to describe the thermomechanical
deformations of the body. The governing equations are:

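balance of mass: $(\rho J)^{\cdot} = 0$, (2.1)

balance of linear momentum: $\rho_0 \dot{v}_i = T_{i\alpha,\alpha}$, (2.2)

balance of moment of momentum: $T_{i\alpha} x_{j,\alpha} = T_{j\alpha} x_{i,\alpha}$, (2.3)

balance of internal energy: $\rho_0 \dot{e} = -Q_{\alpha,\alpha} + T_{i\alpha} v_{i,\alpha}$, (2.4)

where

$T_{i\alpha} = (\rho_0/\rho)\, \sigma_{ij} X_{\alpha,j}$,

$\sigma_{ij} = -B(\rho/\rho_0 - 1)\, \delta_{ij} + 2\mu D_{ij}$, (2.5)

$2\mu = [\sigma_0/(\sqrt{3}\, I)]\, (1 + bI)^m (1 - a\theta)$, (2.6)

$I^2 = (1/2)\, \bar{D}_{ij} \bar{D}_{ij}$, (2.7)

$\bar{D}_{ij} = D_{ij} - (1/3)\, D_{kk}\, \delta_{ij}$, (2.8)

$Q_\alpha = (\rho_0/\rho)\, q_i X_{\alpha,i}$, $q_i = -k\, \theta_{,i}$, (2.9)

$\dot{e} = c\, \dot{\theta} + B(\rho/\rho_0 - 1)\, \dot{\rho}/\rho^2$. (2.10)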


In these equations $x_i$ gives the position at time t of the material particle that occupied
the place $X_\alpha$ in the reference configuration, $v_i = \dot{x}_i$ is its velocity in the
$x_i$-direction, $\rho$ is its present mass density, $\rho_0$ its mass density in the reference
configuration, $J = \det[x_{i,\alpha}]$, $x_{i,\alpha} = \partial x_i/\partial X_\alpha$, $T_{i\alpha}$ is the
first Piola-Kirchhoff stress tensor, $\sigma_{ij}$ is the Cauchy stress tensor, e is the specific
internal energy, $Q_\alpha$ is the heat flux measured per unit area in the reference configuration,
D is the strain-rate tensor and $\bar{D}$ is its deviatoric part, a superimposed dot indicates
material time derivative, a comma followed by index $\alpha$ (j) implies partial differentiation
with respect to $X_\alpha$ ($x_j$), and a repeated index implies summation over the range (1,2) of
the index. In the constitutive relations (2.5), (2.9) and (2.10), the material parameter B
represents the bulk modulus, $\sigma_0$ is the yield stress in a quasistatic simple compression
test, parameters b and m characterize the strain-rate sensitivity of the material, a describes
its thermal softening, $\theta$ equals the temperature change of a material particle from that in
the reference configuration, k is the constant thermal conductivity and c is the constant
specific heat. Here we have not considered the stresses caused by the thermal expansion.

The foregoing equations hold in regions occupied by the matrix and the layers. The
values of material parameters for the matrix and the layer materials are the same except
that either

$\sigma_0^{\mathrm{layer}} = 5\, \sigma_0^{\mathrm{matrix}}$, (2.11a)

or

$\sigma_0^{\mathrm{layer}} = (1/5)\, \sigma_0^{\mathrm{matrix}}$. (2.11b)

In terms of the deviatoric stress s defined by

$s = \sigma + [B(\rho/\rho_0 - 1) - (2\mu/3)\, \mathrm{tr}\, D]\, 1$, (2.12a)

$\;\; = 2\mu \bar{D}$, (2.12b)

equations (2.12), (2.5) and (2.6) give

$(\tfrac{1}{2}\, \mathrm{tr}\, s^2)^{1/2} = (\sigma_0/\sqrt{3})\, (1 - a\theta)\, (1 + bI)^m$. (2.13)
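As a numerical illustration of (2.6) and (2.13), the following minimal sketch evaluates the effective stress and the quantity $2\mu$ for given values of the temperature rise and the strain-rate measure I. The function names and the sample arguments are illustrative assumptions and are not taken from the finite element code used by the authors.

```python
import math


def effective_stress(sigma0, a, b, m, theta, strain_rate_measure):
    """Right-hand side of (2.13): (sigma0/sqrt(3)) (1 - a*theta) (1 + b*I)^m."""
    return (sigma0 / math.sqrt(3.0)) * (1.0 - a * theta) * (
        1.0 + b * strain_rate_measure
    ) ** m


def two_mu(sigma0, a, b, m, theta, strain_rate_measure):
    """2*mu from (2.6); it is singular at I = 0, so I must be positive."""
    if strain_rate_measure <= 0.0:
        raise ValueError("I must be positive for 2*mu to be defined")
    return (
        effective_stress(sigma0, a, b, m, theta, strain_rate_measure)
        / strain_rate_measure
    )


# Sample evaluation (SI units; parameter values as quoted in (3.1)).
s_eff = effective_stress(333e6, 0.0025, 1.0e4, 0.025, theta=0.0, strain_rate_measure=0.0)
print(f"effective stress at theta = 0, I = 0: {s_eff / 1e6:.1f} MPa")
```

Because $s = 2\mu \bar{D}$ and $I^2 = (1/2)\, \bar{D}_{ij} \bar{D}_{ij}$, the product of `two_mu` and I reproduces `effective_stress`, which is the content of (2.13).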


We assume that the body is initially at rest at a uniform temperature, has a constant
mass density and is initially stress free. That is,

$\rho(x,0) = 1$, $v(x,0) = 0$, $\theta(x,0) = 0$. (2.14)


For the material in the first quadrant, we impose the following boundary conditions.

$v_2 = -h(t)$, $T_{12} = 0$ and $Q_2 = 0$, on the top surface AB, (2.15)

$T_{11} = T_{21} = 0$ and $Q_1 = 0$, on the right surface BC, (2.16)

$v_2 = 0$, $T_{12} = 0$ and $Q_2 = 0$, on the bottom surface CD, (2.17)

$T_{i\alpha} N_\alpha = 0$, $Q_\alpha N_\alpha = 0$, on the void surface DE, (2.18)

$v_1 = 0$, $T_{21} = 0$ and $Q_1 = 0$, on the left surface EA. (2.19)


That is, the top surface is moving downward with a speed h(t), contact between it and the
loading device is smooth, the right surface is traction free, and the entire boundary is
thermally insulated. If at any time during the deformations of the body a point on the void
surface touches the vertical axis, the boundary condition on it is changed to (2.19). The
boundary conditions (2.17) and (2.19) reflect the presumed symmetry of deformations
about the $x_1$ and $x_2$ axes. For the loading function h(t) we take

$h(t) = v_0\, t/t_r$, $0 \le t \le t_r$,

$h(t) = v_0$, $t \ge t_r$. (2.20)

The steady speed $v_0$ of the top surface of the block is reached in time $t_r$.
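The ramp loading (2.20) can be written as a short function. This is only a sketch: the function name is hypothetical, and the rise time $t_r$ is left as an argument because its value is not stated in this section.

```python
def top_surface_speed(t, v0, t_r):
    """Loading function h(t) of (2.20): linear ramp to v0 over time t_r, then constant."""
    if t < 0.0:
        raise ValueError("time must be non-negative")
    if t_r <= 0.0:
        raise ValueError("rise time t_r must be positive")
    if t <= t_r:
        return v0 * t / t_r
    return v0


# Example with v0 = 25 m/s and an arbitrary, assumed rise time of 1 microsecond.
for t in (0.0, 0.5e-6, 1.0e-6, 5.0e-6):
    print(t, top_surface_speed(t, v0=25.0, t_r=1.0e-6))
```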

The  matrix  and  the  layer  are  assumed  to  be  perfectly  bonded.  Thus  at  the  common 
interfaces between them, the velocity field, surface tractions, the temperature and the
normal component of the heat flux are assumed to be continuous.

For  other  configurations  of  the  voids,  the  boundary  conditions  are  appropriately 
modified. 

RESULTS. The finite element code developed by Batra and Liu (1989, 1990) was
modified to analyze the present problem. In order to compute results, we used the
following values of various material and geometric parameters.

b = 10000 sec, $\sigma_0$ = 333 MPa, k = 49.22 W m$^{-1}$ °C$^{-1}$,
m = 0.025, c = 473 J kg$^{-1}$ °C$^{-1}$, $\rho_0$ = 7860 kg m$^{-3}$,
B = 128 GPa, H = 5 mm, $v_0$ = 25 m sec$^{-1}$, (3.1)
a = 0.0025 °C$^{-1}$.

Thus the average applied strain-rate $\dot{\gamma}_{\mathrm{avg}}$ equals 5000 sec$^{-1}$, the reference
temperature $\theta_0 = \sigma_0/(\rho_0 c)$ = 89.6 °C, and $\nu = \rho_0 v_0^2/\sigma_0$ = 0.015. The
nondimensional number $\nu$ signifies the effect of inertia forces relative to the flow stress of
the material. For the simple shearing problem, Batra (1988) noted that the inertia forces play
a noticeable role when $\nu$ = 0.004. Thus for the present problem, the inertia forces will very
likely play a significant role.
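The three derived quantities quoted above follow directly from the parameter values in (3.1). The short check below recomputes them in SI units; the variable names are illustrative, and the average strain-rate is taken to be $v_0/H$, which is an assumption consistent with the quoted value of 5000 sec$^{-1}$.

```python
# Parameter values from (3.1), converted to SI units.
sigma0 = 333e6   # yield stress, Pa
rho0 = 7860.0    # reference mass density, kg/m^3
c = 473.0        # specific heat, J/(kg degC)
H = 5e-3         # quoted dimension H, m
v0 = 25.0        # steady speed of the top surface, m/s

avg_strain_rate = v0 / H                 # assumed definition, 1/s
theta0 = sigma0 / (rho0 * c)             # reference temperature, degC
inertia_number = rho0 * v0**2 / sigma0   # inertia forces relative to flow stress

print(f"average strain-rate  = {avg_strain_rate:.0f} 1/s")
print(f"reference temperature = {theta0:.1f} degC")
print(f"inertia number        = {inertia_number:.4f}")
```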

LAYER MATERIAL SOFTER THAN THE MATRIX MATERIAL. Figure 2
depicts the contours of the maximum principal logarithmic strain $\epsilon$ at an average strain of
0.079 when the center of the ellipsoidal void coincides with the center of the cross-section
and the major axis of the void is aligned along the vertical centroidal axis. These contours
and other results reported by Zhu and Batra (1990b) reveal that shear bands initiate from
points on the right traction free edge where the matrix/layer interfaces intersect it.
Because the layer material is softer and its thickness quite small, these bands merge into one
band that initially propagates horizontally into the layer. When the matrix material has
softened somewhat due to the rise in its temperature, the horizontally propagating band
bifurcates into two bands that propagate into the matrix along ±45° directions. The band
propagating into the matrix material above the layer has more severe deformations
associated with it than the one propagating into the matrix material below the layer.

The band initiating from a point near the void tip on the vertical centroidal axis
propagates along a line that makes an angle of 45° with the horizontal. This band seems to
pass through the soft layer rather easily.

Figure 3 shows the contours of the maximum principal logarithmic strain $\epsilon$ at an
average strain of 0.0333 when the ellipsoidal void is at the center of the cross-section and the
major axis of the void is aligned along the horizontal centroidal axis. These plots look quite
similar to those for the case when the major axis of the void coincides with the vertical
centroidal axis. For a further discussion and details of results in this case, see Batra and Zhu
(1991).

When the void tip touches the matrix/layer interface and the major axis of the void
coincides with the vertical centroidal axis, the contours of $\epsilon$ plotted in Figure 4 at an
average strain of 0.0175 look quite different from the previous two cases. Results given by
Batra and Zhu (1990) and these contours of $\epsilon$ suggest that a shear band initiates within the
matrix surrounding the void tip near the matrix/layer interface and propagates into the
matrix material below the common interface, the direction of propagation being nearly 45°
to the vertical axis. The shear bands initiating at points of intersection of the matrix/layer
interfaces with the right traction free surface propagate into the soft layer and then
bifurcate into the matrix material along lines making an angle of approximately 45° with the
vertical. The band in the layer near the upper matrix/layer interface bifurcates into the
matrix prior to that near the lower interface. Also, the band in the layer near the upper
matrix/layer interface continues to propagate horizontally into the layer, while that near
the lower interface does not.

LAYER MATERIAL STRONGER THAN THE MATRIX MATERIAL. We first
study the case when the center of the ellipsoidal void coincides with that of the
cross-section. In Figure 5, we have plotted contours of $\epsilon$ at an average strain of 0.122 when the
major axis of the void is vertical. Now the bands initiating from points on the right traction
free edge where the matrix/layer interfaces intersect it propagate into the matrix material
along lines making an angle of approximately ±45° with the horizontal. Recall that the
matrix material is softer than the layer material. The quarter of the square cross-section
studied is divided into five subregions, each of which is deforming essentially rigidly, and
there is a shear band at the four common boundaries (e.g., see Zhu and Batra (1990b)).
However, when the major axis of the void is horizontal, contours of $\epsilon$ depicted in Figure 6
at an average strain of 0.031 suggest a picture different from the one when the major axis of
the void was vertical. We should add that the average strains in the two cases are quite
different. Hence, a direct comparison is not very meaningful.

Figure 7 shows contours of $\epsilon$ at an average strain of 0.057 when a void tip is at the
matrix/layer interface. These contours and other results given by Batra and Zhu (1990)
reveal that a shear band initiating from the void tip abutting the matrix/layer interface
propagates initially along the interface and then into the matrix material along a line making an
angle of nearly 45° with the vertical. The shear band initiating from the lower void tip also
propagates into the matrix material along a line making an angle of approximately 45° with
the vertical. Two shear bands also initiate from points on the right traction free edge where
matrix/layer interfaces intersect it, and these bands propagate into the matrix material along
lines making an angle of 45° with the vertical. Even though it seems that near the vertical
centroidal axis a shear band has propagated into the layer, there is no localization of
deformation occurring in the layer material. This is evidenced by the plots of $\epsilon$ versus the
average strain at several points in the layer that are given in Fig. 8c of Batra and Zhu's (1990)
paper. The contours of the temperature rise, not included herein, support the picture laid
out above for the development of four bands, two from void tips and two from points on the
right traction free surface where the layer and the matrix materials meet.

CONCLUSIONS. We have analyzed the problem of the initiation and growth of shear
bands in a prismatic viscoplastic body containing an ellipsoidal void, and two thin layers
made of a different viscoplastic material placed symmetrically about the horizontal
centroidal axis. The body is deformed in plane strain compression along the vertical axis at an
average strain-rate of 5000 sec$^{-1}$, and its deformations are assumed to be symmetrical about
the two centroidal axes.

Two shear bands initiate from points on the right traction free edge where the
matrix/layer interfaces meet it. These bands propagate into the softer material. When the
matrix material is softer, these bands propagate along lines that make an angle of
approximately ±45° with the horizontal. However, when the layer material is softer, these bands
essentially merge into one and initially propagate horizontally into the layer. Subsequently,
this band bifurcates into two bands that propagate into the matrix material along ±45°
directions.

Shear bands also initiate from points near the void tips and propagate in the ±45°
directions. When the layer material is stronger than the matrix material, these bands do not
pass through the layer and are deflected back upon arriving at the matrix/layer interface.
However, they pass easily through a softer layer.

Fig. 2. Contours of the maximum principal logarithmic strain at $\gamma_{\mathrm{avg}}$ = 0.079.
$\sigma_0^{\mathrm{layer}} = (1/5)\, \sigma_0^{\mathrm{matrix}}$. The major axis of the ellipsoidal void coincides with the
vertical centroidal axis.

Fig. 3. Contours of the maximum principal logarithmic strain at $\gamma_{\mathrm{avg}}$ = 0.0333.
$\sigma_0^{\mathrm{layer}} = (1/5)\, \sigma_0^{\mathrm{matrix}}$. The major axis of the ellipsoidal void coincides with the
horizontal centroidal axis.

Fig. 4. Contours of the maximum principal logarithmic strain at $\gamma_{\mathrm{avg}}$ = 0.0175.
$\sigma_0^{\mathrm{layer}} = (1/5)\, \sigma_0^{\mathrm{matrix}}$. Void tip is at the matrix/layer interface.

Fig. 5. Contours of the maximum principal logarithmic strain at $\gamma_{\mathrm{avg}}$ = 0.122.
$\sigma_0^{\mathrm{layer}} = 5\, \sigma_0^{\mathrm{matrix}}$. The major axis of the ellipsoidal void coincides with
the vertical centroidal axis.

ABSTRACT

In the constitutive modeling of metals there has long been the need for a simple
test methodology to achieve large deformation with a uniform stress and strain
state. The torsion test has been recognized for its potential to fill this need. The
geometry of the test specimen does not change significantly even for very large
deformation. In recent years, the thin-walled tubular specimen having a short gauge length
has been proposed to meet this need. In this paper, a nonlinear finite element
analysis of a typical specimen geometry has been conducted using the ABAQUS finite
element code. Several different three dimensional models have been used.

The  analysis  shows  the  suitability  and  limitations  of  applying  a  simple  shear 
approximation  to  the  deformation  of  the  gauge  section.  Quantitative  assessment  is 
made of the macroscopically derived stress and strain states with the numerically
predicted variations within the gauge region. A correction factor is found to be required
when  converting  the  applied  twist  at  the  grips  to  average  shear  strain  across  the 
gauge  section.  Plastic  deformation  extends  from  the  gauge  section  into  the  shoulder 
region  whether  there  is  an  abrupt  or  gradual  transition  from  the  large  shoulder  region 
to  the  gauge  section.  This  also  causes  the  development  of  axial  strain  in  the  gauge 
section  even  when  the  axial  motion  of  the  grips  is  constrained  to  be  zero. 

The  usefulness  of  the  thin-walled,  short  gauge  length  specimen  is  discussed 
in light of the detailed analysis. The need for accurate modeling of experimental
procedures is highlighted.
Introduction


Determining the behavior of materials, particularly metals, under large deformation
conditions has presented substantial experimental difficulties from the time of Tresca's early
metalworking experiments [Tresca, 1864] to the present. Primary difficulties have been in
determining the stress and strain accurately in an experimental test specimen that undergoes
gross deformation. A homogeneous test region is required which is free from large stress or
strain gradients and is large enough to measure displacements from which strain can be
inferred. Tension, compression, rolling, extrusion, torsion and drawing are all types of
procedures that have been applied to metals to achieve large deformation.

Torsion of solid rods and thin-walled tubular specimens has been particularly
attractive because they do not undergo a large change in dimensions during a test. The biggest
drawback of the solid rod torsion specimen is the large radial gradient in stress and strain.
The distribution of shear strain in a twisted rod is linear, varying from zero at the center to a
maximum on the surface. The variation of shear stress depends upon the constitutive
behavior. A solid specimen can have a fully three dimensional stress distribution even if the ends
of the specimen are free to expand or contract. Symmetry conditions require that the cross
sectional planes of material must remain plane during deformation [Crandall, Dahl and
Lardner, 1972]. An unmeasurable axial stress distribution is required to produce that
deformation for all but the simplest constitutive models. There are two very attractive features of
this type of test: the geometry is very simple and easily machined, and the solid rod is very
stable against buckling. These two features must be weighed against the uncertainty in the
results due to the nonhomogeneous stress and strain states.

The thin wall torsion specimen has received considerable attention in the literature in
recent years because it offers the possibility of a simple deformation field and homogeneous
stress and strain states. A specimen whose wall thickness is only a small fraction of the
radius of the section will have a nearly uniform strain distribution through the wall thickness.
The stress state is given by dividing the required torque or axial (thrust) load by the cross
sectional area and mean radius. The tubular test specimen has been historically used to
probe biaxial stress states and infinitesimal theory flow rules [Taylor and Quinney, 1931;
Phillips and Lu, 1984]. The specimen is known to be unstable at large strains when it is
proportioned by conventional means having long uniform gauge sections with gradual transition
to the gripping region. Torsional buckling is a primary mode of failure for such a specimen.
The problem of torsional buckling can be avoided by shortening the gauge length so that it is
only a fraction of the diameter of the gauge section. It is much harder to buckle a short, squat
cylinder than a long, thin one. A specimen of this type was first proposed by Hodierne in
1962 for use in hot working studies [Hodierne, 1962] and popularized by Lindholm et al.
[1980]. Figure 1 shows a generic sketch of this type of specimen. Notice the short, thin wall
gauge section which quickly transitions to the thick walled shoulder region where the
specimen is gripped.

Torsion test results were originally used to compare normalized flow stress
behavior with tension/compression results [Hecker, 1982]. A landmark paper by Nagtegaal
and deJong [1982] showed that simple shear deformation can yield startling stress
predictions which are very sensitive to material model. These results show the importance of
simple shear deformation for discriminating material behavior, especially kinematic hardening.
Nagtegaal and deJong showed that classical Prager type kinematic hardening predicts very
large normal stresses in simple shear. The stress components are seen to oscillate with
plastic strain for very large shearing due to the objective stress rate (Jaumann) which was
used. In the past decade there has been intensive work aimed at describing material
behavior in large simple shear and even formulating constitutive models with the specific goal of
describing simple shear in a more intuitively acceptable manner. In light of these results the
torsion test has a new importance insofar as it provides an approximation to simple shear
deformation. Just how close an approximation this is will be examined below.
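The oscillation attributed to the Jaumann rate can be reproduced with a few lines of code. The sketch below is an idealization introduced here for illustration and is not the calculation of Nagtegaal and deJong or of the present paper: it assumes rigid-plastic Prager hardening with a constant modulus C, so that the Jaumann rate of the back stress equals C times the stretching tensor, and simple shear with velocity $v_1 = \dot{\gamma} x_2$. Writing the in-plane back stress components as $a = \alpha_{11} = -\alpha_{22}$ and $b = \alpha_{12}$, these assumptions reduce to $da/d\gamma = b$ and $db/d\gamma = C/2 - a$, whose solution is $a = (C/2)(1 - \cos\gamma)$, $b = (C/2)\sin\gamma$.

```python
import math


def back_stress_simple_shear(gamma_max, C=1.0, steps=2000):
    """Integrate da/dgamma = b, db/dgamma = C/2 - a with classical RK4.

    a is the normal component alpha_11 (= -alpha_22) and b is the shear
    component alpha_12 of the back stress under the assumptions stated above.
    """
    def rhs(a, b):
        return b, 0.5 * C - a

    h = gamma_max / steps
    a = b = 0.0
    for _ in range(steps):
        k1a, k1b = rhs(a, b)
        k2a, k2b = rhs(a + 0.5 * h * k1a, b + 0.5 * h * k1b)
        k3a, k3b = rhs(a + 0.5 * h * k2a, b + 0.5 * h * k2b)
        k4a, k4b = rhs(a + h * k3a, b + h * k3b)
        a += h * (k1a + 2.0 * k2a + 2.0 * k3a + k4a) / 6.0
        b += h * (k1b + 2.0 * k2b + 2.0 * k3b + k4b) / 6.0
    return a, b


def closed_form(gamma, C=1.0):
    """Analytical solution of the same pair of equations."""
    return 0.5 * C * (1.0 - math.cos(gamma)), 0.5 * C * math.sin(gamma)


for gamma in (0.5, 1.0, math.pi, 2.0 * math.pi):
    print(gamma, back_stress_simple_shear(gamma), closed_form(gamma))
```

Under these assumptions the shear component returns to zero at $\gamma = \pi$ while the normal component reaches its maximum there, which is the kind of behavior described in the paragraph above.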

The kinematics of the finite strain tension-torsion of a thin-walled tube have been
examined in detail by McMeeking [1982]. For a uniformly deforming tube, the stretching
tensor (symmetric part of the velocity gradient) can be written in curvilinear coordinates as:

$$D = \begin{pmatrix} \dot{t}/t & 0 & 0 \\ 0 & \dot{r}/r & \dot{\gamma}/2 \\ 0 & \dot{\gamma}/2 & \dot{e} \end{pmatrix}$$

where the 1, 2, and 3 directions are the radial, hoop and axial directions, respectively. The
torsion test provides an approximation to simple shear only as well as it restrains the
changes in wall thickness, t, mean radius, r, and axial strain, e, to be zero. The geometry shown
in Figure 1 attempts to keep the gauge section free from normal stretches by the
presence of the large shoulders near the gauge region. The massive shoulders provide the
restraint against changes in the radius. The shoulders also provide the restraint against
axial deformation. The grips can be restrained against axial motion to within the stiffness of the
testing machine frame. The thick walls of the shoulder region transmit this axial stiffness to
the gauge region. The effectiveness of this geometry in restraining the radial, hoop and axial
normal straining is one of the primary things to be determined in evaluating the torsion test.

The thin walled torsion specimen has been used largely without investigation, either
experimental or numerical, as to its effectiveness in simulating simple shear. The single
exception is Lipkin, Chiesa and Bammann [1987], who conducted a numerical analysis of
their test specimen using the DYNA3D finite element hydrocode. This analysis will be
referred to below in comparison with the present results but a few comments are in order.

A number of features of this type of specimen were first observed. The gauge length
shows a slight lengthening during torsion even for perfectly fixed specimen ends. Also, the
shear strain at an element in the gauge section can differ from the average shear strain
calculated from the twist of the grips.

This analysis was conducted for the express purpose of comparing experimental
results of the authors with a particular constitutive model. As such, no comparison was made
with better understood, more classical material models.

The finite element mesh that was used was very coarse, containing only 3 elements
through the thickness. No mesh convergence comparisons were reported.

The results of Lipkin et al. have shed new light on the behavior of the thin-walled
torsion specimen but have not provided a detailed, quantitative assessment as to the
limitations and usefulness of the specimen. The current analysis addresses these questions.
Finite Element Model


The  specimen  geometry  that  was  simulated  corresponds  to  the  316  stainless  steel 
specimen  reported  in  White,  Anand  and  Bronkhorst  [1990].  The  gauge  length  was  5.9  mm. 
The  inside  diameter  was  19.05  mm.  The  outside  diameter  of  the  gauge  region  was  20.52  mm. 
The  outside  diameter  of  the  shoulder  region  was  38.1  mm. 

The  finite  element  analysis  was  conducted  with  the  ABAQUS  finite  element  program 
[ABAQUS,  1989].  All  of  the  elements  were  eight  node,  linear  displacement  bricks  having  a 
full  eight  material  integration  points  (C3D8  elements  in  ABAQUS).  Two  different  types  of 
mesh  designs  were  employed.  These  are  shown  in  Figure  2. 

The first type of mesh is one that represented the full three-dimensional geometry of
the specimen, Fig. 2a. Only one half of the specimen was discretized due to symmetry about
the midplane. The mesh had 5 elements through the thickness of the specimen wall. In the
one-half gauge region 8 elements were used in the axial discretization but were not evenly
spaced; they were smaller near the shoulder than at the specimen midsection. The mesh
was discretized into 12 rows of elements around the circumference. Thus each element
covered 30 degrees of arc. The applied boundary conditions were very simple. The nodes on the
specimen midplane were restrained against movement in the axial and local hoop directions
but were allowed free movement in the radial direction. The outermost ring of nodes at the
top of the shoulder region farthest from the gauge section was constrained to move in a circle
at the original radius. This was to simulate the twist applied to the specimen by the
hydraulic collet grips of the testing machine. All nodes on the plane farthest from the midplane
were restrained against axial motion. All other free motion of the specimen boundaries was
allowed without constraint.

The second type of mesh, Fig. 2b, discretized the specimen into just one
circumferential slice of either one or five degree extent. A fine mesh within that slice was used. Each
plane had 9 elements through the wall thickness and 15 along the axial length of the one-half
gauge length. Again, the nodes on the midplane of the gauge region were restrained against
axial or circumferential motion but radial motion was allowed. The nodes along the top face
of the shoulder region were restrained against axial motion. The outer node on the top face
was constrained to move in a circular arc simulating the applied twist. Compatibility was
enforced on this strip, to make it simulate an entire circumference, by requiring the initially
straight radial lines of nodes to remain on straight lines and by requiring the corresponding
nodes on the two faces of the strip to retain the prescribed circumferential angle (either one
or five degrees) between them.

The material models that were used in the simulations were classical, large strain
plasticity laws: isotropic hardening, kinematic hardening, and perfect plasticity. The material
was assumed to behave elastically with Young's modulus of 200 GPa and Poisson's ratio of
0.33 up to a yield stress of 250 MPa. Above this stress the elastic plastic behavior had a
constant plastic hardening modulus of 1500 MPa. The objective stress rate employed was
the Jaumann derivative. These material models satisfactorily bound the behavior both of
experimental results and of most of the constitutive laws in the literature.
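The quoted material constants can be collected in a short sketch of the monotonic uniaxial response. This is an illustration only: it assumes that the plastic hardening modulus of 1500 MPa is the slope of stress against plastic strain, it treats small-strain uniaxial tension rather than the large strain formulation used in ABAQUS, and all names are hypothetical.

```python
E = 200e9         # Young's modulus, Pa
NU = 0.33         # Poisson's ratio
SIGMA_Y0 = 250e6  # initial yield stress, Pa
H_P = 1500e6      # plastic hardening modulus, Pa (assumed d(stress)/d(plastic strain))


def shear_modulus():
    """Elastic shear modulus G = E / (2 (1 + nu))."""
    return E / (2.0 * (1.0 + NU))


def flow_stress(plastic_strain):
    """Linear hardening: yield stress as a function of plastic strain."""
    return SIGMA_Y0 + H_P * plastic_strain


def uniaxial_stress(strain):
    """Stress for monotonic uniaxial tension (strain >= 0), bilinear response."""
    yield_strain = SIGMA_Y0 / E
    if strain <= yield_strain:
        return E * strain
    tangent_modulus = E * H_P / (E + H_P)
    return SIGMA_Y0 + tangent_modulus * (strain - yield_strain)


print(f"G = {shear_modulus() / 1e9:.2f} GPa")
print(f"stress at 10% strain = {uniaxial_stress(0.10) / 1e6:.1f} MPa")
```

Setting `H_P` to zero in this sketch gives the perfectly plastic case mentioned above.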

Convergence studies were conducted on the step size and the nodal force tolerance in the
acceptance criterion for the finite element studies. Automatic load stepping was used in a
full large deformation analysis. Approximately 200 displacement increments were used to
twist the specimen to a nominal engineering shear strain of unity.
Results


The majority of the comparisons of the numerical results are shown in terms of
macroscopic variables that would be determined in an experiment. The nominal shear stress in the
gauge section was determined by summing the reaction forces in the circumferential direction
on the top face of the specimen at the shoulder (the applied torque) and dividing by the cross
sectional area and average radius of the gauge section. The axial normal stress was
determined in similar manner by summing the axial reaction force on the top face of the specimen
and dividing by the gauge cross sectional area. The average strains in the gauge section
were determined assuming an extensometer could measure the circumferential and axial
displacements on the outside of the gauge section at the intersection with the transition region
to the shoulder. The nominal axial strain was determined by dividing the axial displacement
by the original gauge length and the nominal average shear strain (engineering) by dividing
the arc displacement by the gauge length. These numerical results then can be compared
with the variables measured in an experimental test.
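The macroscopic measures described above can be summarized in a short sketch. The function names are hypothetical, and the average radius is taken here as the mean of the inner and outer radii of the gauge section, which is an assumption. The dimensions in the example are those quoted in the Finite Element Model section.

```python
import math


def gauge_section(inner_diameter, outer_diameter):
    """Return (cross sectional area, mean radius) of the tubular gauge section."""
    area = math.pi / 4.0 * (outer_diameter**2 - inner_diameter**2)
    mean_radius = (inner_diameter + outer_diameter) / 4.0
    return area, mean_radius


def nominal_shear_stress(torque, area, mean_radius):
    """Applied torque divided by cross sectional area and mean radius."""
    return torque / (area * mean_radius)


def nominal_axial_stress(axial_force, area):
    """Axial reaction force divided by the gauge cross sectional area."""
    return axial_force / area


def nominal_shear_strain(arc_displacement, gauge_length):
    """Engineering shear strain: arc displacement divided by gauge length."""
    return arc_displacement / gauge_length


def nominal_axial_strain(axial_displacement, gauge_length):
    """Axial displacement divided by the original gauge length."""
    return axial_displacement / gauge_length


# Gauge geometry in millimetres: ID 19.05, OD 20.52, gauge length 5.9.
area, r_mean = gauge_section(19.05, 20.52)
print(f"area = {area:.3f} mm^2, mean radius = {r_mean:.4f} mm")
print("shear strain for 5.9 mm of arc displacement:", nominal_shear_strain(5.9, 5.9))
```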

In  results  not  shown  here,  the  different  models  described  above  were  compared  for 
calculations using both isotropic and kinematic hardening. The macroscopic stress-strain
results and the stress and strain contours within the specimen were virtually identical. The
models:  full  circumferential,  one  degree  slice  and  five  degree  slice  all  yielded  results  that 
were  almost  indistinguishable.  This  was  taken  as  verification  that  the  mesh  was  sufficiently 
fine  and  the  boundary  conditions  for  the  single  slice  meshes  were  appropriate.  Most  of  the 
succeeding  results  that  will  be  discussed  were  obtained  with  the  single  slice  model  having 
five  degrees  circumferentially. 

In order to evaluate how well the thin walled torsion specimen approximates simple
shear, a comparison is made between the shear and normal stress response inferred from the
finite element simulation and that for the same constitutive models integrated directly
assuming only simple shear deformation. The results of this comparison are shown in Figure 3
for both isotropic and kinematic hardening. For isotropic hardening we see an almost exact
correlation. No normal stress develops and the shear stress linearly increases with strain.
For kinematic hardening (using Jaumann derivatives) a substantial axial normal stress is
predicted both by the finite element model of the specimen and the assumed simple shear
deformation. The finite element results give an axial normal stress approximately 20% larger than
predicted for simple shear. This provides a good way to estimate how well the specimen
approximates simple shear. This level of agreement is acceptable but not excellent. Notice that
the shear stress from the specimen correlates quite well with that for simple shear.

Stress contours are provided in Figure 4, for kinematic hardening, to demonstrate what
kind of variability exists in the gauge section. Notice that the shear stress is quite
uniform except right at the shoulder transition. The axial normal stress and the hoop
normal stress show a much larger variation across the wall of the gauge region. For a
perfect simple shear specimen these would be constant and of equal magnitude and opposite
sign, with the axial stress being compressive and the hoop stress tensile. In the specimen
the tensile hoop stress tries to pull the radius of the gauge in toward the center and
creates a slight bending in the wall, which creates the stress nonuniformity. For the
geometry shown we do notice that there is a cross section through the wall that has a
fairly constant stress for both the axial and hoop stresses. This cross section is about
one wall thickness away from the shoulder transition. At this location simple shear appears
to be maintained quite well. Thus the macroscopic variables do give reasonably good results.

The reason for the increased axial normal stress when compared to simple shear (Fig. 3)
can be traced to the fact that the plastic deformation is not completely contained in the
gauge section. Figure 5 shows the contours of equivalent plastic strain at a shear strain
of approximately 78%. Notice that the contour of 1% plastic strain extends into the
transition region for about one-half gauge length. Thus the plastic deformation is not
completely contained in the gauge section. One effect of this is that the axial restraint
is not perfect. Figure 6 shows the development of average axial strain across the gauge
length with shear strain. For the three plasticity laws considered, an extension of the
gauge length is observed during testing. The magnitude of this strain is small, but it is
enough to increase the axial stress, especially for kinematic hardening.

Another effect of the plastic strain extending into the transition shoulder region is on
the shear strain. In the results presented above, the shear strain was calculated from the
twist measured across the gauge section. In experimental practice, the twist applied by
the testing machine has been assumed to be entirely transmitted into the gauge section,
and hence it has been used to calculate the shear strain. From the finite element results
we compare the twist measured across the gauge section with the twist applied across the
entire specimen. In Figure 7 the ratio of these twists is plotted against rotation. For
the geometry considered here, the ratio is independent of hardening model. It is also
independent of applied strain after an initial transient. The rotation ratio does depend
slightly on whether it is measured along the inner surface or the outer surface of the
specimen. For this specimen, about 78% of the twist that is applied at the grips actually
goes into the deformation in the gauge section. It is very fortunate that this ratio is
independent of hardening model and deformation level. A simple correction factor can be
applied to the experimentally measured twist in converting to shear strain (just multiply
by 0.78). This correction factor depends on the particular specimen geometry but can be
easily evaluated from finite element modeling. Of course, experimentally one would like to
have a rotational extensometer to measure the twist in the gauge section directly. This
correction factor is the next most suitable approach.
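
A minimal sketch of how such a correction might be applied is given below. The 0.78 ratio
is the value quoted above for this particular specimen; the function name, the choice of
the outer radius for the arc displacement, and the sample numbers are assumptions made
only for illustration.

```python
def corrected_shear_strain(applied_twist_rad, r_outer, gauge_length,
                           rotation_ratio=0.78):
    """Engineering shear strain in the gauge section from the grip twist.

    rotation_ratio is the gauge/total twist ratio; 0.78 is the value reported
    for this specimen and must be re-evaluated for any other geometry.
    """
    gauge_twist = rotation_ratio * applied_twist_rad   # twist reaching the gauge
    arc_displacement = r_outer * gauge_twist           # arc length at outer surface
    return arc_displacement / gauge_length
```

Without the correction (rotation_ratio = 1.0) the shear strain would be overestimated by
roughly 28% for this geometry.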


Conclusions 

A  finite  element  model  has  been  used  to  evaluate  the  thin-walled  torsion  specimen. 
The  specimen  is  shown  to  provide  a  reasonable  approximation  to  simple  shear.  In  evaluating 
the  specimen  response  to  twisting  the  following  observations  are  made: 

The  shear  strains  are  almost  constant  in  the  gauge  section. 

The  axial  strain  across  the  gauge  section  increases  with  deformation  but  remains  small  for  fixed  grips. 

The  shear  stress  is  quite  uniform  in  the  gauge  section. 

The axial and hoop normal stresses have a large variation across the wall thickness except at one
location where they are uniform.

The  plastic  deformation  extends  into  the  shoulder  transition  region  and  has  the  main  effect  of  reducing 
the  twist  that  the  gauge  section  experiences  to  a  fraction  of  the  twist  applied  by  the  grips.  This 
factor  is  well  characterized  and  can  be  used  to  correct  experimental  results. 

Future work is needed to optimize specimen geometry. The effect of gauge length needs to
be explored, as well as the onset of torsional buckling. Experimentally, the largest need
is to be able to measure the strain in the gauge region directly.

Figure 1, Geometry of the Thin-Walled Torsion Specimen.

Figure 2, Finite Element Meshes Used to Simulate the Torsion Specimen:
a) Full Circumferential Mesh.
b) Mesh of Single Slice of Specimen.

Figure 3, Comparison of Finite Element Simulation with Simple Shear for:
a) Isotropic Hardening, and
b) Kinematic Hardening.

Figure 5, Contours of Equivalent Plastic Strain.

Figure 6, Development of Axial Strain in the Gauge Section with Deformation.

Figure 7, Ratio of Twist in the Gauge Section to Total Twist Applied by Grips as a
Function of Applied Twist for Various Hardening Rules.


WRAPPABILITY OF CURVES ON SURFACES


Royce  W.  Soanes 

U.S.  Army  Armament  Research,  Development,  and  Engineering  Center 
Close  Combat  Armaments  Center 
Benet  Laboratories 
Watervliet,  NY  12189-4050 


ABSTRACT. In this paper, conditions are derived under which a path on a general smooth
surface is wrappable, or capable of receiving an essentially one-dimensional flexible
filament under tension that clings to the surface throughout its length and does not slip.
Wrappability considerations are of practical importance in the fabrication of
filament-wound composite pressure vessels, for instance. The general wrappability
conditions derived are applied to two special cases: general cylinders and general
surfaces of revolution.

INTRODUCTION. Imagine a rotating spindle accepting string from some delivery point which
moves parallel to the axis of the spindle. This is the essence of the filament winding
process, where the "string" is replaced by a band of epoxy-impregnated fiberglass, for
instance; layer upon layer of this filament is evenly laid down; and the whole thing is
ultimately cured or baked into a filament-wound composite structure, often a pressure
vessel. The spindle or mandrel is designed so that it may be broken down into parts and
removed subsequent to curing, leaving only the wrappings embedded in the matrix material.
This paper considers the question of how the winding or wrapping process is limited by the
differential geometric nature of the mandrel's surface.

PRELIMINARIES. Begin with point P in three space:

$$P = iX + jY + kZ$$

Restrict P by parameterizing with respect to $x$ and $\theta$, defining a surface S
embedded in three space:

$$X = x, \qquad Y = r(x,\theta)\sin\theta, \qquad Z = r(x,\theta)\cos\theta$$

where $r(x,\theta)$ is the radius of the surface. We require r to be sufficiently smooth,
positive, and $2\pi$-periodic in $\theta$, making S a closed surface with an inside and an
outside. If P is further restricted by defining $\theta$ in terms of x, P will lie on a
curve or path c embedded in the surface S.

Let $(\,)'$ denote $\frac{d}{dx}(\,)$ and let s denote distance along curve c. The tangent
vector t to curve c is

$$t = \frac{dP}{ds} = P'(s')^{-1}$$

and the curvature vector $\kappa$ of any curve c is

$$\kappa = \frac{dt}{ds} = (P'' - t\,s'')(s')^{-2}$$

A family of vectors tangent to surface S at point P is

$$dP = P_x\,dx + P_\theta\,d\theta = \frac{\partial P}{\partial x}\,dx + \frac{\partial P}{\partial\theta}\,d\theta$$

Two independent vectors spanning the tangent space at P are therefore $P_x$ and
$P_\theta$. Now form the vector cross product of $P_x$ and $P_\theta$ to obtain $\nu$, a
vector normal to the surface and pointing outward from the surface.

$$\nu = P_x \times P_\theta$$

WRAPPABILITY CONDITION I - NO LIFTOFF. Now, in order for c to be a wrappable curve on
surface S, it is necessary for a length of flexible filament under tension to cling to c
and S. In order for this clinging to take place, it is necessary for c to see the outside
of S as being convex. This will be the case only if the curvature vector of c points
toward the inside of the surface, that is, opposite to the outward normal $\nu$. It is
therefore necessary that the inner or dot product of $\nu$ and $\kappa$ be negative for
clinging to take place. Now,

$$\nu\cdot\kappa = \nu\cdot(P'' - t\,s'')(s')^{-2}$$

but

$$\nu\cdot t = 0$$

therefore,

$$\nu\cdot\kappa = \nu\cdot P''(s')^{-2}$$

and since only the sign of $\nu\cdot\kappa$ matters here, the function $\lambda$ is defined as

$$\lambda = \nu\cdot P''$$

When $\lambda$ is positive, a filament under tension tends to lift off the surface and
form a bridge between two distant points; when $\lambda$ is negative, the filament tends
to cling to the surface.

Now, the evaluation of $\lambda$ in terms of x, $\theta$, and r is outlined. First,

$$P' = P_x + P_\theta\theta'$$

and

$$P'' = P_{xx} + 2P_{x\theta}\theta' + P_{\theta\theta}\theta'^2 + P_\theta\theta''$$

but $P_\theta$ is in the tangent space, so

$$\nu\cdot P_\theta = 0$$

Hence,

$$\lambda = \nu\cdot P'' = \nu\cdot P_{xx} + 2\,\nu\cdot P_{x\theta}\,\theta' + \nu\cdot P_{\theta\theta}\,\theta'^2$$

Note that all the inner products defining $\lambda$ are determined at a point on the
surface independently of the curve c and that the only thing that changes $\lambda$ at a
point is the direction of c determined by $\theta'$. Therefore, all curves with the same
direction through a given point on the surface have the same value of $\lambda$ at that
point.

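Continuing the evaluation of $\nu\cdot P''$ we have

$$\begin{aligned}
P_x &= i + jY_x + kZ_x \\
P_\theta &= jY_\theta + kZ_\theta \\
\nu &= P_x \times P_\theta = i(Y_xZ_\theta - Y_\theta Z_x) - jZ_\theta + kY_\theta \\
P_{xx} &= jY_{xx} + kZ_{xx} \\
P_{x\theta} &= jY_{x\theta} + kZ_{x\theta} \\
P_{\theta\theta} &= jY_{\theta\theta} + kZ_{\theta\theta}
\end{aligned}$$

The dot products in $\lambda$ are therefore

$$\begin{aligned}
\nu\cdot P_{xx} &= Y_\theta Z_{xx} - Z_\theta Y_{xx} \\
\nu\cdot P_{x\theta} &= Y_\theta Z_{x\theta} - Z_\theta Y_{x\theta} \\
\nu\cdot P_{\theta\theta} &= Y_\theta Z_{\theta\theta} - Z_\theta Y_{\theta\theta}
\end{aligned}$$

Obtaining the partials in these dot products

$$\begin{aligned}
Y &= r\sin\theta & Z &= r\cos\theta \\
Y_x &= Yr_x/r & Z_x &= Zr_x/r \\
Y_\theta &= Yr_\theta/r + Z & Z_\theta &= Zr_\theta/r - Y \\
Y_{xx} &= Yr_{xx}/r & Z_{xx} &= Zr_{xx}/r \\
Y_{x\theta} &= (Yr_{x\theta} + Zr_x)/r & Z_{x\theta} &= (Zr_{x\theta} - Yr_x)/r \\
Y_{\theta\theta} &= \{Y(r_{\theta\theta} - r) + 2Zr_\theta\}/r & Z_{\theta\theta} &= \{Z(r_{\theta\theta} - r) - 2Yr_\theta\}/r
\end{aligned}$$

The dot products then become

$$\begin{aligned}
\nu\cdot P_{xx} &= rr_{xx} \\
\nu\cdot P_{x\theta} &= rr_{x\theta} - r_xr_\theta \\
\nu\cdot P_{\theta\theta} &= rr_{\theta\theta} - 2r_\theta^2 - r^2
\end{aligned}$$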

Hence,

$$\begin{aligned}
\lambda &= rr_{xx} + 2(rr_{x\theta} - r_xr_\theta)\theta' + (rr_{\theta\theta} - 2r_\theta^2 - r^2)\theta'^2 \\
&= a\theta'^2 + 2b\theta' + c \\
&= a\left[\left(\theta' + \frac{b}{a}\right)^2 + \frac{ac - b^2}{a^2}\right]
\end{aligned}$$

It is clear that the sign of $\lambda$ is completely independent of $\theta'$ at points
for which $ac > b^2$. Hence, any curve is wrappable where $ac > b^2$ and $a < 0$, but the
surface is unwrappable if $ac > b^2$ and $a > 0$ anywhere. If $ac < b^2$, some curves will
be wrappable and others won't. In any case, given the radius function r, its partial
derivatives at a point, and the direction of a curve through that point, one can
immediately compute whether or not a taut filament following the curve will tend to lift
from the surface.
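
The pointwise liftoff test above lends itself to a short numerical sketch. The Python
below is an illustration only: the function names and the sample values of r and its
partial derivatives are hypothetical, while the coefficients a, b, c follow the quadratic
form of the liftoff function given above.

```python
def liftoff_coefficients(r, r_x, r_t, r_xx, r_xt, r_tt):
    """Coefficients of lambda = a*theta'^2 + 2*b*theta' + c (t stands for theta)."""
    a = r * r_tt - 2.0 * r_t**2 - r**2
    b = r * r_xt - r_x * r_t
    c = r * r_xx
    return a, b, c

def liftoff_function(a, b, c, theta_p):
    """Liftoff function; negative means the taut filament clings at this point."""
    return a * theta_p**2 + 2.0 * b * theta_p + c

def classify_point(a, b, c):
    """Classify a surface point by the sign pattern of the quadratic form."""
    if a * c > b * b:
        return "clings in every direction" if a < 0 else "lifts off in every direction"
    return "depends on direction"
```

For example, at the equator of a unit sphere (r = 1, r_xx = -1, all other derivatives
zero) every direction clings, whereas at a point with r_xx > 0 only sufficiently steep
directions do.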

Consider two special cases. For a surface of revolution, $r_\theta = 0$. Therefore,
$\lambda = rr'' - (r\theta')^2$ [1]. If $r'' < 0$ everywhere, all curves on the surface of
revolution are wrappable. Points for which $r'' > 0$ are also wrappable for curves with
$\theta'$ sufficiently large, namely $\theta'^2 > r''/r$. No surface of revolution is
unwrappable in the sense of liftoff.

For a cylinder, $r_x = 0$, and

$$\lambda = a\theta'^2 = (rr_{\theta\theta} - 2r_\theta^2 - r^2)\theta'^2$$

Hence, every curve on a cylinder is wrappable if $a < 0$ everywhere, but if $a > 0$
anywhere, the cylinder is unwrappable.

In this section the phrase "is wrappable" has meant "does not experience filament liftoff
or bridging during winding." In the next section, the definition of "wrappable" is
augmented by considering friction between filament and surface.

WRAPPABILITY CONDITION II - NO SLIPPAGE. If there were no friction between
filament  and  surface  (or  between  filament  and  filament  since  the  surface  is 
filament  after  the  first  layer  is  laid  down),  there  would  be  only  one  type  of 
path  along  which  one  might  wind  filament  without  the  filament  slipping  -  a  path 
with  no  transverse  (tangent  to  the  surface  and  perpendicular  to  the  filament) 
forces  acting  on  the  filament  -  a  path  which  curves  neither  left  nor  right  in 
the  surface  -  a  path  with  zero  geodesic  curvature  -  a  geodesic  path.  If  there 
were  no  friction  available,  and  only  geodesic  paths  could  be  wound,  there  would 
be  no  filament  winding  industry.  In  fact,  for  numerous  reasons,  it  is  seldom  if 
ever  possible  to  wind  along  geodesic  paths  in  practice  [1].  The  geodesic  path 
remains  as  an  ideal,  however,  and  in  this  section  the  degree  of  closeness  to 
this  ideal  is  quantified. 


Let

$$\phi = \text{force per unit length that filament exerts on surface} = \tau\kappa$$

where $\tau$ is the scalar tension in the filament and $\kappa$ is the vector curvature of
the filament.

Note that the curvature vector can be resolved into a component tangent to the surface and
a component normal to the surface, $\kappa = \kappa_g + \kappa_n$ [2], where the tangent
component of $\kappa$ is called the geodesic curvature vector and the normal component is
called the normal curvature vector.

The force that a small length of filament exerts on the surface is

$$\phi\,ds = \tau\kappa\,ds = (\kappa_g + \kappa_n)\tau\,ds = \kappa_g\tau\,ds + \kappa_n\tau\,ds$$

Now, in order to avoid slippage of this small section of filament, the ratio of the
magnitude of the tangent force to the magnitude of the normal force should be less than
$\mu$, the coefficient of friction:

$$\frac{\|\kappa_g\|\,\tau\,ds}{\|\kappa_n\|\,\tau\,ds} = \left|\frac{k_g}{k_n}\right| < \mu$$

(or more precisely, $0 \le \sigma = -|k_g|/k_n < \mu$, since we want $k_n < 0$). We call
this ratio of geodesic to normal curvature the slippage function $\sigma$. This function
measures how close a given path comes to the ideal geodesic path ($\sigma = 0$). The
evaluation of $\sigma$ is now detailed. First, if P is on the surface,


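$$dP = P_x\,dx + P_\theta\,d\theta$$

and

$$\begin{aligned}
ds^2 = dP\cdot dP &= P_x\cdot P_x\,dx^2 + 2P_x\cdot P_\theta\,dx\,d\theta + P_\theta\cdot P_\theta\,d\theta^2 \\
&= E\,dx^2 + 2F\,dx\,d\theta + G\,d\theta^2
\end{aligned}$$

Hence,

$$(s')^2 = E + 2F\theta' + G\theta'^2$$

for a point on a path on the surface.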

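We now define a Cartan frame [2-4] relative to the surface, i.e., an orthonormal basis
having two vectors tangent to the surface and a third normal to it. Let

$$e_1 = P_x/\|P_x\| = P_x/(P_x\cdot P_x)^{1/2} = P_x/E^{1/2}$$

and

$$e_3 = P_x\times P_\theta/\|P_x\times P_\theta\| = \nu/\|\nu\|$$

but from vector algebra,

$$(A\times B)\cdot(C\times D) = (A\cdot C)(B\cdot D) - (A\cdot D)(B\cdot C)$$

Hence,

$$\begin{aligned}
(P_x\times P_\theta)\cdot(P_x\times P_\theta) = \|P_x\times P_\theta\|^2 = \|\nu\|^2
&= (P_x\cdot P_x)(P_\theta\cdot P_\theta) - (P_x\cdot P_\theta)^2 \\
&= EG - F^2
\end{aligned}$$

Therefore,

$$e_3 = P_x\times P_\theta/(EG - F^2)^{1/2}$$

Now

$$e_2 = e_3\times e_1 = (P_x\times P_\theta)\times P_x/\left(E^{1/2}(EG - F^2)^{1/2}\right)$$

but again from vector algebra,

$$(A\times B)\times C = B(A\cdot C) - A(B\cdot C)$$

Hence,

$$(P_x\times P_\theta)\times P_x = P_\theta(P_x\cdot P_x) - P_x(P_x\cdot P_\theta) = EP_\theta - FP_x$$

and

$$e_2 = (EP_\theta - FP_x)/\left(E^{1/2}(EG - F^2)^{1/2}\right)$$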

Now, $e_1$ and $e_2$ span the tangent space (plane) at any point; therefore, the path
tangent vector t can be written

$$t = Ae_1 + Be_2$$

If t makes an angle $\omega$ with $e_1$,

$$e_1\cdot t = A = \cos\omega$$

and

$$e_2\cdot t = B = \sin\omega$$

Hence

$$t = e_1\cos\omega + e_2\sin\omega$$

where $\omega$ is the angle between the path and a meridian ($\theta$ = constant). Now

$$\kappa = \frac{dt}{ds} = \frac{de_1}{ds}\cos\omega + \frac{de_2}{ds}\sin\omega + (e_2\cos\omega - e_1\sin\omega)\frac{d\omega}{ds}$$

Let

$$T = e_3\times t = e_3\times(e_1\cos\omega + e_2\sin\omega) = e_2\cos\omega - e_1\sin\omega$$

Therefore

$$\kappa = \frac{de_1}{ds}\cos\omega + \frac{de_2}{ds}\sin\omega + T\frac{d\omega}{ds}$$

and

$$T\cdot\kappa = T\cdot\frac{de_1}{ds}\cos\omega + T\cdot\frac{de_2}{ds}\sin\omega + \frac{d\omega}{ds}$$

but since

$$e_1\cdot e_1 = e_2\cdot e_2 = 1$$

one has

$$e_1\cdot\frac{de_1}{ds} = e_2\cdot\frac{de_2}{ds} = 0$$

and since

$$e_1\cdot e_2 = 0$$

one has

$$\frac{de_1}{ds}\cdot e_2 = -\frac{de_2}{ds}\cdot e_1$$

We therefore have

$$T\cdot\frac{de_1}{ds} = e_2\cdot\frac{de_1}{ds}\cos\omega$$

and

$$T\cdot\frac{de_2}{ds} = -e_1\cdot\frac{de_2}{ds}\sin\omega = e_2\cdot\frac{de_1}{ds}\sin\omega$$

Therefore

$$T\cdot\kappa = e_2\cdot\frac{de_1}{ds}\cos^2\omega + e_2\cdot\frac{de_1}{ds}\sin^2\omega + \frac{d\omega}{ds} = e_2\cdot\frac{de_1}{ds} + \frac{d\omega}{ds}$$

but

$$\kappa = \kappa_g + \kappa_n \quad\text{and}\quad T\cdot\kappa_n = 0$$

hence,

$$T\cdot\kappa = T\cdot\kappa_g = \|\kappa_g\| = k_g$$

and the geodesic curvature is

$$k_g = e_2\cdot\frac{de_1}{ds} + \frac{d\omega}{ds} = (e_2\cdot e_1' + \omega')/s'$$


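Now

$$\frac{\partial}{\partial u}(P_u\cdot P_v) = P_v\cdot P_{uu} + P_u\cdot P_{uv} = P_v\cdot P_{uu} + \tfrac{1}{2}(P_u\cdot P_u)_v$$

or

$$P_v\cdot P_{uu} = (P_u\cdot P_v)_u - \tfrac{1}{2}(P_u\cdot P_u)_v$$

Hence

$$P_x\cdot P_{\theta\theta} = F_\theta - \tfrac{1}{2}G_x$$

and

$$P_\theta\cdot P_{xx} = F_x - \tfrac{1}{2}E_\theta$$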


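Recalling

$$e_1 = P_xE^{-1/2}$$

and applying d/dx,

$$e_1' = P_x'E^{-1/2} + P_x(E^{-1/2})' = (P_{xx} + P_{x\theta}\theta')E^{-1/2} + P_x(E^{-1/2})'$$

but

$$e_2\cdot P_x = 0$$

Therefore

$$e_2\cdot e_1' = (e_2\cdot P_{xx} + e_2\cdot P_{x\theta}\,\theta')/E^{1/2}$$

Now,

$$\begin{aligned}
e_2\cdot P_{xx} &= (EP_\theta - FP_x)\cdot P_{xx}/(E^{1/2}\|\nu\|)
= \{E(F_x - \tfrac{1}{2}E_\theta) - F(\tfrac{1}{2}E_x)\}/(E^{1/2}\|\nu\|) \\
e_2\cdot P_{x\theta} &= (EP_\theta - FP_x)\cdot P_{x\theta}/(E^{1/2}\|\nu\|)
= \{E(\tfrac{1}{2}G_x) - F(\tfrac{1}{2}E_\theta)\}/(E^{1/2}\|\nu\|)
\end{aligned}$$

and therefore

$$e_2\cdot e_1' = \{E(2F_x - E_\theta) - FE_x + (EG_x - FE_\theta)\theta'\}/(2E\|\nu\|)$$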
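Now, consider the meridian angle $\omega$:

$$\begin{aligned}
e_1\cdot t = \cos\omega &= P_x\cdot t/E^{1/2} \\
&= P_x\cdot\frac{dP}{ds}\Big/E^{1/2} \\
&= P_x\cdot P'/(s'E^{1/2}) \\
&= P_x\cdot(P_x + P_\theta\theta')/(s'E^{1/2}) \\
&= (E + F\theta')/(s'E^{1/2})
\end{aligned}$$

Therefore,

$$\begin{aligned}
\tan\omega &= \frac{\{(s')^2E - (E + F\theta')^2\}^{1/2}}{E + F\theta'} \\
&= \frac{\{E(E + 2F\theta' + G\theta'^2) - (E^2 + 2EF\theta' + F^2\theta'^2)\}^{1/2}}{E + F\theta'} \\
&= \frac{(EG - F^2)^{1/2}\theta'}{E + F\theta'} = \frac{\|\nu\|\theta'}{E + F\theta'}
\end{aligned}$$

Solving for $\theta'$, we have

$$\theta' = \frac{E\tan\omega}{(EG - F^2)^{1/2} - F\tan\omega}$$

Therefore, $\sigma$ can be computed in terms of x, $\theta$, $\omega$, and $\omega'$.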

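The basic metric coefficients in terms of our parameterization are now computed:

$$P = ix + jY + kZ$$

$$P_x = i + jY_x + kZ_x, \qquad Y_x = Yr_x/r, \qquad Z_x = Zr_x/r$$

hence,

$$E = P_x\cdot P_x = 1 + Y_x^2 + Z_x^2 = 1 + (Y^2 + Z^2)\frac{r_x^2}{r^2} = 1 + r_x^2$$

$$P_\theta = jY_\theta + kZ_\theta, \qquad Y_\theta = Yr_\theta/r + Z, \qquad Z_\theta = Zr_\theta/r - Y$$

Hence

$$\begin{aligned}
F = P_x\cdot P_\theta &= Y_xY_\theta + Z_xZ_\theta \\
&= \frac{Yr_x}{r}\left(\frac{Yr_\theta}{r} + Z\right) + \frac{Zr_x}{r}\left(\frac{Zr_\theta}{r} - Y\right) \\
&= \frac{r_xr_\theta}{r^2}(Y^2 + Z^2) = r_xr_\theta
\end{aligned}$$

Also

$$\begin{aligned}
G = P_\theta\cdot P_\theta &= Y_\theta^2 + Z_\theta^2 \\
&= \frac{Y^2r_\theta^2}{r^2} + \frac{2YZr_\theta}{r} + Z^2 + \frac{Z^2r_\theta^2}{r^2} - \frac{2YZr_\theta}{r} + Y^2 \\
&= r^2 + r_\theta^2
\end{aligned}$$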

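Now some of the more important relations can be summarized:

$$\begin{aligned}
\lambda &= rr_{xx} + 2(rr_{x\theta} - r_xr_\theta)\theta' + (rr_{\theta\theta} - 2r_\theta^2 - r^2)(\theta')^2 \\
(s')^2 &= E + 2F\theta' + G(\theta')^2 \\
\|\nu\|^2 &= EG - F^2 \\
\sigma &= \frac{s'\|\nu\|}{-\lambda}\,|e_2\cdot e_1' + \omega'| \\
e_2\cdot e_1' &= \{E(2F_x - E_\theta) - FE_x + (EG_x - FE_\theta)\theta'\}/(2E\|\nu\|) \\
\theta' &= \frac{E\tan\omega}{\|\nu\| - F\tan\omega} \\
E &= 1 + r_x^2, \qquad F = r_xr_\theta, \qquad G = r^2 + r_\theta^2
\end{aligned}$$

Now consider the two special cases addressed before. First, the general cylinder
($r_x = 0$):

$$\begin{aligned}
& E = 1, \quad F = 0, \quad G = r^2 + r_\theta^2, \quad G_x = 0 = E_x = E_\theta \\
& \|\nu\| = G^{1/2} \\
& e_2\cdot e_1' = 0 \\
& \theta' = \frac{\tan\omega}{G^{1/2}} \\
& s' = (1 + \tan^2\omega)^{1/2} = \sec\omega \\
& \lambda = (rr_{\theta\theta} - 2r_\theta^2 - r^2)\frac{\tan^2\omega}{G}
\end{aligned}$$

$$\sigma = \frac{\sec\omega\;G^{1/2}\,G\,|\omega'|}{-(rr_{\theta\theta} - 2r_\theta^2 - r^2)\tan^2\omega}
= \frac{(r^2 + r_\theta^2)^{3/2}\csc\omega\cot\omega\,|\omega'|}{r^2 + 2r_\theta^2 - rr_{\theta\theta}}$$

and since $\csc\omega\cot\omega\;\omega' = \frac{d}{dx}(-\csc\omega)$, hence,

$$\sigma = \frac{(r^2 + r_\theta^2)^{3/2}\,|u'|}{r^2 + 2r_\theta^2 - rr_{\theta\theta}}$$

where $u = \csc\omega$. Note that u = 1 at turning points and u > 1 between turning points.
(A turning point is defined as a point at which $\theta' = \infty$ or $i\cdot t = 0$.)

Also note that at points for which $u' = 0$, the path is geodesic. In addition,
$\sigma \to 0$ if $r_{\theta\theta} \to -\infty$, while $r_\theta$ and $u'$ are bounded.
This can be called a "knife edge" condition where $\sigma$ is zero due to infinite normal
curvature instead of zero geodesic curvature. Now, consider the general surface of
revolution ($r_\theta = 0$):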

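$$\begin{aligned}
& E = 1 + r'^2, \quad F = 0, \quad G = r^2, \quad E_\theta = 0 = G_\theta \\
& \|\nu\| = (EG)^{1/2} = r\sqrt{1 + r'^2} \\
& \theta' = \frac{E\tan\omega}{\|\nu\|} = \left(\frac{E}{G}\right)^{1/2}\tan\omega \\
& e_2\cdot e_1' = EG'\theta'/(2E\|\nu\|) = \frac{G'}{2(EG)^{1/2}}\left(\frac{E}{G}\right)^{1/2}\tan\omega = \frac{G'}{2G}\tan\omega \\
& (s')^2 = E + G\cdot\frac{E}{G}\tan^2\omega \\
& s' = E^{1/2}\sec\omega = (1 + r'^2)^{1/2}\sec\omega
\end{aligned}$$

Now,

$$\lambda = rr'' - r^2(\theta')^2 = rr'' - r^2\cdot\frac{E}{G}\tan^2\omega = rr'' - (1 + r'^2)\tan^2\omega$$

$$\begin{aligned}
\sigma &= \frac{E^{1/2}\sec\omega\,(EG)^{1/2}}{-(rr'' - E\tan^2\omega)}\left|\frac{G'}{2G}\tan\omega + \omega'\right| \\
&= \frac{r(1 + r'^2)\sec\omega\tan\omega}{(1 + r'^2)\tan^2\omega - rr''}\left|\frac{r'}{r} + \cot\omega\;\omega'\right|
\end{aligned}$$

and since

$$\frac{r'}{r} + \cot\omega\;\omega' = \frac{d}{dx}\ln r + \frac{d}{dx}\ln\sin\omega = \frac{d}{dx}\ln(r\sin\omega)$$

one has

$$\sigma = \frac{r(1 + r'^2)\sec\omega\tan\omega}{(1 + r'^2)\tan^2\omega - rr''}\left|\frac{d}{dx}\ln(r\sin\omega)\right|$$

It is clear that if $r\sin\omega$ = constant, $\sigma$ will be zero and the path will be
geodesic (Clairaut).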
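
As a numerical cross-check, the general slippage relations can be sketched in Python and
compared against the two special cases. This is an illustration under stated assumptions:
the function and argument names are hypothetical, and the derivatives E_x, E_theta, F_x,
G_x are obtained here by differentiating E = 1 + r_x^2, F = r_x r_theta,
G = r^2 + r_theta^2 directly (they are not written out in the paper).

```python
import math

def slippage(r, r_x, r_t, r_xx, r_xt, r_tt, omega, omega_p):
    """Slippage function sigma at one point of a path (t stands for theta).

    omega is the meridian angle in radians, omega_p its derivative d(omega)/dx.
    """
    E = 1.0 + r_x**2
    F = r_x * r_t
    G = r**2 + r_t**2
    # Derivatives of the metric coefficients (assumed, from the forms of E, F, G).
    E_x = 2.0 * r_x * r_xx
    E_t = 2.0 * r_x * r_xt
    F_x = r_xx * r_t + r_x * r_xt
    G_x = 2.0 * r * r_x + 2.0 * r_t * r_xt
    nu = math.sqrt(E * G - F * F)                      # norm of the surface normal
    tan_w = math.tan(omega)
    theta_p = E * tan_w / (nu - F * tan_w)             # path direction theta'
    s_p = math.sqrt(E + 2.0 * F * theta_p + G * theta_p**2)
    lam = (r * r_xx + 2.0 * (r * r_xt - r_x * r_t) * theta_p
           + (r * r_tt - 2.0 * r_t**2 - r**2) * theta_p**2)   # liftoff function
    e2_e1p = (E * (2.0 * F_x - E_t) - F * E_x
              + (E * G_x - F * E_t) * theta_p) / (2.0 * E * nu)
    return s_p * nu * abs(e2_e1p + omega_p) / (-lam)
```

On a circular cylinder of unit radius at omega = 45 degrees this reproduces the closed
form above (sigma equal to the square root of 2 times |omega'|), and on a cone with omega'
chosen so that r sin(omega) is constant it returns zero to rounding error, consistent with
Clairaut's relation. A path would then be judged slip-free where sigma is below the
friction coefficient.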

One can define a quasi-geodesic path on a surface of revolution by replacing Clairaut's
relation [2,3,5] with $r\sin\omega = r_0$ [1], where the function $r_0$ has the following
properties:

•  $r_0(x) = r(x)$ at exactly two values of x (turning points), and

•  $r_0(x) < r(x)$ at all points between the turning points.

The function $r_0$ is called the polar radius function, because it is the radius of the
surface at the boundaries of the uncovered polar regions [1]. Now, $\omega$ will be
eliminated in favor of $r_0$. Since

$$\sin\omega = \frac{r_0}{r}$$

one has that

$$\sec\omega = \frac{r}{(r^2 - r_0^2)^{1/2}}, \qquad
\tan\omega = \frac{r_0}{(r^2 - r_0^2)^{1/2}}, \qquad
\sec\omega\tan\omega = \frac{rr_0}{r^2 - r_0^2}$$

and

$$\frac{d}{dx}\ln r_0 = \frac{r_0'}{r_0}$$

Hence, one finds after simplification that

$$\sigma = \frac{r^2\,|r_0'|}{r_0^2 - rr''\left(\dfrac{r^2 - r_0^2}{1 + r'^2}\right)}$$

Note the following: $\sigma = |r_0'|$ at turning points; $r_0' = 0$ at geodesic points;
$\sigma = (r^2/r_0^2)\,|r_0'|$ if r is linear; $\sigma \to 0$ if $r'' \to -\infty$ while
$r'$ and $r_0'$ are bounded (knife edge); and positivity of the denominator in $\sigma$
implies that

$$r_0^2 > \frac{r^3r''}{1 + r'^2 + rr''}$$

It has been shown how the slippage function $\sigma$ can be computed for a general closed
surface ($r_x \ne 0 \ne r_\theta$) and what simplifications take place in the F = 0 cases.
It should be emphasized, however, that $\sigma$ is more than just a number to be compared
with the coefficient of friction $\mu$ to determine whether or not slippage occurs. The
slippage function $\sigma$ measures pointwise path quality and should ultimately be usable
to synthesize or define quality wrappable paths on general closed surfaces.


Abstract. Vibrations of linear, conservative distributed-parameter systems can be
suppressed by a set of discrete actuators and an optimal control algorithm which minimizes
the current value of a positive-definite, time-dependent objective function. The control
procedure depends on the initial data and is independent of the ultimate outcome.

The control algorithm that is developed has a limited time interval of applicability about
the arbitrary initial point. Outside this interval, negative damping and reduced stiffness
may arise. However, the arbitrariness of the initial point allows re-initiation at the
end-point and thus provides a means of continuation of control.

Using variational methods and the general Duhamel integral representation of the solution,
an explicit representation for the infinite-dimensional gain matrices is obtained.
Examples of successful control under conditions of resonant loading and moving loads are
carried out. The control algorithm can also be modified to control a beam-column subjected
to transverse loads on the span before the beam is buckled. Due to loss of modal stiffness
of the buckled beam, the controller can only provide modal damping to the structure. The
controller cannot suppress the vibrations of a buckled beam. The question of the effect of
control force spillover into higher uncontrolled modes of the system is considered.


Introduction. Active control of continuous flexible structures by means of the classical
variational theories has been considered by many authors, among them Barnes (1971), Balas
(1978), Soong (1982), Leipholz and Abdel-Rohman (1986), and Abdel-Rohman and Nayfeh
(1987). For systems governed by self-adjoint partial differential equations the control
can be naturally formulated in terms of the spatial modal functions and modal amplitudes.

In a series of papers Meirovitch et al. (1980, 1982, 1983, 1985) have developed the method
of independent modal space control (IMSC). In this method each mode of the system is
considered to be controllable independently. The distributed response of the controlled
system and the forces of the actuators are obtained from the controlled mode shapes and
the modal control forces using the orthogonality properties of the eigenfunctions.

Assuming that a finite number of sensors and actuators are placed on or about the
structure to monitor the response and apply the control forces, one would expect that the
control algorithm should be causal, i.e., it should depend only on the information
available up to the time at which control is being exercised. Yang et al. (1985, 1987)
have considered "instantaneous control" of discrete dynamic systems. They consider the
discretized state equations of motion and represent the solution in terms of the unknown
control forces. Then optimization of the Hamiltonian of the system yields as necessary
conditions the optimality algorithm and the response state vector. Komkov (1972) also
speaks of "instantly optimal control" algorithms that are obtained by a limiting process
from the time optimal law reducing the energy of the system at the maximum possible rate.

The problem of active control of a distributed-parameter system with discrete sets of
sensors and actuators is of the category of coupled modal control (Meirovitch, 1987). Here
an optimal, coupled-modal control algorithm is developed for continuous self-adjoint
structures by minimizing a time-dependent weighted sum of the kinetic energy, potential
energy, and the control effort. This leads to a causal optimal algorithm whereby control
forces are determined solely on the basis of information available up to the time at which
control is being implemented.

The behavior of the controlled structure will be governed by a system which is generally
non-self-adjoint and with modified damping and stiffness matrices of infinite order. For
the system considered the elements of the matrices consist of harmonics with modal
periods. It turns out that the control algorithm that is developed has a limited time
interval of applicability about the arbitrary initial point. Outside this interval
negative damping and reduced stiffness may arise. However, the arbitrariness of the
initial point allows definition of the end-point of the interval of integration as a new
initial point and thus provides a means of continuation of the application of the control.

The  method  is  used  to  suppress  the  vibration  of  an  undamped  beam  subjected 
to  a  periodic  load  resonant  with  the  fundamental  frequency  of  the  beam.  The  control 
of  the  beam  under  a  moving  load  is  also  considered.  Both  cases  show  that  the  control 
interval  affects  the  behavior  of  the  controlled  system  significantly.  The  control 
algorithm  is  applied  to  a  beam-column  subjected  to  an  impulse  load  at  midspan. 
The  vibrations  are  suppressed  when  the  axial  load  is  below  the  first  buckling  load  of 
the  column. 

The Control Problem. Consider a flexible elastic medium such as a plate, a rod, a column,
a membrane or a cable. Let $z(r,t)$ denote the deflection of the structure, with r and t
respectively representing the position of a material point in a domain D and the time.
Assume n actuators are at locations $r_i$, i = 1,2,3,...,n, with control forces $u_i(t)$.
Motion of the controlled structure will be described by

$$m(r)\,\ddot{z} + Lz = f(r,t) + \sum_{i=1}^{n} u_i(t)\,\delta(r - r_i), \qquad r \in D, \quad t > t_0 \tag{1}$$

where $m(r)$ is the mass distribution, $f(r,t)$ is the external excitation, $\delta$ is
the Dirac delta function and L is a self-adjoint spatial operator with self-adjoint
boundary conditions. The initial time $t_0$ is any arbitrary point on the time scale at
which the initial conditions

$$z_0(r) = z(r,t_0), \qquad \dot{z}_0(r) = \dot{z}(r,t_0) \tag{2}$$

are known. In (1) and (2) a dot denotes time derivative.

Associated with the above system is the positive definite potential energy $U(t)$ whose variation $\delta U$ due to a small variation $\delta z(r,t)$ is given by

$$\delta U = \int_D Lz\,\delta z\,dD \tag{3}$$

A positive performance index $J$ is defined for the current value of time and consists of the kinetic energy, the potential energy and the control effort:

$$J(t) = Q_1\left(\int_D \tfrac{1}{2}\,m\,\dot z^2\,dD\right) + Q_2\,U + \tfrac{1}{2}\sum_{i=1}^{n} R_i\,u_i^2 \tag{4}$$

The positive functions $Q_1(t)$, $Q_2(t)$ and $R_i(t)$ determine the level of effectiveness of the control criteria and can be assigned arbitrary values by the designer. The control forces $u_i$ will be determined such that $J(t)$ is a minimum subject to the constraint of the equations of motion (1) - (2). This minimization implies that the system will acquire a state as near its state of rest $z = \dot z = 0$ as is possible with limited control effort.

Using the eigenfunction expansion, the displacement $z$ can be represented by an infinite sum of the products of orthonormalized modes $\phi_k(r)$ and the corresponding modal amplitudes $a_k(t)$, $k = 1, 2, 3, \ldots$. When this representation is used in (1) and (2), the following system is obtained

$$\ddot{\mathbf a} + \mathbf\Omega^2\,\mathbf a = \mathbf f + \mathbf B\,\mathbf u, \quad t > t_0 \tag{5}$$

$$\mathbf a(t_0) = \mathbf a_0, \quad \dot{\mathbf a}(t_0) = \dot{\mathbf a}_0 \tag{6}$$

where $\mathbf a$, $\mathbf f$, $\mathbf a_0$, $\dot{\mathbf a}_0$ are infinite dimensional vectors with their elements given respectively by

$$(a_j,\ a_{0j},\ \dot a_{0j}) = \int_D m\,(z,\ z_0,\ \dot z_0)\,\phi_j\,dD, \quad f_j = \int_D f\,\phi_j\,dD, \quad j = 1, 2, 3, \ldots \tag{7}$$

and $\mathbf\Omega = \mathrm{diag}(\omega_j)$, $j = 1, 2, 3, \ldots$, where $\omega_j$ are the modal frequencies as determined from (see Meirovitch 1967)

$$\omega_j^2 = \int_D \phi_j\,L\phi_j\,dD, \quad j = 1, 2, 3, \ldots \tag{8}$$

Also $\mathbf u = (u_1, u_2, u_3, \ldots, u_n)^T$ is the vector of actuator forces and $\mathbf B$ is a matrix of infinite rows and $n$ columns with the elements of the $j$th row given by

$$B_{ji} = \phi_j(r_i), \quad i = 1, 2, 3, \ldots, n \tag{9}$$

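Solution of (5) - (6) with the aid of the convolution integral can be represented by

$$\mathbf a - \mathbf C(t - t_0)\,\mathbf a_0 - \mathbf S(t - t_0)\,\dot{\mathbf a}_0 = \mathbf S * (\mathbf f + \mathbf B\,\mathbf u) = \int_{t_0}^{t} \mathbf S(t - \tau)\,[\mathbf f(\tau) + \mathbf B\,\mathbf u(\tau)]\,d\tau \tag{10}$$

where $\mathbf C$ and $\mathbf S$ are infinite dimensional diagonal matrices of the modal harmonics given by

$$\mathbf S = \mathrm{diag}[\omega_j^{-1}\sin\omega_j t], \quad \mathbf C = \dot{\mathbf S} = \mathrm{diag}[\cos\omega_j t], \quad j = 1, 2, 3, \ldots \tag{11}$$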
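As a hedged illustration of the convolution form (10) for a single mode, the following Python sketch evaluates one modal amplitude by numerical quadrature and compares it with the closed-form response to a constant modal force. The function name `modal_response`, the choice $t_0 = 0$, and the numerical values are assumptions made for this example only.

```python
import math

def modal_response(omega, a0, v0, force, t, steps=2000):
    """One modal amplitude from the convolution form of Eq. (10), with t0 = 0.

    a(t) = cos(w t) a0 + sin(w t)/w * v0
           + integral_0^t sin(w (t - tau))/w * force(tau) d tau
    The integral is evaluated with the composite trapezoidal rule.
    """
    c = math.cos(omega * t)
    s = math.sin(omega * t) / omega
    h = t / steps
    total = 0.0
    for k in range(steps + 1):
        tau = k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * math.sin(omega * (t - tau)) / omega * force(tau)
    return c * a0 + s * v0 + h * total

# Illustrative values (assumed): one mode driven by a constant modal force g.
omega, a0, v0, g, t = 1.94, 0.3, -0.1, 2.0, 1.7
numeric = modal_response(omega, a0, v0, lambda tau: g, t)
exact = (a0 * math.cos(omega * t)
         + v0 * math.sin(omega * t) / omega
         + g * (1.0 - math.cos(omega * t)) / omega ** 2)
print(numeric, exact)
```

The two printed values should agree to several digits, which is a quick consistency check on the roles of $\mathbf C$ and $\mathbf S$ in (10) and (11).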

Control Algorithm. Introducing the arbitrary variations $\delta\mathbf u$ in the interval $[t_0, t]$ we have for the variation $\delta\mathbf a$ from (10)

$$\delta\mathbf a = \mathbf S * \mathbf B\,\delta\mathbf u \tag{12}$$

By differentiating (12) one obtains

$$\delta\dot{\mathbf a} = \mathbf C * \mathbf B\,\delta\mathbf u \tag{13}$$

It is understood that certain modes may be uncontrollable due to coincidence of the actuator locations with nodal points. Also, it may be advantageous to limit the number of modes that participate in the control in order to achieve better controller design. Therefore variations of these uncontrolled modes will be zero either automatically, $\phi_j(r_i) = 0$, or by choice. In all cases the variations $\delta\dot z$ and $\delta z$ can be written from the eigenfunction representation of $z$ as

$$\delta\dot z = \sum_{j=1}^{\infty} \phi_j\,\delta\dot a_j, \quad \delta z = \sum_{j=1}^{\infty} \phi_j\,\delta a_j \tag{14}$$

where it is understood that certain terms in the sums may vanish. Then from (3) - (4),

$$\delta J = \int_D (Q_1\,m\,\dot z\,\delta\dot z + Q_2\,Lz\,\delta z)\,dD + \sum_{i=1}^{n} R_i\,u_i\,\delta u_i \tag{15}$$

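These together with the relations defining the eigenfunctions, the frequencies and the orthogonality conditions

$$L\phi_j = m\,\omega_j^2\,\phi_j, \quad \int_D m\,\phi_j\,\phi_k\,dD = \delta_{jk}, \quad j, k = 1, 2, 3, \ldots \tag{16}$$

yield the modal equivalent of (15)

$$\delta J = Q_1\,\dot{\mathbf a}^T\,\delta\dot{\mathbf a} + Q_2\,\mathbf a^T\,\mathbf\Omega^2\,\delta\mathbf a + \mathbf u^T\,\mathbf R\,\delta\mathbf u \tag{17}$$

where $\mathbf R = \mathrm{diag}[R_1, R_2, R_3, \ldots, R_n]$.

Substitution of (11) - (13) into (17) yields

$$\delta J = Q_1(t)\,\dot{\mathbf a}^T(t)\int_{t_0}^{t} \mathbf C(t - \tau)\,\mathbf B\,\delta\mathbf u(\tau)\,d\tau + Q_2(t)\,\mathbf a^T(t)\,\mathbf\Omega^2\int_{t_0}^{t} \mathbf S(t - \tau)\,\mathbf B\,\delta\mathbf u(\tau)\,d\tau + \mathbf u^T(t)\,\mathbf R(t)\,\delta\mathbf u(t) \tag{18}$$

By introducing the Heaviside unit step function $H(t)$ and the delta function $\delta(t)$ one can represent (18) as a single integral over the extended interval $t_0 < \tau < \infty$. Thus

$$\delta J = \int_{t_0}^{\infty} \Big\{ H(t - \tau)\big[Q_1(t)\,\dot{\mathbf a}^T(t)\,\mathbf C(t - \tau) + Q_2(t)\,\mathbf a^T(t)\,\mathbf\Omega^2\,\mathbf S(t - \tau)\big]\mathbf B + \delta(t - \tau)\,\mathbf u^T(\tau)\,\mathbf R(\tau) \Big\}\,\delta\mathbf u(\tau)\,d\tau \tag{19}$$

Vanishing of $\delta J$ for arbitrary $\delta\mathbf u$ implies the vanishing of the coefficient of $\delta\mathbf u(\tau)$ in the integrand in (19). This condition is

$$H(t - \tau)\big[Q_1(t)\,\dot{\mathbf a}^T(t)\,\mathbf C(t - \tau) + Q_2(t)\,\mathbf a^T(t)\,\mathbf\Omega^2\,\mathbf S(t - \tau)\big]\mathbf B + \delta(t - \tau)\,\mathbf u^T(\tau)\,\mathbf R(\tau) = 0, \quad \text{for } t > t_0,\ \tau > t_0 \tag{20}$$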

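Integrating this equation with respect to $\tau$, from $t_0$ to $t$, one obtains

$$\big\{Q_1\,\dot{\mathbf a}^T\,\mathbf S(t - t_0) + Q_2\,\mathbf a^T\,[\mathbf I - \mathbf C(t - t_0)]\big\}\,\mathbf B + \mathbf u^T\,\mathbf R = 0, \quad t > t_0 \tag{21}$$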

which  determines  the  control  force  in  the  time  domain. 
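The integration that leads to (21) rests on two elementary integrals of the modal harmonics defined in (11). The following worked step is a sketch of that intermediate algebra for a single mode with frequency $\omega_j$, using $\tau$ as the integration variable; it is supplied here for clarity and is not text from the paper.

```latex
\int_{t_0}^{t} \cos\omega_j(t-\tau)\,d\tau
  = \frac{\sin\omega_j(t-t_0)}{\omega_j},
\qquad
\omega_j^{2}\int_{t_0}^{t} \frac{\sin\omega_j(t-\tau)}{\omega_j}\,d\tau
  = 1 - \cos\omega_j(t-t_0).
```

In matrix form these read $\int_{t_0}^{t}\mathbf C(t-\tau)\,d\tau = \mathbf S(t-t_0)$ and $\mathbf\Omega^2\int_{t_0}^{t}\mathbf S(t-\tau)\,d\tau = \mathbf I - \mathbf C(t-t_0)$, which are the factors multiplying $Q_1\dot{\mathbf a}^T$ and $Q_2\mathbf a^T$ in (21), while the delta function term contributes $\mathbf u^T(t)\,\mathbf R(t)$.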

Assuming that a certain ordered number $N_c$ of modes are to participate in the control algorithm, we denote them by $\mathbf a_c = (a_1, a_2, \ldots, a_{N_c})^T$. The remaining modes will be denoted by $\mathbf a_r$. In a corresponding manner vectors and matrices will be partitioned.

Then (21) can be solved for the control force

$$\mathbf u = -Q_1\,\mathbf D\,\dot{\mathbf a}_c - Q_2\,\mathbf E\,\mathbf a_c \tag{23}$$

where

$$\mathbf D = \mathbf R^{-1}\,\mathbf B_c^T\,\mathbf S_c(t - t_0), \quad \mathbf E = \mathbf R^{-1}\,\mathbf B_c^T\,[\mathbf I_c - \mathbf C_c(t - t_0)] \tag{24}$$
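A minimal sketch of how the gains in (24) and the control force in (23) might be evaluated is given below in plain Python. The function names `gains` and `control_force`, the storage of $\mathbf B_c$ as a list of rows (one row per controlled mode, one column per actuator), and the sample numbers are assumptions for illustration; they do not come from the paper.

```python
import math

def gains(B_c, R, omegas, dt):
    """Return (D, E) from Eq. (24) as n_actuators x n_modes nested lists.

    B_c[j][i] = phi_j(r_i) for controlled mode j and actuator i,
    R[i] is the weight of actuator i, omegas[j] the modal frequency,
    and dt = t - t0 is the elapsed time in the current control interval.
    """
    n_act, n_modes = len(R), len(omegas)
    D = [[0.0] * n_modes for _ in range(n_act)]
    E = [[0.0] * n_modes for _ in range(n_act)]
    for i in range(n_act):
        for j in range(n_modes):
            w = omegas[j]
            D[i][j] = B_c[j][i] * math.sin(w * dt) / (w * R[i])
            E[i][j] = B_c[j][i] * (1.0 - math.cos(w * dt)) / R[i]
    return D, E

def control_force(D, E, a, a_dot, q1, q2):
    """u = -Q1 D a_dot - Q2 E a, Eq. (23), for scalar weights q1, q2."""
    return [-q1 * sum(d * v for d, v in zip(d_row, a_dot))
            - q2 * sum(e * x for e, x in zip(e_row, a))
            for d_row, e_row in zip(D, E)]

# Illustrative call: one controlled mode, two actuators (assumed numbers).
D, E = gains([[1.0, 2.0]], [1.0, 2.0], [2.0], 0.5)
u = control_force(D, E, [1.0], [0.0], 1.0, 1.0)
print(u)
```

Both gains vanish at `dt = 0`, so the control force builds up from zero at the start of every control interval.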

Meirovitch and Baruh (1985) developed spatial modal filters for reconstruction of the modal states $\mathbf a_c$ and $\dot{\mathbf a}_c$. Using orthogonality of eigenfunctions they showed that the problem of observer spillover that was discussed by Balas (1978) can be circumvented. Assuming that the effect of time delay due to computation on the system is small enough to be negligible, the behavior of the controlled modes is determined from (5) and (23) to be

$$\ddot{\mathbf a}_c + Q_1\,\mathbf B_c\,\mathbf D\,\dot{\mathbf a}_c + (\mathbf\Omega_c^2 + Q_2\,\mathbf B_c\,\mathbf E)\,\mathbf a_c = \mathbf f_c \tag{25}$$

The residual modes are subject to control spillover and are determined from

$$\ddot{\mathbf a}_r + \mathbf\Omega_r^2\,\mathbf a_r = \mathbf f_r - Q_1\,\mathbf B_r\,\mathbf D\,\dot{\mathbf a}_c - Q_2\,\mathbf B_r\,\mathbf E\,\mathbf a_c \tag{26}$$

It is evident that the terms corresponding to $Q_1$, $Q_2$ in the performance index $J$ respectively modify the damping and the stiffness of the system. The gain matrices $\mathbf E$, $\mathbf D$ are non-symmetric, non-diagonal infinite dimensional matrices that couple all the modal amplitudes. The governing equations for the controlled system are non-self-adjoint, which raises the possibility that, in the event of instability, the transition may be through flutter.

The elements of the gain matrices consist of the modal harmonics $\cos\omega_j(t - t_0)$ and $\sin\omega_j(t - t_0)$ which initially, i.e., near $t = t_0$, are non-negative. As $t$ increases, the higher harmonics begin to change sign, thus creating negative damping and reducing the stiffness of the system. It thus becomes clear that the optimal algorithm derived here is valid for an interval of time during which the occurrence of negative damping or reduction of stiffness has not caused amplification of the response of those higher modes that participate significantly in the dynamics of the system. Therefore the control interval, i.e., $t - t_0$, should not exceed one-half of the period of the highest controlled mode, i.e.,

$$(t - t_0) \le \frac{\pi}{\omega_{N_c}} \tag{27}$$

Control of a Simply Supported Beam. Consider a uniform beam hinged at both ends and controlled by four actuators, Fig. 1. The locations of the actuators, $x_i$, $i = 1, 2, 3, 4$, are 13, 25, 35 and 47 m from the left end of the beam, respectively. We assume that the length of the beam is 60 m, its bending stiffness $EI$ is $5 \times 10^5$ N·m² and its mass per unit length $m$ is 1.0 kg/m. Then the eigenvalues $\omega_i^2$ and normalized eigenfunctions are given by

$$\omega_i^2 = 0.039\,(i\pi)^4, \quad \phi_i(x) = 0.183\,\sin(i\pi x/60), \quad i = 1, 2, \ldots$$

The control parameters are assigned fixed values $Q_1 = Q_2 = 5000$ and $R_i = 1$, $i = 1, 2, 3, 4$. The lowest ten modes of the beam constitute the controlled and the uncontrolled modes.
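The half-period bound on the control interval can be checked against Table 1 with a few lines of Python. The sketch below assumes the standard simply supported beam frequencies $\omega_j = (j\pi/L)^2\sqrt{EI/m}$ with the beam data quoted above; the function names are illustrative.

```python
import math

L_BEAM = 60.0   # beam length (m)
EI = 5.0e5      # bending stiffness (N*m^2)
M = 1.0         # mass per unit length (kg/m)

def modal_frequency(j):
    """Natural frequency (rad/s) of mode j of a simply supported uniform beam."""
    return (j * math.pi / L_BEAM) ** 2 * math.sqrt(EI / M)

def max_control_interval(j):
    """Half of the period of mode j, i.e. pi / omega_j (seconds)."""
    return math.pi / modal_frequency(j)

table = [round(max_control_interval(j), 3) for j in range(1, 11)]
print(table)
```

Under these assumptions the computed values reproduce the entries of Table 1 (1.621 s for mode 1 down to 0.016 s for mode 10), and the first frequency comes out near the 1.95 rad/s used for the resonant load.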

In the first example, the beam is subjected to a unit cyclic load, resonant with the first mode of the beam, acting at midspan. The external excitation in Eq. (1) is expressed as $f(x,t) = \sin(1.95t)\,\delta(x - 30)$. The control intervals are 0.01, 0.08 and 0.80 seconds and only the first mode is subjected to control. Control of additional modes is not called for as they are not being excited. The results that are presented refer to the deflection at midspan of the beam as shown in Fig. 2. It can be seen that, unlike that of the uncontrolled beam, the response remains bounded. From Table 1, the maximum control gain for the first mode occurs at a control interval of approximately 0.81 seconds. Thus the beam is almost completely controlled, without any oscillations, by setting the control interval, i.e., $t - t_0$, equal to 0.80 seconds.
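The first example can be imitated with a single-mode simulation. The sketch below is an assumption-laden illustration, not the authors' program: it keeps only the first mode of (25), drives it exactly at its natural frequency, restarts the gains of (24) at the end of every control interval, and integrates with a classical fourth-order Runge-Kutta scheme. All names (`simulate`, `BB`, `F1`, the step size) are hypothetical.

```python
import math

L_BEAM, EI, M = 60.0, 5.0e5, 1.0
ACTUATORS = (13.0, 25.0, 35.0, 47.0)
Q1 = Q2 = 5000.0
R = 1.0
W1 = (math.pi / L_BEAM) ** 2 * math.sqrt(EI / M)   # first natural frequency

def phi1(x):
    """First normalized mode shape of the simply supported beam."""
    return math.sqrt(2.0 / (M * L_BEAM)) * math.sin(math.pi * x / L_BEAM)

BB = sum(phi1(x) ** 2 for x in ACTUATORS) / R   # B_c R^-1 B_c^T for one mode
PHI_MID = phi1(L_BEAM / 2.0)
F1 = PHI_MID                                     # modal load of a unit midspan force

def simulate(control_interval, t_end=20.0, dt=1.0e-3):
    """Peak |midspan deflection| of mode 1; control_interval=None means no control."""
    per_interval = None
    if control_interval is not None:
        per_interval = max(1, int(round(control_interval / dt)))

    def accel(t, tau, a, v):
        damping = stiffness = 0.0
        if per_interval is not None:
            damping = Q1 * BB * math.sin(W1 * tau) / W1          # Q1 B_c D
            stiffness = Q2 * BB * (1.0 - math.cos(W1 * tau))     # Q2 B_c E
        return F1 * math.sin(W1 * t) - damping * v - (W1 ** 2 + stiffness) * a

    a = v = peak = 0.0
    for n in range(int(round(t_end / dt))):
        t = n * dt
        # tau is the time elapsed since the last restart of the control interval
        tau = 0.0 if per_interval is None else (n % per_interval) * dt
        k1a, k1v = v, accel(t, tau, a, v)
        k2a = v + 0.5 * dt * k1v
        k2v = accel(t + 0.5 * dt, tau + 0.5 * dt, a + 0.5 * dt * k1a, k2a)
        k3a = v + 0.5 * dt * k2v
        k3v = accel(t + 0.5 * dt, tau + 0.5 * dt, a + 0.5 * dt * k2a, k3a)
        k4a = v + dt * k3v
        k4v = accel(t + dt, tau + dt, a + dt * k3a, k4a)
        a += dt * (k1a + 2.0 * k2a + 2.0 * k3a + k4a) / 6.0
        v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        peak = max(peak, abs(PHI_MID * a))
    return peak

free = simulate(None)
short = simulate(0.01)
long_ = simulate(0.80)
print(free, short, long_)
```

In this simplified setting the uncontrolled resonant response keeps growing over the simulated window, while both controlled responses stay much smaller, with the 0.80 second interval giving the smallest peak.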

In  the  second  example  the  beam  is  subjected  to  a  20N  load  that  travels  along 
the  beam  with  a  speed  of  37  m/sec.  Now  the  lowest  5  modes  are  controlled  and  the 
next  5  modes  are  uncontrolled  (Nc  =  Nr  =  5).  The  results  corresponding  to  control 

intervals  of  0.01,  0.03  and  0.06  seconds  are  shown  in  Fig.  3.  It  is  clear  that  the 
controllers  suppress  the  vibration  within  10  seconds. 


Control  of  Simply  Supported  Beam-Columns.  Beam  columns  are  members  that 
are  subjected  to  both  bending  and  axial  compression.  They  will  be  unstable  when  the 
axial  compression  reaches  certain  critical  values.  For  a  given  beam-column  with 
certain  specified  boundary  conditions,  theoretically  an  infinite  number  of  critical 
loads  and  associated  buckling  mode-shapes  exist.  In  this  section,  we  attempt  to 
follow  the  same  procedure  derived  previously  to  control  the  beam-columns  at  and 
above  their  buckling  load. 

Let $P$ denote the axial compressive load applied to the beam in Fig. 1. The eigenvalue problem is given by

$$\frac{d^2}{dx^2}\left(EI\,\frac{d^2\phi_j}{dx^2}\right) + P\,\frac{d^2\phi_j}{dx^2} = m\,\omega_j^2\,\phi_j, \quad j = 1, 2, 3, \ldots \tag{28}$$

with the boundary conditions

$$\phi_j = \frac{d^2\phi_j}{dx^2} = 0 \quad \text{for } x = 0,\ l \tag{29}$$

that hold for the simply supported case.

The normalized eigenfunctions for Eqs. (28) - (29) for a prismatic beam can be written in the form

$$\phi_j = \sqrt{\frac{2}{ml}}\,\sin\frac{j\pi x}{l}, \quad j = 1, 2, \ldots \tag{30}$$

As a result of substitution of Eq. (30) into Eq. (28), we have

$$\omega_j = \begin{cases} j\,\gamma_j & \text{if } p < j^2 \\ i\,j\,\gamma_j & \text{if } p > j^2 \end{cases} \quad j = 1, 2, \ldots \tag{31}$$

where

$$\gamma_j = \frac{\pi^2}{l^2}\sqrt{\frac{EI\,|j^2 - p|}{m}}, \quad p = \frac{P}{P_E} \tag{32}$$

and $P_E$ denotes the Euler load of the beam-column, i.e., $P_E = \pi^2 EI/l^2$.
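The dependence of the modal frequencies on the axial load can be made explicit by carrying out the substitution directly. The following is a sketch of that step, assuming a uniform $EI$ and the load ratio $p = P/P_E$ with $P_E = \pi^2 EI/l^2$; the numerical value at the end uses the beam data of Fig. 1.

```latex
EI\left(\frac{j\pi}{l}\right)^{4} - P\left(\frac{j\pi}{l}\right)^{2} = m\,\omega_j^{2}
\quad\Longrightarrow\quad
\omega_j^{2} = \frac{j^{2}\pi^{4}EI}{m\,l^{4}}\,\bigl(j^{2} - p\bigr),
\qquad
P_E = \frac{\pi^{2}\,(5\times 10^{5})}{60^{2}} \approx 1371\ \text{N}.
```

Under these assumptions $\omega_j^2$ is positive for $p < j^2$, vanishes at $p = j^2$, and is negative (imaginary $\omega_j$) for $p > j^2$; in particular the first mode loses its stiffness at $p = 1$.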

When $p$ is smaller than 1, i.e., the axial load $P$ is smaller than $P_E$, the beam is stable and the control law expressed in Eq. (23) is still valid with the $\omega_j$, $j = 1, 2, 3, \ldots$, given by Eq. (31). When the beam is in critical condition, i.e., $p = 1$, then $\omega_1$ is zero. However, because

$$\lim_{p \to 1} \frac{\sin(\omega_1 t)}{\omega_1} = t \quad \text{and} \quad \lim_{p \to 1} \cos(\omega_1 t) = 1,$$

one obtains from Eq. (11)

$$\mathbf S_c(t) = \mathrm{diag}\big[t,\ \omega_j^{-1}\sin(\omega_j t)\big], \quad \mathbf C_c(t) = \mathrm{diag}\big[1,\ \cos(\omega_j t)\big], \quad j = 2, 3, \ldots, N_c \tag{33}$$

Following the same procedure described in previous sections, one can obtain the same control law that is already shown in Eqs. (23) - (24), in which $\mathbf S$ and $\mathbf C$ are now defined by Eq. (33).

Similarly, when $1 < p < 2$, the control force and the system behavior can be given by Eqs. (23) - (26) wherein $\mathbf S$, $\mathbf C$ can be written in the form

$$\mathbf S_c(t) = \mathrm{diag}\big[|\omega_1|^{-1}\sinh(|\omega_1| t),\ \omega_j^{-1}\sin(\omega_j t)\big], \quad \mathbf C_c(t) = \mathrm{diag}\big[\cosh(|\omega_1| t),\ \cos(\omega_j t)\big], \quad j = 2, 3, \ldots, N_c \tag{34}$$
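The three cases for the diagonal entries of $\mathbf S_c$ and $\mathbf C_c$ (stable, critical and buckled mode) can be collected in one small helper. The Python sketch below is illustrative; the function name `modal_harmonics` and the convention of passing $\omega_j^2$ (which may be zero or negative) are assumptions made for this example.

```python
import math

def modal_harmonics(omega_sq, t):
    """Return (s, c), the entries of S_c(t) and C_c(t) for one mode.

    omega_sq > 0 : stable mode,   s = sin(w t)/w,   c = cos(w t)
    omega_sq = 0 : critical mode, s = t,            c = 1
    omega_sq < 0 : buckled mode,  s = sinh(w t)/w,  c = cosh(w t), w = sqrt(-omega_sq)
    """
    if omega_sq > 0.0:
        w = math.sqrt(omega_sq)
        return math.sin(w * t) / w, math.cos(w * t)
    if omega_sq < 0.0:
        w = math.sqrt(-omega_sq)
        return math.sinh(w * t) / w, math.cosh(w * t)
    return t, 1.0

print(modal_harmonics(4.0, 0.3))
print(modal_harmonics(0.0, 0.3))
print(modal_harmonics(-4.0, 0.3))
```

The three branches join continuously at `omega_sq = 0`, which mirrors the limits used to pass from Eq. (11) to Eq. (33).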

For numerical evaluation the beam with the same parameters as shown in Fig. 1 is used. The system is subjected to an impulse at midspan. The control intervals are set to 0.01 and 0.05 seconds to compare results. The behavior of the beam-column is assumed to be adequately represented by its first ten modes, and of these the lowest five are controlled. Fig. 4 shows the response of the system over a 10-second interval. The axial compressive load ratio $p = P/P_E$ is set to 0.9. The deflection of the beam remains bounded and is accompanied by control force spilling over into the higher modes. The time histories of the commanded forces for the actuator $A_3$, one of the middle two actuators, are shown in Fig. 5. The larger control interval causes higher beginning values for the force and lower tail end values. Thus it may be concluded that the control is effective at an axial load of 0.9 of the buckling load.

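Next, we consider the control of the beam-column at the buckling load, i.e., $p = 1$. In this case, the matrix $\mathbf E$ defined in Eq. (24) can be partitioned as

$$\mathbf E = \begin{bmatrix} \mathbf E_{11} & \mathbf E_{12} \\ \mathbf E_{21} & \mathbf E_{22} \end{bmatrix}, \quad E_{ij} = \frac{(1 - C_j)\,B_{ji}}{R_i}, \quad i = 1, \ldots, n,\ j = 1, \ldots, N_c \tag{35}$$

where

$$C_i = \cos(\omega_i(t - t_0)), \quad i = 1, \ldots, N_c$$

Also the $\mathbf B_c$, $\mathbf D$ matrices and the $\mathbf a_c$ vector can be partitioned as

$$\mathbf B_c = \begin{bmatrix} \mathbf B_{11} & \mathbf B_{12} \\ \mathbf B_{21} & \mathbf B_{22} \end{bmatrix}, \quad \mathbf D = \begin{bmatrix} \mathbf D_{11} & \mathbf D_{12} \\ \mathbf D_{21} & \mathbf D_{22} \end{bmatrix}, \quad \mathbf a_c = \begin{bmatrix} a_1 \\ \mathbf a_2 \end{bmatrix} \tag{36}$$

Then the first modal equation, that corresponds to the buckling mode of the beam-column in Eq. (25), becomes

$$\ddot a_1 + Q_1(\mathbf B_{11}\mathbf D_{11} + \mathbf B_{12}\mathbf D_{21})\,\dot a_1 = f_1 - Q_1(\mathbf B_{11}\mathbf D_{12} + \mathbf B_{12}\mathbf D_{22})\,\dot{\mathbf a}_2 - Q_2(\mathbf B_{11}\mathbf E_{12} + \mathbf B_{12}\mathbf E_{22})\,\mathbf a_2 \tag{37}$$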

with appropriate initial conditions at $t_0$. From Eq. (37), it is clear that the feedback control law can only provide a certain amount of damping to the system and the coupled modal control force acts as a counter force to balance $f_1$. Because $a_1$ is coupled with $\mathbf a_2$, it is difficult to find the analytical solution for $a_1$. Therefore the same numerical procedure employed previously is used to find approximate results. Fig. 6 shows the results of the control of the beam-column at its buckling load when the control intervals are set to 0.01 and 0.05 seconds. The control spillover is present and, in both cases, the system acquires a permanent deflection that depends on the control interval. Different control efforts are pumped into the system for the different control intervals, and this is responsible for the different levels of permanent deformation shown in Fig. 6. As observed from Fig. 7, the commanded forces for smaller control intervals have higher initial and lower final values.

Fig. 8 and Fig. 9 show the control of the beam-column when $p = 1.1$. In Fig. 8, the system appears unstable, with deflection growing continually for both control intervals of 0.01 and 0.05 seconds. The deflection for the control interval of 0.05 seconds has a smaller rate of growth than that for the control interval of 0.01 seconds.

Fig. 9 shows the commanded force histories of actuator $A_2$ for both cases. The importance of the beginning stages of control effort, being a function of the control interval, shows up prominently in these cases. In the case of the short control interval, i.e., $t - t_0 = 0.01$ seconds, the commanded force of the actuator $A_2$ increases with the unbounded response of the controlled system. However, in the long control interval case, i.e., $t - t_0 = 0.05$ seconds, the commanded force grows at a more moderate rate compared to that of the short interval. Therefore, it is clear that the longer control intervals are better in that they provide a slower rate of growth of deflections.

Conclusion.  The  control  algorithm  developed  here  determines  the  control  forces 
in  the  time  domain  explicitly  in  terms  of  the  coupled  modal  amplitudes  and 
velocities.  The  algorithm  is  optimal  in  that  it  minimizes  the  total  energy  of  the 
system  instantly  and  spatially  rather  than  over  a  time  domain.  Consequently  the 
algorithm  is  causal  and  can  be  implemented  on  the  basis  of  what  has  transpired  prior 
to  implementation.  Additionally  the  algorithm  allows  participation  by  a  limited 
number  of  modes  with  the  remainder  being  subject  to  spillover  effect.  The 
instantaneous  nature  of  the  algorithm  limits  its  applicability  to  the  small 
neighborhood  of  the  initial  point  which  however  is  arbitrary.  This  arbitrariness 
allows  integration  over  a  sequence  of  intervals  whose  initial  data  is  determined  by  the 
end  conditions  of  the  previous  interval. 


The coupled-modal control algorithm can successfully be used to suppress the vibration of the beam subjected to a resonant cyclic load and a moving load, as shown in this paper. The algorithm is causal, i.e., it depends only on the past history of the system, and the control interval dominates the system's behavior. The maximum control interval as given by Eq. (27) is a function of the period of the highest controlled mode. The response of the controlled system is significantly affected by the control interval. Therefore, a proper choice of the control interval is necessary for the algorithm. The algorithm can be applied to control of a beam-column before buckling. However, the controller can only provide a damping resistance to the buckled mode, and the algorithm is not well suited for controlling a beam after buckling.

TABLE 1

Mode   Max. (t - t0), sec.     Mode   Max. (t - t0), sec.
 1     1.621                    6     0.045
 2     0.405                    7     0.033
 3     0.180                    8     0.025
 4     0.101                    9     0.020
 5     0.065                   10     0.016

Fig. 1 Simply supported beam with 4 discrete actuators, A_i, i = 1, ..., 4.
System parameters: L = 60 m, EI = 5 x 10^5 N·m², m = 1 kg/m, x_1 = 13 m, x_2 = 25 m, x_3 = 35 m, x_4 = 47 m.
Control parameters: Q_1 = 5000, Q_2 = 5000, R_i = 1, i = 1, 2, 3, 4.

Fig. 2 The midspan deflection of the beam subjected to a unit periodic load resonant with the fundamental frequency of the beam, t - t0 = 0.01, 0.08, 0.80 seconds. [Plot: deflection at midspan (cm) versus time (sec).]

Fig. 3 The midspan deflection of the beam subjected to a moving load traveling at 37 m/second speed, t - t0 = 0.01, 0.03 and 0.06 seconds. [Plot: deflection at midspan (cm) versus time (sec).]

Fig. 4 The midspan deflection of the controlled beam-column subjected to a unit impulse at midspan, p = 0.9, t - t0 = 0.01, 0.05 seconds. [Plot: deflection at midspan (cm) versus time (sec).]

[Fig. 5: control force (kN) versus time (sec).]

Fig. 6 The midspan deflection of the controlled beam-column subjected to a unit impulse at midspan, p = 1.0, t - t0 = 0.01, 0.05 seconds. [Plot: deflection at midspan (cm) versus time (sec).]

[Fig. 7: control force (kN) versus time (sec).]

Fig. 8 The midspan deflection of the controlled beam-column subjected to a unit impulse at midspan, p = 1.1, t - t0 = 0.01, 0.05 seconds.

Fig. 9 Commanded force of the A_2 actuator for the control of the beam-column, p = 1.1, t - t0 = 0.01, 0.05 seconds. [Plot: control force (kN) versus time (sec).]

A CENTRAL LIMIT THEOREM FOR INTEGRAL FUNCTIONALS OF A
STATIONARY GAUSSIAN PROCESS*

Simeon  M.  Berman 

Courant  Institute  of  Mathematical  Sciences 
New York University
New  York,  NY  10012 

ABSTRACT. Let $X(t)$, $t \ge 0$, be a real stationary Gaussian process with covariance function $r(t)$. Let $f(x)$ be a function in $L_2(\phi)$, where $\phi(x)$ is the standard normal density, and assume that $\int x\,f(x)\,\phi(x)\,dx \ne 0$. It is shown that the central limit theorem holds for the functional $\int_0^t f(X(s))\,ds$, for $t \to \infty$, under the sole assumptions $r(t) \ge 0$ and $r(t) \to 0$ for $t \to \infty$.

1.  Summary. 

Let $X(t)$, $t \ge 0$, be a real stationary Gaussian process with mean 0 and covariance function $r(t) = EX(0)X(t)$. For a Borel function $f(x)$ and $t > 0$, consider the functional

$$\int_0^t f(X(s))\,ds. \tag{1.1}$$

There has been a sustained interest in proving the Central Limit Theorem, for $t \to \infty$, for such functionals, that is, determining the limiting distribution of the normed random variable

$$\frac{\int_0^t f(X(s))\,ds - E\left[\int_0^t f(X(s))\,ds\right]}{\left\{\mathrm{Var}\int_0^t f(X(s))\,ds\right\}^{1/2}}. \tag{1.2}$$

* Supported by the U. S. Army Research Office.

Let $\phi(x)$ represent the standard normal density, and let $H_k(x)$, $k = 0, 1, \ldots$, be the family of Hermite polynomials. The assumption commonly used for the function $f$ is that it belong to $L_2(\phi)$, that is, $\int |f(x)|^2\,\phi(x)\,dx < \infty$. Every such function has an expansion

$$f(x) = \sum_{k=0}^{\infty} f_k\,H_k(x)/\sqrt{k!}, \tag{1.3}$$

where

$$f_k = \frac{1}{\sqrt{k!}}\int f(x)\,H_k(x)\,\phi(x)\,dx. \tag{1.4}$$

The Hermite rank of $f$ is the smallest positive integer $k$ for which $f_k \ne 0$.
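The coefficients (1.4) and the Hermite rank can be computed numerically for a concrete function. The Python sketch below assumes the probabilists' Hermite polynomials ($H_0 = 1$, $H_1 = x$, $H_2 = x^2 - 1$, ...) and a plain trapezoidal rule on $[-10, 10]$; the function names and the test function $f(x) = x + x^2$ are illustrative choices, not taken from the paper.

```python
import math

def hermite(k, x):
    """Probabilists' Hermite polynomial H_k(x) via H_{n+1} = x H_n - n H_{n-1}."""
    if k == 0:
        return 1.0
    h_prev, h = 1.0, x
    for n in range(1, k):
        h_prev, h = h, x * h - n * h_prev
    return h

def normal_density(x):
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def hermite_coefficient(f, k, half_width=10.0, steps=4000):
    """f_k of Eq. (1.4), approximated with the trapezoidal rule."""
    h = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        x = -half_width + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * f(x) * hermite(k, x) * normal_density(x)
    return h * total / math.sqrt(math.factorial(k))

def hermite_rank(f, k_max=6, tol=1e-8):
    """Smallest positive k <= k_max with |f_k| > tol, or None if there is none."""
    for k in range(1, k_max + 1):
        if abs(hermite_coefficient(f, k)) > tol:
            return k
    return None

f = lambda x: x + x * x
coeffs = [hermite_coefficient(f, k) for k in range(4)]
print(coeffs, hermite_rank(f), hermite_rank(lambda x: x * x))
```

For this choice the coefficients are $f_0 = 1$, $f_1 = 1$, $f_2 = \sqrt 2$ and $f_3 = 0$, so $f$ has Hermite rank 1, while $x^2$ has Hermite rank 2.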

Limit theorems for (1.2) are of two categories. The first is characterized by convergence of the distribution to a normal distribution under a hypothesis of mixing, involving sufficiently rapid convergence of $r(t)$ to 0 for $t \to \infty$. The best results in this area up to now are those of Breuer and Major (1983) for a discrete time process and Chambers and Slud (1989) for a continuous time process. The general result is that if the function $f$ is of Hermite rank $k$, and $r \in L_k(-\infty, \infty)$, then (1.2) has a limiting normal distribution.

The second category of results is a collection of "non-Central Limit Theorems", where (1.2) has a limiting distribution which, except for the case of a function of Hermite rank 1, is not a normal distribution. Such theorems state in their hypotheses that the process is assumed to have "long range dependence". More precisely, it is assumed that i) the function $f$ is of Hermite rank $k$, for some $k \ge 1$, and ii) for some $\alpha < 1/k$, $r(t)$ is regularly varying of index $-\alpha$ for $t \to \infty$ (Dobrushin and Major (1979), and Taqqu (1979)).

Our main result represents an extension of previous results for the case of a function of Hermite rank 1, in which the limiting distribution is normal under either the mixing condition $r \in L_1$ or the long range dependence condition of the regular variation of $r$ for index $-\alpha > -1$. Our only assumptions on $r(t)$ are that it be nonnegative, and converge to 0 for $t \to \infty$. The limiting distribution of (1.2) is always normal; however, the variance of the limiting distribution has two forms, depending on whether $r \in L_1$ or $r \notin L_1$.

The result is:

THEOREM 1.1. If $r(t) \ge 0$ for $t \ge 0$, and $r(t) \to 0$ for $t \to \infty$, then, for any function $f \in L_2(\phi)$ such that

$$\gamma = \int y\,\phi(y)\,f(y)\,dy \ne 0, \tag{1.5}$$

the random variable

$$\frac{\int_0^t f(X(s))\,ds - t\int_{-\infty}^{\infty} f(y)\,\phi(y)\,dy}{\left(2\gamma^2\int_0^t (t - s)\,r(s)\,ds\right)^{1/2}} \tag{1.6}$$

has a limiting normal distribution with mean 0. The variance of the limiting distribution is equal to 1 if $r \notin L_1$ and is equal to

$$\frac{\sum_{k=1}^{\infty} f_k^2\int_0^{\infty} r^k(s)\,ds}{\gamma^2\int_0^{\infty} r(s)\,ds} \tag{1.7}$$

if $r \in L_1$.

The main idea of the proof is that $\int_0^{\infty} r(t)\,dt$ is either finite or $+\infty$. In the former case, the result follows from that of Chambers and Slud (1989). In the latter case the process has a form of long range dependence, and the contribution of the theorem is that the limiting distribution exists without a precisely assumed form of $r(t)$ for $t \to \infty$.


2.  Proof  of  the  Theorem. 

Let $\phi(x, y; r)$ represent the standard bivariate normal density with correlation $r$. A direct calculation shows that the variance of $\int_0^t f(X(s))\,ds$ is equal to

$$2\int_0^t (t - s)\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} f(x)\,f(y)\,[\phi(x, y; r(s)) - \phi(x)\,\phi(y)]\,dx\,dy\,ds.$$

By the diagonal expansion of $\phi(x, y; r(s))$ in Hermite polynomials (Cramér 1946, page 290) the integral above is equal to

$$2\int_0^t (t - s)\sum_{k=1}^{\infty} f_k^2\,r^k(s)\,ds, \tag{2.1}$$

with $f_k$ as in (1.4).

Suppose $r \in L_1$; then, for every $k \ge 1$,

$$t^{-1}\int_0^t (t - s)\,r^k(s)\,ds \to \int_0^{\infty} r^k(s)\,ds,$$

for $t \to \infty$, and so the expression (2.1), divided by $t$, converges to

$$2\sum_{k=1}^{\infty} f_k^2\int_0^{\infty} r^k(s)\,ds.$$

Since, by definition, $H_1(x) = x$, we have $f_1 = \gamma \ne 0$ under (1.5). Therefore, the assertion of the theorem in the case $r \in L_1$ is the special case of the result of Chambers and Slud (1989) for a function of Hermite rank 1.

Next consider the case $r \notin L_1$. Since $\gamma = f_1$, (2.1) is equal to the sum of

$$2\gamma^2\int_0^t (t - s)\,r(s)\,ds \tag{2.2}$$

and

$$2\int_0^t (t - s)\Big[\sum_{k=2}^{\infty} f_k^2\,r^k(s)\Big]\,ds. \tag{2.3}$$

Our first aim is to show that the expression (2.3) is of order smaller than (2.2) for $t \to \infty$, and this will establish that the denominator in (1.6) is asymptotically equal to the standard deviation of the numerator:

$$\mathrm{Var}\int_0^t f(X(s))\,ds \sim 2\gamma^2\int_0^t (t - s)\,r(s)\,ds. \tag{2.4}$$

For arbitrary fixed $T > 0$, take $t > T$ in (2.3), and write the latter as the sum of two terms

$$2\int_0^T (t - s)\Big[\sum_{k=2}^{\infty} f_k^2\,r^k(s)\Big]\,ds \tag{2.5}$$

and

$$2\int_T^t (t - s)\Big[\sum_{k=2}^{\infty} f_k^2\,r^k(s)\Big]\,ds. \tag{2.6}$$

The term (2.5) is at most equal to

$$2tT\sum_{k=2}^{\infty} f_k^2 = \text{constant} \times t.$$

This is of order smaller than that of the right-hand member of (2.4) because, by L'Hospital's Rule and the assumption $r \notin L_1$,

$$\lim_{t \to \infty}\frac{t}{\int_0^t (t - s)\,r(s)\,ds} = \lim_{t \to \infty}\frac{1}{\int_0^t r(s)\,ds} = 0.$$

The term (2.6) is at most equal to

$$2\int_0^t (t - s)\,r(s)\,ds \times \sum_{k=2}^{\infty} f_k^2 \times \sup_{s > T} r(s).$$

The ratio of the latter expression to the right-hand member of (2.4) is at most

$$\gamma^{-2}\sum_{k=2}^{\infty} f_k^2 \times \sup_{s > T} r(s).$$

Since $T$ is arbitrary, and $r(t) \to 0$ for $t \to \infty$, the expression above can be made arbitrarily small by choosing $T$ sufficiently large. This concludes the argument that (2.3) is of smaller order than (2.2), and this confirms (2.4).
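The L'Hospital step can be illustrated numerically for one covariance that is nonnegative, tends to 0, and is not integrable. The Python sketch below assumes $r(s) = (1 + s)^{-1/2}$, chosen only for illustration, for which $\int_0^t (t - s)\,r(s)\,ds = \tfrac{4}{3}\big((1 + t)^{3/2} - 1\big) - 2t$ in closed form.

```python
import math

def r(s):
    """Illustrative covariance: nonnegative, r(0) = 1, r -> 0, not integrable."""
    return (1.0 + s) ** -0.5

def weighted_integral(t, steps=20000):
    """Trapezoidal approximation of integral_0^t (t - s) r(s) ds."""
    h = t / steps
    total = 0.0
    for k in range(steps + 1):
        s = k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * (t - s) * r(s)
    return h * total

def closed_form(t):
    """Exact value of integral_0^t (t - s) (1 + s)^(-1/2) ds."""
    return 4.0 / 3.0 * ((1.0 + t) ** 1.5 - 1.0) - 2.0 * t

def ratio(t):
    """t divided by the normalizing integral; tends to 0 when r is not in L1."""
    return t / closed_form(t)

print(weighted_integral(10.0), closed_form(10.0))
print([ratio(t) for t in (1.0, 1.0e2, 1.0e4)])
```

For this example the ratio decreases from about 2.3 at $t = 1$ to below 0.01 at $t = 10^4$, in line with the limit stated above.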

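By expansion of $f$ in Hermite polynomials, we find that

$$\int_0^t f(X(s))\,ds - t\int_{-\infty}^{\infty} f(x)\,\phi(x)\,dx \tag{2.7}$$

is representable as

$$\sum_{k=1}^{\infty}\frac{f_k}{\sqrt{k!}}\int_0^t H_k(X(s))\,ds. \tag{2.8}$$

It is known that the random variables

$$\frac{f_k}{\sqrt{k!}}\int_0^t H_k(X(s))\,ds, \quad k = 1, 2, \ldots \tag{2.9}$$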

are  uncorrelated  and  have  means  0.  The  first  term  in  (2.S)  is 


fX 


7  /  X (s)ds  , 
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which  has  a  normal  distribution  with  mean  0  and  variance  (2.2).  According  to  the  method 
of  Berman  (1979),  the  fact  that  the  variance  of  the  sum  (2.8)  is  asymptotically  equal  to 
the  variance  of  the  first  term  implies,  in  this  case,  that  the  limiting  distribution  of  the  sum 
(2.8)  is  equal  to  the  limiting  distribution  of  the  first  term  in  (2.9).  □ 
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The  Voronoi  Diagram  for  The  Euclidean  Traveling  Salesman  Problem 
is  Piecemeal  Quartic  and  Hyperbolic 
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Abstract.  It  is  shown  that  the  Voronoi  diagram  for  the  Euclidean  traveling  salesman  problem  is 
piecemeal  quartic  and  hyperbolic.  Previous  attempts  to  leverage  the  traditional  (linear)  Voronoi 
diagram  upon  the  problem  have  failed;  in  particular,  counterexamples  have  demonstrated  that  the 
optimal  tour  need  not  traverse  the  Voronoi  dual.  In  this  paper,  the  shortest  tour  is  treated  as  the  union 
of  a  set  of  perturbations  of  the  convex  hull,  with  interior  cities  added  incrementally,  one  at  a  time.  A 
perturbation  is  defined  to  be  the  union  of  a  new  city  with  a  subpath  which  connects  two  adjacent  hull 
vertices  to  a  set  (possibly  null)  of  previously  entered  interior  cities.  The  length  of  a  perturbation  is 
therefore  equal  to  the  sum  of  two  variable  distances,  minus  the  sum  of  a  set  of  fixed  distances.  This 
length  is  called  the  elliptic  length  of  the  perturbation.  Beginning  with  the  convex  hull,  a  single  city  is 
randomly  added  to  the  interior,  and  the  hull  is  perturbed  to  capture  the  new  city  in  optimal  fashion.  For 
a perturbation of a specific elliptic length, this quantity determines an ellipse symmetric about a hull
segment, with foci at the segment endpoints. Any other hull perturbation of the same length defines
another ellipse symmetric about some other hull segment. As the perturbation length is allowed to vary
continuously from zero to infinity, a set of confocal ellipses is produced about each hull segment, and
the intersection across all other such sets produces a set of quartics. For the special case in which hull
segments share an endpoint (focus), the locus is a hyperbola. Each hull segment is bounded by those
quartics for which the segment is the source of minimal perturbation length. The region of the plane
thus bounded is called the quartic Voronoi cell for that segment. There is a quartic cell corresponding to
each hull segment, and the union of all such cells forms the Voronoi diagram of the hull. Now, if a
random city is injected into the hull, and the city is observed to lie in a specific Voronoi cell, we know
that to produce the minimal tour, the city must be connected to the endpoints of the hull segment
corresponding to the cell, and in turn the endpoints of the hull segment must be disconnected. If one
maintains  the  proper  canonical  forms  to  alter  the  topology  of  the  perturbation  space  when  a  new  city  is 
added,  the  technique  may  be  extended  to  accommodate  multiple  interior  cities.  The  quartic  Voronoi 
diagram  is  shown  to  differ  from  the  traditional  Voronoi  diagram  in  three  distinct  ways:  it  depicts 
shortest  tour  connectivity  rather  than  point-to-point  proximity;  its  cell  boundaries  are  quartic  and 
hyperbolic  rather  than  linear;  and  the  diagram  is  bounded  by  the  convex  hull  rather  than  being 
unbounded  (although  this  last  constraint  may  be  relaxed  to  add  new  cities  outside  the  hull).  A  naively 
derived  ceiling  function  demonstrates  that  an  unsupervised  perturbation  approach  is  of  exponential 
complexity, with a scaling factor as a function of the size of the hull. By resorting to an algorithm which
exploits the canonical forms, it is shown that this bound may be diminished to $O(n^3)$. The algorithm is
demonstrated for a database consisting of the forty-eight capitals of the contiguous United States.
Open research issues include whether the technique may be extended to accommodate a hull which
encloses an arbitrary number of cities, and whether the ceiling function may be further reduced.

Statement  of  the  Problem.  The  Euclidean  traveling  salesman  problem  (ETSP)  is  a  special  case  of  the 
general  traveling  salesman  problem  (TSP).  Given  a  set  of  cities  and  the  associated  costs  between  pairs  of 
cities,  the  goal  of  the  TSP  is  to  find  the  optimal  tour  which  visits  every  city  exactly  once,  except  for  the 
start  city,  which  is  revisited  at  tour's  end.  Unlike  the  TSP  which  utilizes  a  general  cost  function  to  link 
cities,  the  ETSP  employs  the  Euclidean  distance  between  cities  as  the  metric,  and  equates  optimality  with 
shortest  tour  length.  The  objective  of  this  research  is  to  attempt  to  rigorously  characterize  the 
underlying geometry of ETSP tour construction, and subsequently to pursue an algorithm for an exact
solution  to  the  problem  for  city  databases  of  modest  size. 

Background. The traveling salesman problem has been an outstanding research issue for over a century,
and has been approached computationally since the end of the Second World War [L1]. It is important to
differentiate between the TSP and TSP decision; the former requests a list of cities ordered as they
appear in the optimal tour, whereas the latter seeks a yes or no answer to the question "is there a tour
of cost k or less?" In 1972, it was proven that TSP decision is NP-complete [K2]; and in 1976 it was shown
that ETSP decision with discretized distance is also NP-complete [P4, G1]. The ETSP with non-discretized
distance is NP-hard in the strong sense [G1]. The failure of the ETSP to yield to known problem-solving
strategies has caused the vast majority of researchers to abandon the search for an exact algorithm, and
instead strive for fast approximation techniques. Many heuristic algorithms have been developed to
date; they include k-opt edge exchange [L3, J4], branch-and-bound [L4], simulated annealing [M1, J4],
neural networks [F1, J4], genetic algorithms [B4], and elastic bands [D4]. A preeminent researcher in the
field is of the opinion that for consistently high quality solutions on databases of very large scale, the
Lin-Kernighan edge exchange algorithm has few competitors [J5]. Chapters 5-7 of reference [L1]
provide valuable suggestions for evaluating the performance of some of the heuristic methods.

An  Historical  Perspective  of  the  Euclidean  Traveling  Salesman  Problem  and  the  Voronoi  Diagram. 

Operations research has been the historical forum for the ETSP. There have been very few efforts
possessing a computational geometry flavor. In the seventies, the issue was raised about whether an ETSP
optimal tour must necessarily traverse adjacent cells of the Voronoi diagram [S2]. This conjecture has
subsequently been answered in the negative. A counterexample for a degenerate case was discovered
in 1983 [K1], and one for the general case was skillfully crafted five years later [D3]; the latter
counterexample is portrayed in Figure 1. It will be shown in this paper that the fundamental reason for
the  difficulty  in  applying  the  traditional  Voronoi  diagram  to  ETSP  is  that  the  search  space  imposed  by  a 
perturbation  of  Euclidean  distances  is  non-linear  (in  particular,  it  is  quartic),  whereas  the  traditional 
Voronoi  cell  possesses  linear  boundaries.  It  will  also  be  shown  that  with  modifications,  the  traditional 
Voronoi  diagram  may  be  extended  to  portray  the  optimal  tour  for  an  n-city  problem,  given  the  optimal 
tour  for  n-1  cities. 


Figure 1. The traditional Voronoi diagram, computed for the Dillencourt dataset. In the optimal tour, t4 and
t3 are connected, but their respective Voronoi cells are not adjacent, which contradicts Shamos' conjecture that
an optimal tour must traverse the Voronoi dual.

The Euclidean Shortest Tour as a Perturbation of the Convex Hull.

In 1957, Barachet proved that there exists an optimal tour which preserves the relative order of
the points on the convex hull [B1]. This result implies that the shortest tour may be expressed as a hull
deformation produced by an excursion into the interior, to capture points which do not lie on the hull
(Figure 2). In 1977, a heuristic was developed to utilize the hull as an initial starting tour, and to attach
interior cities based on a two step procedure [S4]. First, the sum of the distances from an interior city to
the endpoints of an existing segment is computed, and the length of the segment subtracted; across
all existing segments, the minimal such expression associates the city with a particular segment. The
next step involves selecting a city to be inserted based on the maximal angle formed with its associated
segment. The procedure iterates until a Hamiltonian cycle is formed utilizing all interior cities. Finally,
an arbitrating function decides if the resultant cycle is sufficiently accurate. Analysis of the method has
indicated that it is superior to some methods which do not utilize the hull [G3]. Nevertheless, because
the approach is only approximate, the tour produced is generally suboptimal, and it is not well
understood why the heuristic performs as it does.


Figure 2. What is the shortest tour connecting cities d1-d7? We know that the tour must preserve the order
of the cities lying on the convex hull, so a natural way to proceed is to perturb the hull. The problem thus
reduces  to  finding  the  optimal  (shortest)  way  to  attach  city  d7  to  a  pair  of  adjacent  hull  vertices. 


Terminology  and  Notation. 

In  the  following  discussion,  we  shall  call  any  excursion  into  the  interior  from  two  adjacent  hull 
vertices  a  perturbation  of  the  hull.  It  is  important  to  note  that  a  perturbation  of  the  hull  entails  the 
corresponding  loss  of  the  segment  which  connects  the  two  adjacent  hull  vertices.  If  a  hull  segment  is 
unperturbed  it  is  called  a  null  perturbation.  A  tour  is  defined  to  be  the  union  of  a  set  of  perturbations  in 
counterclockwise  order,  as  they  appear  about  the  hull.  A  convention  will  be  adopted  to  represent 
certain perturbation concepts: a perturbation is denoted by the letter "$\pi$"; a tour is denoted by the
letter "$\tau$"; the length of perturbation $\pi$, or tour $\tau$, is denoted respectively "len $\pi$" or "len $\tau$"; the
Euclidean distance between points p, q is denoted "d(p, q)"; the set of cities lying on the convex hull is
denoted "H"; the convex hull itself is denoted "$\tau_H$"; the number of cities in set S, also called the order
of S, is denoted $|S|$. In concluding this introductory section, we formalize the definition of a
perturbation,  and  prove  three  minor  counting  theorems. 

Definition. Hull Perturbation. Given convex hull $\tau_H$ ordered with counterclockwise
orientation, and the set I of interior cities, a perturbation of the hull $\pi_k$ is an ordered
subpath $\pi_k = h_k \cup I_j \cup h_{k+1}$, $I_j \subseteq I$. [Note that $I_j$ may be the null set, in which case $\pi_k$
is a null perturbation.]

Theorem.  When  computing  the  Euclidean  shortest  tour,  the  number  of  perturbations  of 
the  convex  hull  cannot  exceed  the  rank  of  the  hull.  Proof:  By  definition,  a  hull 
perturbation  is  an  excursion  into  the  interior  of  the  hull  which  connects  two  adjacent 
hull  vertices  to  a  subset  of  the  interior.  Without  regard  to  order,  there  are  n  ways  to 
connect  n  adjacent  hull  vertices  (the  first  to  the  second;  the  second  to  the  third;  ....  the 
nth  to  the  first),  producing  a  set  of  n  hull  segments.  Each  of  these  segments  may  be  the 
source of a perturbation.

Theorem. If the size of the set of interior cities $|I|$ exceeds the size of the hull $|H|$, then
the shortest Euclidean tour must contain a hull segment perturbation which captures at least
$\lceil |I| / |H| \rceil \geq 2$ interior cities. Proof: from the pigeonhole principle, since there are more interior cities
than hull segments, some hull segment must be assigned at least $\lceil |I| / |H| \rceil$ interior
cities.

Theorem. If the size of the hull $|H|$ exceeds the size of the set of inner cities $|I|$, then
there must exist at least $|H| - |I|$ hull segments which remain unperturbed when
constructing the shortest Euclidean tour. Proof: again, from the pigeonhole principle,
since there are fewer interior cities than hull segments, at least $|H| - |I|$ hull segments
must be null perturbations.
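
As a worked instance of this last theorem, consider the seven city example of Figure 2, under the assumption (suggested by the figure caption) that d1-d6 lie on the hull and d7 is the only interior city:

```latex
|H| = 6, \qquad |I| = 1 \quad\Longrightarrow\quad |H| - |I| = 5
```

so at least five of the six hull segments must remain null perturbations in the shortest tour.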

An  Arbitrary  Hull  Enclosing  a  Single  Interior  City. 

Since  we  know  intuitively  that  the  shortest  Euclidean  tour  may  be  represented  as  a  perturbation 
of  the  hull,  let  us  proceed  with  the  simplest  case  by  introducing  a  single  interior  city  into  an  arbitrary 
hull.  It  is  natural  to  derive  under  what  conditions  a  perturbation  initiated  from  a  given  hull  segment  to 
the new city results in the shortest tour, versus one initiated from another segment. If len $\tau_H$ represents
the sum of the lengths of the segments which comprise the convex hull, and p is an arbitrary point
introduced into the hull interior, then to produce the shortest tour, one is interested in minimizing the
expression $\mathrm{len}\,\tau_H + [d(p, h_i) + d(p, h_{i+1}) - d(h_i, h_{i+1})]$ over all $h_i \in H$. The boundaries of equal hull
perturbation are those for which $\mathrm{len}\,\pi_i = \mathrm{len}\,\pi_j$ for distinct elements $h_i, h_j \in H$. To formally
characterize  the  boundaries  of  equal  perturbation  requires  that  the  expressions  for  perturbations 
initiated  from  two  distinct  hull  segments  be  set  equal  to  each  other,  and  the  resulting  equation  solved. 
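
The minimization just described can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' implementation: the function names (`dist`, `hull_length`, `best_single_insertion`) and the square test hull are assumptions introduced here, and the hull is assumed to be supplied as a counterclockwise list of (x, y) tuples.

```python
import math

def dist(p, q):
    """Euclidean distance d(p, q)."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def hull_length(hull):
    """Sum of the lengths of the hull segments (the length of the hull tour)."""
    n = len(hull)
    return sum(dist(hull[i], hull[(i + 1) % n]) for i in range(n))

def best_single_insertion(hull, p):
    """Return (segment index i, tour length) for the cheapest way to attach p.

    Segment i joins hull[i] to hull[i + 1], with indices taken modulo len(hull).
    """
    n = len(hull)
    base = hull_length(hull)
    best_i, best_len = None, math.inf
    for i in range(n):
        h_i, h_next = hull[i], hull[(i + 1) % n]
        # Hull length plus the two new links, minus the lost hull segment.
        tour_len = base + dist(p, h_i) + dist(p, h_next) - dist(h_i, h_next)
        if tour_len < best_len:
            best_i, best_len = i, tour_len
    return best_i, best_len

# Hypothetical data: a square hull and one interior city near the bottom edge.
square = [(0.0, 0.0), (4.0, 0.0), (4.0, 4.0), (0.0, 4.0)]
segment, length = best_single_insertion(square, (2.0, 0.5))
```

For the square hull above, the city (2, 0.5) lies nearest the bottom edge, so the sketch selects segment 0, the segment joining (0, 0) to (4, 0).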

The  Elliptic  Distance  of  a  Point  to  Two  Other  Points. 

During traveling salesman problem solving, the operation which subtracts the length of a
segment from the sum of the distances from the segment endpoints to an arbitrary point is
sufficiently fundamental to be given a special name, which we will call the elliptic distance.

Definition. The elliptic distance of a point p to two points q, r, denoted $d_e(p, q, r)$, is
defined to be

$d_e(p, q, r) = d(p, q) + d(p, r) - d(q, r)$ [1]
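
As a hedged illustration of this definition, the following Python sketch computes the elliptic distance and checks numerically that points of equal elliptic distance k lie on an ellipse with foci q and r and major axis of length d(q, r) + k. The helper `ellipse_point`, the parameter t, and the sample coordinates are assumptions made for this example only.

```python
import math

def dist(p, q):
    """Euclidean distance d(p, q)."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def elliptic_distance(p, q, r):
    """d(p, q) + d(p, r) - d(q, r): the detour cost of visiting p between q and r."""
    return dist(p, q) + dist(p, r) - dist(q, r)

def ellipse_point(q, r, k, t):
    """Point at parameter t on the ellipse with foci q, r and elliptic distance k.

    Assumes q != r and k > 0.
    """
    c = dist(q, r) / 2.0              # half the distance between the foci
    a = c + k / 2.0                   # semi-major axis, since 2a = d(q, r) + k
    b = math.sqrt(a * a - c * c)      # semi-minor axis
    mx, my = (q[0] + r[0]) / 2.0, (q[1] + r[1]) / 2.0
    ux, uy = (r[0] - q[0]) / (2.0 * c), (r[1] - q[1]) / (2.0 * c)  # unit vector along q -> r
    x, y = a * math.cos(t), b * math.sin(t)
    # Rotate the standard-form point onto the segment direction and translate to the midpoint.
    return (mx + x * ux - y * uy, my + x * uy + y * ux)

# Hypothetical foci and elliptic length.
q, r, k = (0.0, 0.0), (3.0, 4.0), 1.5
samples = [elliptic_distance(ellipse_point(q, r, k, t), q, r) for t in (0.0, 0.7, 2.0, 4.5)]
```

Every entry of `samples` should equal k up to rounding, while any point on the segment itself has elliptic distance zero.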

Derivation of the Quartic Locus for the Single Interior City Problem.

Theorem.  In  general,  the  Voronoi  diagram  of  the  convex  hull  of  the  set  of  cities  for  the 
Euclidean  traveling  salesman  problem  has  quartic  edges. 

Proof: Let p be an arbitrary point in the interior of convex hull $\tau_H$, and let $\pi_i$, $\pi_j$ be
perturbations of two distinct hull segments such that $\mathrm{len}\,\pi_i = \mathrm{len}\,\pi_j$.

$\mathrm{len}\,\pi_i = [d(p, h_i) + d(p, h_{i+1}) - d(h_i, h_{i+1})]$ for some $h_i \in H$, and

$\mathrm{len}\,\pi_j = [d(p, h_j) + d(p, h_{j+1}) - d(h_j, h_{j+1})]$ for some $h_j \in H$.

Let $\mathrm{len}\,\pi_i = \mathrm{len}\,\pi_j = k_{ij}$, which represents some specific elliptic length. Thus,

$d_e(p, h_i, h_{i+1}) = d_e(p, h_j, h_{j+1}) = k_{ij}$ [2]

Equation [2] describes two ellipses, the first with major axis aligned with the hull
segment having endpoints $h_i$, $h_{i+1}$, and the second aligned with the hull segment
having endpoints $h_j$, $h_{j+1}$. The endpoints of the hull segments are the respective foci of
the ellipses. The distances involving point p are variable, while the distances on the hull
are constant. Let us represent the two ellipses as follows:

$x^2/a^2 + y^2/b^2 = 1$ [3]

$Ax^2 + Bxy + Cy^2 + Dx + Ey + F = 0$ [4]

Equation [3] denotes one of the two ellipses of interest after it has been rotated
and translated to be in standard form about the origin. Equation [4] represents the
second ellipse with the coefficients A, B, C, D, E, and F determined in the coordinate system
of [3]. To characterize the locus of equal perturbation, we are required to
simultaneously solve [3] and [4].


Figure 3. The locus of equal perturbation between hull segments is obtained by intersecting two ellipses.
[Figure labels: shortest tour criteria $s + t - a = k_{ij}$ and $u + v - b = k_{ij}$; the two ellipses $s + t = a + k_{ij}$ and $u + v = b + k_{ij}$.]

From [3], we obtain:

$y = \pm (b/a) \sqrt{a^2 - x^2}$ [5]

Substituting the positive root for y in [4] yields

$Ax^2 + Bx(b/a)\sqrt{a^2 - x^2} + C(b/a)^2(a^2 - x^2) + Dx + E(b/a)\sqrt{a^2 - x^2} + F = 0$ [6]

Factoring, and moving the radical to the right side of the equation produces

$[A - (b^2/a^2)C]\,x^2 + Dx + b^2C + F = -(b/a)\sqrt{a^2 - x^2}\,(Bx + E)$ [7]

Squaring both sides of [7] to clear the radical, and gathering coefficients with respective
powers of x results in the equation:

$[A^2 - (2b^2AC/a^2) + b^4C^2/a^4 + (b^2/a^2)B^2]\,x^4$

$+\ [2AD - (2b^2CD/a^2) + (2b^2BE/a^2)]\,x^3$

$+\ [2Ab^2C + 2AF - (2b^4C^2/a^2) - (2b^2CF/a^2) + b^2E^2/a^2 - b^2B^2 + D^2]\,x^2$

$+\ [2b^2CD + 2DF - 2b^2BE]\,x$

$+\ b^4C^2 + 2b^2CF - b^2E^2 + F^2 = 0$ [8]


QED.  Thus  the  locus  of  equal  perturbation  for  inserting  a  random  city  into  the  hull  is 
defined  by  a  quartic  equation,  with  coefficients  expressed  in  terms  of  the  parameters  for 
two  ellipses,  where  the  ellipses  are  symmetric  about  segments  formed  by  linking  two 
cities. 
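
Equation [8] can be checked numerically. The Python sketch below builds the five coefficients as they appear in [8], chooses a point on the ellipse [3], picks a second conic [4] that is forced to pass through that point by solving for F, and evaluates the quartic at the point's abscissa; the residual should vanish up to rounding. All numeric values and the function names `quartic_coefficients` and `quartic_value` are illustrative assumptions.

```python
import math

def quartic_coefficients(a, b, A, B, C, D, E, F):
    """Coefficients of x^4, x^3, x^2, x^1, x^0 in equation [8]."""
    b2 = b * b
    r = b2 / (a * a)                       # the ratio b^2 / a^2
    c4 = A * A - 2 * r * A * C + r * r * C * C + r * B * B
    c3 = 2 * A * D - 2 * r * C * D + 2 * r * B * E
    c2 = (2 * A * b2 * C + 2 * A * F - 2 * b2 * r * C * C - 2 * r * C * F
          + r * E * E - b2 * B * B + D * D)
    c1 = 2 * b2 * C * D + 2 * D * F - 2 * b2 * B * E
    c0 = b2 * b2 * C * C + 2 * b2 * C * F - b2 * E * E + F * F
    return c4, c3, c2, c1, c0

def quartic_value(coeffs, x):
    """Evaluate the polynomial by Horner's rule (highest power first)."""
    value = 0.0
    for c in coeffs:
        value = value * x + c
    return value

# Hypothetical ellipse [3] and a point (x0, y0) on its upper half.
a, b = 3.0, 2.0
x0 = 1.2
y0 = (b / a) * math.sqrt(a * a - x0 * x0)

# Hypothetical second conic [4]; F is solved so that it passes through (x0, y0).
A, B, C, D, E = 1.0, 0.5, 2.0, -1.0, 0.3
F = -(A * x0**2 + B * x0 * y0 + C * y0**2 + D * x0 + E * y0)

residual = quartic_value(quartic_coefficients(a, b, A, B, C, D, E, F), x0)
```

Because the derivation squares equation [7], a common point on the lower half of the ellipse (the negative root in [5]) satisfies the same quartic.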

A  Graphic  Depiction  of  a  Simple  Quartic  Space. 

Figure 4 illustrates an example of a quartic space imposed on a four city database. The segment
containing z1 and z2 is fixed in the plane. The segment containing z3 and z4 is allowed to pivot about
z4, with z3 being rotated counterclockwise ninety degrees through an angle $\theta$, in increments of ten
degrees. We are trying to find the locus such that a perturbation from the segment z1-z2 is equal to a
perturbation from segment z4-z3. Initially, when z3 is on the x-axis, the locus is a horizontal line lying
halfway between the two segments. For the sake of argument, when the angle $\theta = 0$, let the locus be
the line y = k. At ten degrees, the locus lifts slightly from the horizontal and develops curvature. At
about forty-five degrees, the locus develops a prominent maximum, and also manifests two inflection
points; it visually resembles the planar curve known as the Witch of Agnesi. Beyond forty-five degrees,
the locus gradually loses its smooth maximum and develops a pronounced cusp; the fact that four
distinct roots exist is now apparent. Finally, at ninety degrees of rotation, the locus becomes a diagonal
line connecting z1 with the initial position of z3.

Note  that  a  peculiar  phenomenon  has  occurred.  Although  the  segment  containing  z3  has  been 
allowed  to  rotate  ninety  degrees  (from  the  line  y  =  0  to  the  line  x  =  0),  the  corresponding  locus  has 
rotated  only  forty-five  degrees  (from  the  line  y  =  k  to  the  line  y  =  -x  +  2k).  Therefore,  the  angular  range 
of  the  output  is  only  half  that  of  the  input.  It  should  also  be  noted  that  realistically,  the  behavior  which 
produces the cusp does not occur, for when city z3 is rotated beyond a certain critical angle, it is
absorbed by segment z1-z2, and the shortest tour becomes z1-z3-z2-z4-z1, rather than z1-z2-z3-z4-z1.

Two Hull Segments Sharing an Endpoint Produce a Hyperbolic Locus.

In  this  section,  a  corollary  to  the  theorem  is  proven  to  show  that  in  a  special  case  which 
frequently  occurs  during  traveling  salesman  problem  solving,  the  locus  of  equal  perturbation  is 
hyperbolic  rather  than  quartic.  The  simplest  hypothesis  to  maintain  when  inserting  another  city  into  the 
hull  is  that  the  current  set  of  perturbations  will  merely  be  extended  by  the  new  city,  without  radically 
altering  the  topology.  As  will  become  apparent  below,  the  simple  extension  of  a  perturbation  is 
arbitrated  by  an  intrapath  hyperbolic  discriminator;  the  more  complex  operation  of  reasoning  between 
perturbations  requires  a  quartic  discriminator. 

Corollary. When two hull segments share a city, the Voronoi edge is a semi-hyperbola.

Proof: Refer to Figure 5 below. Let the endpoints of the segments be respectively $h_{i-1}$, $h_i$
and $h_i$, $h_{i+1}$. Let a be the length of the hull segment connecting $h_i$ and $h_{i+1}$, and b the
length of the segment connecting $h_i$ and $h_{i-1}$. Let p be an arbitrary point on the locus.

Let x be the distance from $h_{i+1}$ to p, y the distance from $h_i$ to p, and z the distance from
$h_{i-1}$ to p. We proceed to derive the locus:

$d_e(p, h_i, h_{i+1}) = d_e(p, h_{i-1}, h_i)$

$x + y - a = y + z - b$

$x - a = z - b$

$x - z = a - b$ [9]

QED. Equation [9] represents a semi-hyperbola passing through hull vertex $h_i$, with foci
at $h_{i-1}$ and $h_{i+1}$. It is bowed toward the longer of the two hull segments. In the case
when the two hull segments are of equal length, the semi-hyperbola degenerates to a
line.
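
Equation [9] suggests a simple test for which of two adjacent hull segments should capture a point. The following Python sketch is an illustration under stated assumptions: the name `hyperbolic_discriminator` and its sign convention are introduced here and do not come from the paper. It returns (x - z) - (a - b), which is zero exactly on the locus of [9].

```python
import math

def dist(p, q):
    """Euclidean distance d(p, q)."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def hyperbolic_discriminator(p, h_prev, h, h_next):
    """Compare the two perturbations that share hull vertex h.

    Returns (x - z) - (a - b) in the notation of equation [9]:
      negative -> segment (h, h_next) yields the shorter perturbation,
      positive -> segment (h_prev, h) yields the shorter perturbation,
      zero     -> p lies on the hyperbolic locus.
    """
    a = dist(h, h_next)      # length of segment h -> h_next
    b = dist(h, h_prev)      # length of segment h_prev -> h
    x = dist(p, h_next)
    z = dist(p, h_prev)
    return (x - z) - (a - b)

# Hypothetical corner of a square hull, traversed counterclockwise.
h_prev, h, h_next = (0.0, 4.0), (0.0, 0.0), (4.0, 0.0)
near_bottom = hyperbolic_discriminator((2.0, 1.0), h_prev, h, h_next)
on_diagonal = hyperbolic_discriminator((1.0, 1.0), h_prev, h, h_next)
at_vertex = hyperbolic_discriminator(h, h_prev, h, h_next)
```

Here the two segments have equal length, so the locus degenerates to the diagonal line through the shared vertex, as the corollary states; `on_diagonal` and `at_vertex` vanish, while `near_bottom` is negative.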


Figure  5.  A  hyperbolic  locus  results  when  two  hull  segments  share  an  endpoint. 


The  Quartic  Voronoi  Diagram  to  Determine  an  Exact  Solution  to  the  Single  Interior  City  Problem. 

It  is  clear  that  to  develop  the  delimiters  of  equal  hull  perturbation,  we  are  required  to  develop  a 
subset  of  the  union  of  quartics  and  hyperbolas  induced  by  the  elliptic  distance  between  pairs  of  hull 
segments. This subset is called the quartic Voronoi diagram of the hull, and is defined as follows.

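Definition. Given convex hull $\tau_H$, the quartic Voronoi diagram of the hull, $\mathrm{Vor}_Q(\tau_H)$, is
defined to be:

$\mathrm{Vor}_Q(\tau_H) = \{\, x \mid d_e(x, h_i, h_{i+1}) = d_e(x, h_j, h_{j+1}) \text{ for some } h_i, h_j \in H,\ i \neq j, \text{ and } d_e(x, h_i, h_{i+1}) \leq d_e(x, h_k, h_{k+1})\ \forall k \neq j \,\}$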
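
Read operationally, this definition says that a point belongs to the cell of the hull segment with the smallest elliptic distance, and lies on the diagram itself when the two smallest elliptic distances coincide. A minimal Python sketch under that reading follows; `quartic_cell` and the tolerance `tol` are hypothetical names, and the brute-force scan over all segments is chosen for clarity rather than efficiency.

```python
import math

def dist(p, q):
    """Euclidean distance d(p, q)."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def elliptic_distance(p, q, r):
    """d(p, q) + d(p, r) - d(q, r)."""
    return dist(p, q) + dist(p, r) - dist(q, r)

def quartic_cell(hull, x, tol=1e-9):
    """Return (index of the winning hull segment, True if x lies on a diagram edge).

    hull is a counterclockwise list of at least three (x, y) tuples; segment i
    joins hull[i] to hull[(i + 1) % len(hull)].
    """
    n = len(hull)
    costs = sorted(
        (elliptic_distance(x, hull[i], hull[(i + 1) % n]), i) for i in range(n)
    )
    (best, winner), (second, _) = costs[0], costs[1]
    # On an edge of the diagram the two cheapest perturbations have equal length.
    return winner, (second - best) <= tol

# Hypothetical square hull.
square = [(0.0, 0.0), (4.0, 0.0), (4.0, 4.0), (0.0, 4.0)]
inside_cell = quartic_cell(square, (2.0, 0.5))
on_edge = quartic_cell(square, (1.0, 1.0))
```

For this square, the point (2, 0.5) falls strictly inside the cell of the bottom segment, whereas (1, 1) lies on the edge separating the cells of the two segments that meet at (0, 0).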

For the seven city example introduced above, $\mathrm{Vor}_Q(\tau_H)$ is displayed in Figure 6. City d7 is properly
contained within the quartic Voronoi cell corresponding to the segment connecting d5 and d6, implying
the segment must be perturbed to capture d7. Note that prior to introducing city d7, the existing
optimal tour is the convex hull. The Voronoi diagram consists of some edges passing through the
existing tour's vertices (the cities on the hull), and some which do not pass through any of the cities in
the search space. In the former case, the edges are composed of hyperbolas, whereas in the latter case
the edges are pieces of quartic curves. It is important to keep this concept in mind, because it will be
revisited when the diagram is extended in general to accommodate a new city deposited into an
arbitrary optimal tour space. The generalization will be seen to function in the following manner: if
the existing optimal tour is extended simply by inserting a new city between two cities in the tour, the
locus of equal perturbation which arbitrates the decision is a semi-hyperbola; otherwise, a more
complex decision process must be invoked to reason across the perturbation space, and the locus is a
quartic polynomial.


[Figure 6 legend: q = quartic, h = hyperbola.]

Figure 6 (computed quartics and hyperbolas). The quartic Voronoi diagram of the hull as a connectivity map.
The curves passing through the hull vertices are hyperbolas, whereas the others are quartics. City d7 resides
within the quartic Voronoi cell corresponding to the segment connecting d5 and d6, and therefore the
optimal connection is as shown at the right.

A  Two-City-in-a-Hull  Example. 

We will continue in this vein by committing to computer memory the optimal perturbation for
city d7, and introducing yet another city into the perturbed space. Figure 7 depicts an instance of a
two-city-in-a-hull quartic Voronoi diagram. City d7 has been fixed, after having discovered its optimal
perturbation (d6-d7-d5) in a previous step. A new city (p) is about to be introduced. At the left, the
quartic edges demarcating optimal connectivity of p with d2, d3, d5, d6, and d7 are displayed, with the
globally relevant pieces highlighted. Similar plots may be obtained for other pairs of vertices; for the
complete interaction between segment d5-d6 and each of the other five hull segments, please refer to
the Appendix (where d7 appears under the alias of d8). When computed across all relevant pairs, the
quartic Voronoi diagram emerges (right). As an example, if p happens to fall in the quartic cell labeled
with the descriptor "6p75", the optimal tour must insert p between cities d6 and d7, in the already
established perturbation d6-d7-d5. However, if p happens to fall in the cell labeled "1p6;675", two
distinct perturbations are required: the one involving p is issued from segment d1-d6, whereas the one
involving d7 is issued from segment d6-d5.

Note that in the vicinity of city d7, a single perturbation from some hull segment is sufficient to
ensure optimality. However, if city p happens to reside in one of the quartic cells beyond this region, it is
necessary to perturb two hull segments to achieve optimality. Perhaps the most intriguing aspect of the
Voronoi diagram as a connectivity map is that it partitions the plane into cells which indicate precisely
how to maintain optimality when inserting an arbitrary city into the current tour. What this really
means is that one can predict how to attach a new city to an optimal tour, without specifying the
coordinates of the city ahead of time. The implications are profound, for if an efficient algorithm can be
designed to construct (or perhaps merely reflect) the quartic diagram for arbitrarily large sets of cities, it
follows that a dynamic programming approach is sufficient to solve the problem exactly. The
fourth-order complexity inherent to the loci of tour constraints in large part explains why those
approaches which subscribe to Dantzig's linear simplex have to date failed to solve the problem.


Figure 7 (computed quartics and hyperbolas). The quartic Voronoi diagram for two interior cities.


Nested  Hull  Traversal,  Outside-in  vs.  Inside-out. 

Since  the  theory  is  based  on  a  perturbation  of  the  convex  hull,  there  is  an  obvious  requirement 
to secure an algorithm which efficiently computes the hull. It is shown in reference [K3] that in the
plane, the hull may be optimally computed in $O(n \log h)$ time. With the traveling salesman problem,
we interpret n as the total number of cities, and h as the number of cities on the convex hull. For our
implementation, we will compute the entire nested hull decomposition, sometimes called the "onion"
[E2]. The purpose of computing the onion is to gain control of the search space by attempting to insert
the cities uniformly into the hull, to limit generation of "greedy" perturbations. A greedy perturbation
occurs when by mere virtue of having probed sufficiently far into the hull, a perturbed hull segment
continues to absorb cities which rightfully belong to another perturbation. Reference [C2]
demonstrates that a planar nested hull structure may be constructed in $O(n \log n)$ time. However, in
the implementation described below, we will utilize an algorithm due to Eddy [E1], with worst-case time
complexity $O(n^2)$, but with average run time $O(n)$.

Recall  that  by  convention  we  order  a  hull  with  counterclockwise  orientation,  smallest  ordinate 
first.  If  we  label  the  outer  hull  with  index  0,  and  label  each  inner  hull  with  an  ordinal  number  formed  by 
incrementing  the  index  by  1,  it  is  seen  that  during  processing,  a  city  will  be  inserted  based  on  a  primary 
key  equal  to  the  ordinal  number  of  its  hull,  and  a  secondary  key  equal  to  its  relative  counterclockwise 
position  within  its  hull.  The  exception  to  this  rule  is  to  reject  the  insertion  if  it  causes  some  unprocessed 
city  to  be  bypassed. 
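
The insertion order described above can be sketched as follows in Python. This is only an illustration under stated assumptions: it peels the onion by repeatedly applying Andrew's monotone chain hull algorithm (a substitute for the Eddy algorithm cited in the text), it breaks ties in smallest ordinate by smallest abscissa, it treats points lying on a hull edge as belonging to an inner hull, and it omits the rejection rule for bypassed cities. The names `cross`, `convex_hull`, and `onion_order` are hypothetical.

```python
def cross(o, a, b):
    """Twice the signed area of triangle o-a-b; positive for a counterclockwise turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points):
    """Andrew's monotone chain; returns the hull vertices in counterclockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def onion_order(points):
    """Return (hull index, position within hull, city) triples, outermost hull first."""
    remaining = set(points)
    order, layer = [], 0
    while remaining:
        hull = convex_hull(list(remaining))
        # Rotate the counterclockwise hull so the smallest ordinate comes first.
        start = min(range(len(hull)), key=lambda i: (hull[i][1], hull[i][0]))
        hull = hull[start:] + hull[:start]
        order.extend((layer, pos, city) for pos, city in enumerate(hull))
        remaining.difference_update(hull)
        layer += 1
    return order

# Hypothetical cities: a square, a triangle inside it, and a single core city.
cities = [(0.0, 0.0), (4.0, 0.0), (4.0, 4.0), (0.0, 4.0),
          (1.0, 1.0), (3.0, 1.0), (2.0, 3.0), (2.0, 1.5)]
order = onion_order(cities)
```

Sorting by the first two components of each triple reproduces the primary and secondary keys named in the text; the list is already produced in that order.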

A  rather  startling  twist  to  the  outside-in  approach  is  based  on  the  fact  that  the  quartic  loci  of 
equal  perturbation  extend  both  inside  and  outside  a  hull.  If  we  start  with  the  innermost  hull  (the  core 
of  the  onion),  in  theory  we  should  be  able  to  probe  outward  one  hull  at  a  time,  maintaining  optimality 
as we proceed. This technique, which we will call the inside-out approach, is in fact as valid as the other.
An  experiment  detailed  below  demonstrates  that  both  approaches  are  indeed  capable  of  producing  the 
optimal  tour. 

The  Topology  of  Quartic  Voronoi  Space,  in  the  Context  of  the  Shortest  Tour. 

In  this  section  we  develop  the  canonical  forms  required  to  maintain  quartic  incremental 
optimality.  Three  primitive  operations  will  be  informally  introduced,  and  then  developed  more 
rigorously. 

The  quartic  Voronoi  diagram  partitions  the  plane  into  cells,  the  boundaries  of  which  demarcate 
the  locus  of  the  shortest  tour  among  various  combinations  of  subtours.  It  is  intuitively  obvious  that  if  a 
newly  introduced  city  lies  within  an  arbitrarily  small  neighborhood  of  an  existing  optimal  tour,  the  new 
tour  can  be  formed  simply  by  extending  the  old  space  to  the  new  city.  This  topological  structure,  which 
we  shall  call  hyperbolic  extension  space,  is  computationally  the  simplest  hypothesis  to  be  entertained 
when  introducing  a  new  city.  Hyperbolic  extension  space  preserves  an  existing  tour  by  extending  an 
existing  perturbation  to  encompass  the  new  city. 

A  markedly  different  topology  is  manifested  when  an  extended  perturbation  interplays  with 
another  perturbation,  which  is  located  two  hull  segments  backward  (forward)  to  compel  the  issuing  of  a 
new perturbation at the preceding (subsequent) hull segment, which is called a shunt to the left (shunt
to the right). This topology is called quartic shunt space. It addresses the issue of maintaining optimality
in a radial fashion; i.e., in a manner roughly orthogonal to the convex hull which defines the baseline
tour.  An  intuitive  way  to  describe  this  canonical  form  is  that  it  acts  as  a  monitor  of  flanking  behavior  on 
both  sides  of  a  perturbation,  and  cedes  the  flank  to  a  neighboring  hull  segment  when  necessary  to 
maintain  optimality. 

Because  of  the  existence  of  the  two  distinct  topologies,  it  is  necessary  to  maintain  separate 
computational  hypotheses  in  parallel  (Figure  8).  An  extension  occurs  if  a  new  city  lies  in  one  of  the 
extension cells in the lower portion of the diagram at the top. However, if the city lies within a Voronoi
shunt  cell  as  indicated  at  the  top,  a  transition  to  quartic  shunt  space  occurs  when  two  existing 
perturbations  are  bridged  (diagram  at  bottom).  This  not  uncommon  spatial  phenomenon  may  radically 
alter  the  global  shape  of  the  tour,  and  must  be  hypothesized  every  time  a  new  city  is  processed,  to 
guarantee  tour  optimality. 

The  final  topology  deals  with  the  issue  of  perturbation  encroachment.  There  are  instances  when 
a perturbation probes sufficiently far into the hull that for the sake of optimality it is necessary for it to
claim cities from another perturbation. This spatial phenomenon produces the third topology, which we
call quartic interchange space. Quartic interchange space consists of those Voronoi cells which indicate
that  cities  from  one  or  more  perturbations  are  to  be  exchanged  into  an  extended  or  shunted 
perturbation.  Quartic  interchange  is  invoked  after  hyperbolic  extension  and  quartic  shunting,  which 
are  performed  in  parallel.  It  can  be  an  operation  of  quadratic  complexity,  because  an  existing 
perturbation  may  be  broken  into  sections  and  nullified  by  the  operation,  with  separate  sections  being 
absorbed  by  separate  perturbations.  Quartic  interchange  is  particularly  relevant  when  using  the 
outside-in  nested  hull  approach,  because  at  certain  moments  in  time  perturbations  from  across  the  hull 
begin  to  collide  with  those  on  the  near  side,  and  the  interaction  must  be  arbitrated  to  preserve 
optimality. 
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Assume  that  after  k  cities  have  been  optimally  connected  to  the  convex  hull,  we  would  like  to  know 
under  what  conditions  it  is  possible  to  simply  extend  the  tour  to  a  new  city,  vs.  radically  altering  the  tour 
by  permitting  the  new  city  to  link  others  which  are  currently  non-adjacent.  It  is  obvious  that  if  the  new 
city  is  within  a  small  spatial  neighborhood  of  the  existing  tour,  optimality  is  preserved  by  simply 
inserting  the  city  into  the  tour  between  two  cities.  The  question  of  which  two  cities  is  governed  by  a  set 
of  hyperbolas  which  pass  through  the  endpoints  of  the  segments  that  connect  the  ordered  list  of  cities 
defining the current optimal tour. Whether or not the new city is within a suitable neighborhood of the
current  tour  is  arbitrated  by  a  set  of  quartic  curves  which  discriminate  if  a  shunting  operation  is  in  order. 

Hyperbolic  Extension  Space. 

It  is  a  simple  matter  to  connect  a  new  city  to  an  existing  perturbation,  if  that  is  what  is  desired  (it 
will  be  seen  below  in  the  section  on  quartic  shunting  that  optimality  is  not  always  preserved  by  simply 
extending  a  perturbation).  The  city  is  connected  to  those  two  cities  in  the  perturbation  for  which  the 
elliptic  distance  is  minimal.  In  other  words,  an  existing  perturbation  should  be  extended  to  a  new  point 
if  and  only  if  the  length  of  the  perturbation  plus  the  elliptic  length  of  the  optimal  extension  to  the  point 
is less than the corresponding sum for all other perturbations. For example, referring to Figure 9, if a
new  city  is  found  to  reside  in  quartic  Voronoi  cell  "jk",  it  must  be  connected  to  cities  j  and  k,  while  at  the 
same time the segment joining j to k must be deleted. In this way, perturbation π3 is extended to
capture  the  new  city. 
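
The extension rule can be sketched in a few lines. This is a minimal illustration, not the author's implementation: it assumes the elliptic distance from a point p to a segment a-b is the extra length |pa| + |pb| - |ab| incurred by routing the segment through p (the definition is given earlier in the paper and is not restated in this section), and the function names are hypothetical.

```python
import math

def elliptic_distance(p, a, b):
    """Assumed definition: extra length incurred by routing segment a-b through p."""
    return math.dist(p, a) + math.dist(p, b) - math.dist(a, b)

def best_extension(path, p):
    """Return (i, cost) for the segment path[i]-path[i+1] nearest to p in the elliptic sense."""
    costs = [elliptic_distance(p, path[i], path[i + 1]) for i in range(len(path) - 1)]
    i = min(range(len(costs)), key=costs.__getitem__)
    return i, costs[i]

def extend(path, p):
    """Delete the best segment j-k and connect p to both j and k."""
    i, _ = best_extension(path, p)
    return path[:i + 1] + [p] + path[i + 1:]
```

Comparing the returned cost across all perturbations, and extending only the one with the smallest value, corresponds to the "minimal across all perturbations" rule stated in the text.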

There  are  cases  when  a  specific  perturbation  requires  reordering  to  maintain  optimality: 
namely,  when  some  city  is  nearer  to  the  city  to  be  inserted  than  either  of  the  endpoints  of  the  best 
segment found when minimizing the elliptic distance across all segments in the perturbation. However,
the  algorithm  required  to  implement  this  operation  is  of  linear  time  complexity,  and  does  not  detract 
from  the  performance  of  the  general  extension  philosophy. 
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Figure  9  (estimated  quartics  and  hyperbolas).  Extending  an  existing  perturbation  is  straightforward.  When  a 
new  city  enters  the  system,  it  is  connected  to  the  perturbation  for  which  the  elliptic  distance  to  a  segment  is 
minimal,  across  all  perturbations. 


Quartic  Shunt  Space. 

A  shunt  to  the  left  is  a  bridging  operation  which  connects  a  new  city  to  the  hull  segment  to  the 
left  of  its  extended  perturbation,  whereas  a  shunt  to  the  right  connects  the  city  to  the  hull  segment 
succeeding  it.  The  left  shunt  is  formed  by  connecting  the  city  to  its  nearest  neighbor  two  perturbations 
to  the  left,  and  then  following  the  respective  perturbation  subpaths  down  to  the  hull  vertices  of  the 
perturbation  at  the  left.  Any  cities  which  become  detached  by  this  process  must  be  reconnected  to  the 
perturbation space. The quartic shunt operator is a powerful tool, useful for merging two perturbations
of the same parity into one with opposite parity, lying between the other two.

An example of a quartic shunt to the left is shown in Figure 10 (the data is a handcrafted
approximation of a graphic depicted on p. 224 of reference [P5]). At the left side of the figure, city c12 has
just been introduced. Hyperbolic extension space calls for a perturbation of hull segment c5-c4,
indicated by the dotted lines. However, quartic shunt space calls for a shunt to be formed between c12
and c16, which is the nearest neighbor two hull segments to the left of c12's extended perturbation.
The endpoints of the shunt are followed down from c12 and c16 respectively to c5 and c6. City c13,
which  is  left  dangling  by  the  shunt  operation,  is  optimally  reattached  to  the  perturbation  space  by 
connecting  it  to  segment  c6-c7. 


Figure 10. A quartic shunt to the left, using the Preparata and Shamos dataset.
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Quartic  Interchange  Space. 

Quartic  interchange  space  dictates  when  a  new  city's  perturbation,  whether  it  be  an  extension 
or  shunt,  has  encroached  sufficiently  far  into  the  hull  to  encompass  cities  which  earlier  were  optimally 
installed  in  some  other  perturbation.  Every  time  a  new  city  is  processed,  it  must  be  hypothesized  that 
the  extending  perturbation  may  now  have  encroached  deep  enough  into  the  hull  to  begin  influencing 
perturbations  on  the  other  side.  While  at  some  time  in  the  past  it  may  have  been  legitimate  to  have 
constructed a cross-hull perturbation to maintain optimality, it may now be time to partially or
completely  "undo"  the  perturbation  by  swapping  some  of  its  cities  to  the  nearside  of  the  hull. 

Quartic  interchange  is  iterative.  The  currently  extended  perturbation  is  compared  to  all  other 
perturbations  in  the  space.  If  an  exchange  of  cities  is  warranted,  it  is  permitted  to  occur,  and  the  revised 
perturbation  space  is  subjected  to  interchange  once  again.  This  action  is  repeated  until  no 
improvement  is  obtained. 
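
The repeat-until-no-improvement control flow can be sketched as follows. This is only an illustration under stated assumptions: the exchange test used here is a plain single-city relocation, swapped in as a stand-in for the quartic interchange test, and the function names are hypothetical.

```python
import math

def tour_length(tour):
    """Length of the closed tour visiting the cities in list order."""
    return sum(math.dist(tour[i], tour[(i + 1) % len(tour)]) for i in range(len(tour)))

def improve_once(tour):
    """Return the first strictly shorter tour obtained by relocating one city, or None."""
    base = tour_length(tour)
    for i in range(len(tour)):
        rest = tour[:i] + tour[i + 1:]
        for j in range(len(rest)):
            candidate = rest[:j] + [tour[i]] + rest[j:]
            if tour_length(candidate) < base - 1e-12:
                return candidate
    return None

def iterate_until_stable(tour):
    """Apply exchanges repeatedly until no exchange shortens the tour."""
    while True:
        better = improve_once(tour)
        if better is None:
            return tour
        tour = better
```

The loop terminates because every accepted exchange strictly shortens the tour and there are only finitely many orderings of the cities.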

The  General  Voronoi  Diagram  for  the  Euclidean  Traveling  Salesman  Problem. 

Earlier,  we  proved  that  the  Voronoi  diagram  of  the  convex  hull  has  quartic  edges,  but  possesses 
hyperbolic edges between adjoining hull segments. For the general case, this concept may be extended.
The generalized Voronoi diagram partitions the plane into three types of cells: hyperbolic extension
cells; quartic shunt cells; and quartic interchange cells. Before the kth city is introduced, one computes
the Voronoi diagram for the set of previously introduced k-1 cities. As in the convex hull case,
computation must once again resort to an elliptic distance comparison, except now three different types
of tour topologies must be hypothesized, rather than the single hypothesis entertained by introducing a
single city into the hull. The space is once again quartic, because to obtain the boundaries of equal
perturbation to the kth city, two variable distances are added, and the sum of a set of fixed distances
(the length of a specific hypothesized subpath) is subtracted. Rather than reasoning with perturbed
segments  on  the  hull,  one  must  reason  with  tangible  segments  which  are  part  of  an  existing  tour, 
hypothesized  segments  which  form  shunts  between  perturbations,  and  hypothesized  segments  which 
form  interchange  links  with  other  perturbations.  To  formalize,  the  quartic  Voronoi  diagram  indicated 
by the optimal tour for k-1 cities, denoted $\mathrm{VorQ}(\tau_{k-1})$, is a function $\Psi$ of five arguments:

$$\mathrm{VorQ}(\tau_{k-1}) = \Psi\left[\,\tau_{k-1},\ \mathrm{Ext}_H(\tau_{k-1}),\ \mathrm{Shunt}_L(\tau_{k-1}),\ \mathrm{Shunt}_R(\tau_{k-1}),\ \mathrm{Inter}(\tau_{k-1})\,\right]$$

where:

$\tau_{k-1}$ = the optimal tour for k-1 cities;
$\mathrm{Ext}_H(\tau_{k-1})$ = the hyperbolic extension space induced by $\tau_{k-1}$;
$\mathrm{Shunt}_L(\tau_{k-1})$ = the left quartic shunt space induced by $\tau_{k-1}$;
$\mathrm{Shunt}_R(\tau_{k-1})$ = the right quartic shunt space induced by $\tau_{k-1}$;
$\mathrm{Inter}(\tau_{k-1})$ = the quartic interchange space induced by $\tau_{k-1}$.


The  Principle  of  Quartic  Incremental  Optimality. 

It  is  clear  we  are  proceeding  with  a  strategy  akin  to  dynamic  programming.  Namely,  when 
adding  a  new  city  to  the  interior,  we  attach  the  city  to  an  existing  optimal  tour  in  a  way  indicated  by  the 
quartic Voronoi diagram computed just prior to the city's introduction. Formally, the shortest tour $\tau_k$
for k cities is a function A of two arguments:

$$k = |H| + 1, \ldots, n;$$

$$\tau_k = A\left((c_{kx}, c_{ky}),\ \mathrm{VorQ}(\tau_{k-1})\right);$$

where:

$\tau_k$ = the optimal tour containing k cities;

$(c_{kx}, c_{ky})$ = the coordinates of the kth city to be introduced;

$\tau_{k-1}$ = the optimal tour containing k-1 cities;

$\mathrm{VorQ}(\tau_{k-1})$ = the quartic Voronoi diagram prescribed by k-1 cities;

$|H|$ = the order of the convex hull H;

n = the total number of cities to be processed.
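
The shape of this recurrence can be illustrated with a short sketch. It is not the operator A itself: the stand-in `insert_city` below performs only a cheapest insertion (the extension step), omits shunting and interchange, and therefore does not guarantee optimality. The function names and the assumption that the hull is supplied in tour order are hypothetical.

```python
import math

def insertion_cost(p, a, b):
    """Extra tour length from replacing edge a-b with a-p-b."""
    return math.dist(a, p) + math.dist(p, b) - math.dist(a, b)

def insert_city(tour, p):
    """Stand-in for A: insert p into the closed tour at the cheapest edge."""
    n = len(tour)
    i = min(range(n), key=lambda i: insertion_cost(p, tour[i], tour[(i + 1) % n]))
    return tour[:i + 1] + [p] + tour[i + 1:]

def incremental_tour(hull, interior):
    """Start from the hull as the baseline tour, then add cities one at a time."""
    tour = list(hull)              # baseline tour on |H| cities
    for p in interior:             # k = |H| + 1, ..., n
        tour = insert_city(tour, p)
    return tour
```

Each pass through the loop consumes the tour for k-1 cities and produces the tour for k cities, which is the dynamic-programming flavor the text describes.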

Unique  vs.  Multiple  Numbers  of  Distinct  Optimal  Tours  as  a  Function  of  the  Quartic  Space. 

Proper containment within a quartic Voronoi cell guarantees a unique tour. In nondegenerate
cases, there can be no more than three distinct optimal tours because quartic Voronoi edges converge in groups
of three just as linear Voronoi edges do. If a newly introduced city is situated at a Voronoi junction (a
point  where  three  quartics  come  together),  three  optimal  tours  exist;  whereas  if  the  city  lies  on  a  quartic 
but  not  on  a  junction,  two  optimal  tours  exist.  In  the  degenerate  case  when  cities  are  equispaced  in  the 
plane,  there  may  be  more  tours  than  in  the  nondegenerate  case.  For  example,  a  hull  consisting  of  a 
regular  polygon  containing  k  sides  produces  k  optimal  tours  if  the  introduced  city  lies  at  the  center  of 
the  polygon,  because  the  center  is  at  the  intersection  of  k  Voronoi  edges  (degenerate  hyperbolas).  As  a 
point  of  interest,  it  can  be  seen  that  if  one  allows  the  number  of  vertices  of  a  regular  polygon  to 
approach  infinity,  so  does  the  number  of  optimal  tours  connecting  the  hull  to  the  center.  However,  in 
this  case,  the  limiting  form  of  the  polygon  is  a  circle,  and  a  paradox  arises,  because  the  Euclidean 
distance  between  adjacent  hull  vertices  approaches  zero  as  the  number  of  distinct  optimal  tours  rises  to 
infinity. 
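
The regular-polygon case is easy to check numerically. The sketch below assumes, as elsewhere in this section, that attaching a city to a hull segment costs |pa| + |pb| - |ab|; the function name and tolerance are illustrative choices.

```python
import math

def tied_segments(k, radius=1.0, tol=1e-9):
    """Count hull segments of a regular k-gon that tie for cheapest attachment of the center."""
    hull = [(radius * math.cos(2 * math.pi * j / k),
             radius * math.sin(2 * math.pi * j / k)) for j in range(k)]
    center = (0.0, 0.0)
    costs = []
    for j in range(k):
        a, b = hull[j], hull[(j + 1) % k]
        costs.append(math.dist(center, a) + math.dist(center, b) - math.dist(a, b))
    best = min(costs)
    return sum(1 for c in costs if abs(c - best) <= tol)
```

Every segment lies at the same distance from the center, so all k attachment costs agree and the function returns k, matching the k optimal tours described above.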

Future  Work  on  A  Proof  of  the  Admissibility  of  A . 

Mathematical induction will be used in an attempt to show that A is admissible. For the case of
inserting a single city into the hull (i.e., k = 1), the shortest tour is trivially depicted by $\mathrm{VorQ}(\tau_H)$, so the
initial  step  of  the  inductive  proof  is  satisfied.  What  remains  to  be  shown  is  that  the  sequencing  of  the 
operations  of  hyperbolic  extension,  quartic  shunting,  and  quartic  interchange  preserves  optimality. 

Summary  of  the  Generalization  of  the  Voronoi  Diagram  to  ETSP. 

During the first decade of the twentieth century, Voronoi's intention was to develop a mathematical
structure which could be used to rapidly associate a query point with the nearest point contained within
a known two-dimensional constellation of points [V2]. In decades of subsequent work, the
dimensionality constraint has been relaxed, as well as the specification that both the query object and
known objects be points [PS, E2]. This paper has focused on an exact solution to the Euclidean traveling
salesman problem, and consequently has introduced a new distance metric, known as the elliptic
distance, used to compute the distance of a floating query point from two fixed points. The limiting
form of this metric, when intersected with that from another perturbed segment, induces a quartic
structure on the Voronoi diagram. This non-linear search space permits a feasible testbed arena for the
problem which the traditional Voronoi diagram cannot provide.

Dillencourt's  nondegenerate  counterexample  [D3]  to  Shamos'  conjecture  that  the  shortest  tour 
must  traverse  the  Voronoi  dual  is  shown  at  the  left  of  Figure  11.  At  the  right  is  the  quartic  Voronoi 
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diagram,  which  depicts  connectivity  of  the  three  interior  cities  to  the  convex  hull.  All  the  curves 
indicated are hyperbolas, except the short one plotted between segment t1-t7 and segment t2-t3, which
is  a  quartic  locus.  The  shortest  tour  is  t1-t2-t4-t3-t5-t7-t6-t1,  but  the  linear  Voronoi  dual  does  not  permit 
t3  and  t4  to  be  connected.  Note  that  in  this  instance,  if  one  merely  attaches  each  of  the  three  interior 
cities  to  the  hull  segment  indicated  by  the  quartic  Voronoi  cell  in  which  it  resides,  using  the  hyperbolic 
extension operator, the shortest tour is obtained. Of course, the actual computer run invokes the
processes  of  quartic  shunting  and  quartic  interchange,  but  in  this  case  the  fourth-order  operators  fail  to 
improve  the  tour  produced  by  hyperbolic  extension  space. 


Figure 11 (computed edges). The Voronoi diagram for the Dillencourt data (left), and its one-city-in-a-hull
quartic Voronoi diagram (right). This data is the first known nondegenerate counterexample to Shamos'
conjecture that the shortest Euclidean tour must traverse adjacent Voronoi cells. In the optimal tour, t4 is
connected to t3. It is apparent that t4 and t3 can be connected in the quartic diagram, but not in the linear one.


The traditional Voronoi diagram is a proximity map, where at a glance it can be seen which
object in a search space is nearest to a query point. The quartic Voronoi diagram is a connectivity map,
which displays shortest tour connectivity information for the kth city, as a function of a constellation of
k-1 fixed cities. It has been shown that the process of intersecting an infinite set of confocal ellipses
symmetric about an existing ETSP link with those about another link produces a quartic curve. The
quartic curve, which in practice frequently reduces to a hyperbola because of the tendency of extension
space to dominate during nested hull traversal, serves the same role as the perpendicular bisector does
in the traditional Voronoi diagram. Thus, instead of being piecemeal linear, the extended Voronoi
structure for ETSP is piecemeal quartic. The final discrepancy between the traditional Voronoi diagram
and the quartic diagram deals with the issue of boundedness. Traditional Voronoi diagrams are
unbounded; i.e., cells on the perimeter of the diagram are permitted to extend to infinity. However, for
the Euclidean traveling salesman problem, the quartic Voronoi diagram is bounded by the convex hull
of cities, so that no cell is unbounded. Nevertheless, this is not to say that the boundedness constraint
cannot be loosened to incrementally add new cities exterior to the hull, which is the philosophy behind
the inside-out nested hull approach (this technique is illustrated in an example below, in which the
innermost hull is used as a baseline optimal tour from which to add cities incrementally to the
exterior). Table 1 summarizes the three distinctions between the
traditional, linear Voronoi diagram and its quartic counterpart designed for exact solution of the
Euclidean traveling salesman problem.
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Traditional  Usage  - >  Extension  to  ETSP 

Proximity Map                        ->  Shortest-tour Connectivity Map

Cell Boundaries are Line Segments    ->  Cell Boundaries are Piecemeal Hyperbolic and Quartic

Perimeter of Diagram is Unbounded    ->  Perimeter of Diagram is Bounded by Convex Hull

Table  1.  Extension  of  the  Voronoi  Diagram  to  the  Euclidean  Traveling  Salesman  Problem. 


The Computational Complexity of the Hull Perturbation Approach.
a.  The  Time  Complexity  of  Blind  Search. 

The  convex  hull  of  a  set  of  cities  serves  as  a  control  structure  from  which  to  initiate 
perturbations. In this section we naively derive an expression for the time complexity of the approach, if
the tack is taken to blindly generate perturbations from hull segments in an arbitrary fashion. The
derivation  hinges  upon  making  a  substitution  at  an  opportune  moment  when  the  binomial  coefficients 
are  manifested.  It  is  hoped  that  future  research  will  lend  insight  into  techniques  to  improve  the  naive 
bound.  The  number  of  perturbations  in  the  optimal  tour  cannot  exceed  the  size  of  the  hull,  because  a 
perturbation  is  defined  to  be  an  excursion  into  the  interior  from  a  hull  segment,  the  number  of  which  is 
equal  to  the  number  of  hull  vertices.  Let  H  be  the  set  of  cities  on  the  convex  hull,  and  I  be  the  set  of  cities 
lying  on  the  interior  of  the  hull.  Let  the  rank  of  H  be  h,  and  the  rank  of  I  be  i.  If  n  is  the  total  number  of 
cities,  then  n  =  h  +  i.  From  each  hull  segment,  the  set  of  interior  cities  may  be  visited  zero  at  a  time, 
one  at  a  time,  two  at  a  time, ...  or  i  at  a  time.  Thus  the  total  number  of  computations  required  to  find 
the  shortest  tour  is: 


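$$
\begin{aligned}
h\binom{i}{0} + h\binom{i}{1} + h\binom{i}{2} + \cdots + h\binom{i}{i}
&= h\left[\binom{i}{0} + \binom{i}{1} + \binom{i}{2} + \cdots + \binom{i}{i}\right] \\
&= h \cdot 2^{i} \\
&= h \cdot 2^{\,n-h} \\
&= \frac{h \cdot 2^{n}}{2^{h}}
\end{aligned}
$$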
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Some  observations  may  be  made  about  this  naive  bound.  Note  that  a  large  hull  is  desirable,  because  the 
effect of the denominator is to diminish the $2^n$ term. Although the complexity is exponential, it is an
order-of-magnitude improvement over a brute force approach, which is of factorial complexity.
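
A quick numerical check of the naive bound is sketched below. The function names are hypothetical, and the brute-force count of (n-1)!/2 distinct tours is the standard figure for a symmetric problem rather than a number taken from this paper.

```python
import math

def naive_bound(n, h):
    """h * (sum of binomial coefficients over the i = n - h interior cities) = h * 2**(n - h)."""
    i = n - h
    total = h * sum(math.comb(i, j) for j in range(i + 1))
    assert total == h * 2 ** i     # the binomial coefficients sum to 2**i
    return total

def brute_force_tours(n):
    """Number of distinct closed tours on n cities when direction is ignored."""
    return math.factorial(n - 1) // 2
```

For a fixed n, increasing the hull size h shrinks the bound, which is the effect of the denominator noted in the text.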


b.  The  Time  Complexity  of  Quartically-Controlled  Search,  using  Nested  Hull  Traversal. 

As  a  city  is  processed  during  nested  hull  traversal,  there  are  three  general  phases  of  computing 
which  must  be  performed  in  sequence.  The  first  is  a  linear  time  operation  to  extend  the  current 
topology  by  minimizing  the  elliptic  distance  from  all  perturbations  to  the  new  city,  which  includes 
reordering a perturbation if necessary. The second phase involves two linear time operations, to
construct  the  left  and  right  quartic  shunt  topologies,  after  which  they  are  compared  with  the  extension 
topology  to  render  the  one  with  shortest  tour  length.  Finally,  the  quartic  interchange  space  is 
computed,  which  is  a  quadratic  operation,  because  the  left  and  right  tour  edges  produced  by  the 
insertion  of  the  new  city  act  as  windows  to  possibly  absorb  whole  groups  of  cities  from  perturbations  on 
the far side of the hull. Therefore, to process each new city, the worst case time complexity is quadratic.
The sum of a set of quadratic expressions in k, where k ranges from 0 to n, is a closed form expression
equal to n * (n + 1) * (2n + 1) / 6, which is of cubic order in n.
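
Expanding the closed form makes the cubic order explicit. The expansion below treats the per-city cost as exactly k^2, which is an idealization of "quadratic in k":

```latex
\sum_{k=0}^{n} k^{2} \;=\; \frac{n(n+1)(2n+1)}{6}
\;=\; \frac{n^{3}}{3} + \frac{n^{2}}{2} + \frac{n}{6}
\;=\; O(n^{3})
```

For n = 48 the sum evaluates to 48 * 49 * 97 / 6 = 38024 elementary steps under this idealization.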

In  summary,  a  computer  implementation  of  the  principle  of  quartic  incremental  optimality 
requires O[n * log n] preprocessing time to compute the nested hull decomposition, O[n] storage for
intercity distances and optimal partial tours, and O[n^3] time complexity to maintain incremental
optimality. 

An  Example:  The  Forty-eight  Capital  Problem. 

The  shortest  tour  connecting  the  forty-eight  capitals  of  the  contiguous  United  States  remained 
an intriguing open problem until Shen Lin obtained an optimal solution in 1985 [A1, A2]. Each
coordinate of the database represents the location of a Bell telephone office in the capital of a state.
The principles of nested hull traversal and quartic incremental optimality were leveraged against this
database. The nested hull structure for this data is exhibited in Figure 12. The implementation to derive
the convex hull is based on an iterative enhancement made by the author to an algorithm developed in
the seventies by W. F. Eddy [E1].
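
A nested hull decomposition can be sketched by peeling convex hulls repeatedly. Note the substitutions: this sketch uses Andrew's monotone chain rather than the author's enhancement of Eddy's algorithm, and the simple peeling loop is quadratic in the worst case rather than achieving the n log n preprocessing time quoted above. Function names are hypothetical.

```python
def cross(o, a, b):
    """Twice the signed area of triangle o-a-b; positive for a counterclockwise turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points):
    """Andrew's monotone chain; hull vertices in counterclockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def nested_hulls(points):
    """Peel hulls from the outside in; returns a list of layers, outermost first."""
    remaining = set(points)
    layers = []
    while remaining:
        hull = convex_hull(list(remaining))
        layers.append(hull)
        remaining -= set(hull)
    return layers
```

Reading the returned list front to back gives the outside-in traversal order; reversing it gives the inside-out order.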
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Working  from  the  Outer  Hull  to  the  Inner. 

First,  the  problem  was  attacked  by  starting  with  a  baseline  tour  consisting  of  the  outer  convex 
hull,  and  probing  inwards.  Each  nested  hull  is  traversed  in  counterclockwise  order  to  insert  new  cities, 
before  the  next  inner  hull  is  processed.  A  temporal  history  of  incremental  optimality  is  shown  in  Figure 
13.  The  inserted  city  and  the  quartic  function  triggered  are  listed  below  each  graphic.  The  interesting 
cases  are  those  which  are  not  mere  hyperbolic  extensions,  but  those  which  also  involve  quartic  shunts 
and  exchanges.  The  most  dramatic  quartic  shunt  occurs  in  row  five,  column  seven,  when  the 
introduction  of  Springfield,  Illinois  produces  a  shunt  to  the  right.  Springfield  is  originally  processed  by 
the  hyperbolic  extension  operator,  which  compels  attachment  to  the  perturbation  which  contains 
Lincoln,  Nebraska.  However,  quartic  shunt  space  produces  a  shorter  tour  by  conjoining  Springfield  with 
a  perturbation  to  the  right  containing  Frankfort,  Kentucky.  Another  interesting  iteration  occurs  in  row 
four,  column  six,  when  the  introduction  of  Cheyenne,  Wyoming  into  hyperbolic  extension  space  causes 
the  transposition  of  Bismarck,  North  Dakota  with  Pierre,  South  Dakota.  Subsequently,  quartic 
interchange causes Salt Lake City, Utah to be drawn out of its perturbation with Carson City, Nevada
into  the  perturbation  containing  Cheyenne.  The  algorithm  correctly  terminates  with  Lin's  optimal  tour, 
shown  in  row  six,  column  two. 

Working  from  the  Inner  Hull  to  the  Outer. 

Next,  the  same  data  was  processed  by  starting  with  the  innermost  nested  hull  and  probing 
outward.  Because  the  quartic  Voronoi  edges  extend  through  the  hull  vertices  both  on  the  inside  and 
the  outside,  the  nested  hull  technique  is  theoretically  valid  in  either  direction.  Figure  14  illustrates  the 
optimal  subtours  produced  by  the  algorithm  when  starting  with  the  innermost  hull  and  probing 
successively  outward  through  the  outer  hulls.  In  this  case,  the  innermost  hull  contains  only  four  cities,  so 
the  original  number  of  perturbations  is  four.  There  is  an  interesting  tradeoff  on  time  complexity  when 
working with fewer perturbations. Again, note that the optimal tour is produced.


Figure  14.  Working  outwards  from  the  inner  hull,  employing  quartic  nested  hull  traversal. 


Summary. 

The  chief  result  of  the  research  to  date  is  a  proof  that  the  underlying  search  space  (the  Voronoi 
diagram)  for  the  Euclidean  traveling  salesman  problem  is  non-linear;  specifically,  the  space  is  quartic 
when reasoning across subtours, and hyperbolic when reasoning within subtours. These facts become
apparent  when  one  realizes  that  reasoning  about  shortest  tours  is  a  process  which  inherently  involves 
the  intersection  of  a  pair  of  ellipses,  the  foci  of  which  are  defined  by  pairs  of  cities.  Ellipse  intersection  is 
an operation which in the worst case produces a fourth-order equation (quartic). In the special case in
which  two  ellipses  share  a  focus,  the  locus  is  a  semi-hyperbola.  The  discovery  of  the  non-linear  search 
space  has  prompted  the  author  to  devise  an  algorithm  which  utilizes  three  operators  to  constrain 
search:  hyperbolic  extension;  quartic  shunting;  and  quartic  interchange.  To  limit  the  generation  of 
greedy  perturbations,  cities  are  gradually  inserted  in  an  incremental  fashion,  according  to  their  position 
within  the  nested  hull  structure  of  the  city  database.  The  new  knowledge  about  the  non-linear  search 
space has resulted in an O[n^3] solution to optimality of a forty-eight city problem. The solution is
obtained  both  by  beginning  with  the  outer  convex  hull  and  probing  inward,  or  by  starting  with  the 
innermost  hull  and  probing  outwards. 
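
The shared-focus special case can be worked out directly. The derivation below assumes that the elliptic distance from a point p to a segment with endpoints a and b is |pa| + |pb| - |ab|, and considers two adjoining segments a-b and b-c that share the city b. Setting the two distances equal gives the boundary between their cells:

```latex
\begin{aligned}
|pa| + |pb| - |ab| &= |pb| + |pc| - |bc| \\
|pa| - |pc| &= |ab| - |bc|
\end{aligned}
```

The right-hand side is a constant, so p lies on one branch of a hyperbola with foci a and c, which is consistent with the semi-hyperbola described above. When |ab| = |bc| the branch degenerates to the perpendicular bisector of a-c.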

Future  Directions  of  the  Research. 

It  is  desirable  to  pursue  a  rigorous  proof  of  the  theory  of  quartic  incremental  optimality;  the 
proof will proceed by induction. Also, to facilitate further empirical analysis, the theory as it currently
stands will continue to be developed and leveraged against several large databases of cities for which
the optimal tour is known, in an attempt to find examples which counterindicate the algorithm.
Short-term plans include runs against a 127-city database [R1], and a 532-city benchmark for which the
optimal solution has been developed [P1]. Subsequently, the runtime for the experimental data will be
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plotted  as  a  function  of  the  number  of  cities,  to  determine  if  the  algorithmic  ceiling  function  is  of  cubic 
order  as  predicted  by  the  analysis. 
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APPENDIX 


The  appendix  consists  of  a  series  of  five  computer  plots  which  graphically  portray  the  quartic 
(fourth-order  polynomial)  loci  which  exist  naturally  when  hypothesizing  a  solution  to  the 
two-city-in-a-hull Euclidean traveling salesman problem described in the main body of text. These loci
were  discovered  empirically  by  the  author  during  the  summer  of  1989;  it  was  only  later  after  several 
months  of  research  that  a  proof  was  obtained  to  demonstrate  algebraically  that  the  loci  are  actually 
comprised  of  quartic  and  hyperbolic  curves.  It  may  be  of  interest  to  some  readers  to  know  how  the  plots 
were  obtained.  An  algorithm  was  designed  to  capture  the  knowledge  about  the  possible  ways 
(permutations)  to  connect  city  d8  and  one  other  arbitrary  city  to  the  convex  hull  (d1-d2-d3-d4-d5-d6).  In 
the  eight  city  example,  there  are  six  possible  topologies  to  compare  between  any  two  hull  segments: 
the  two  ways  to  attach  d8  and  the  arbitrary  city  to  each  of  the  two  hull  segments  (which  yields  a 
subtotal  of  four),  and  the  two  ways  to  attach  one  city  to  one  segment  and  the  second  to  the  other 
segment.  In  general,  this  means  that  there  are  fifteen  quartic  loci  (the  combination  of  six  topologies 
taken  two  at  a  time)  among  which  to  arbitrate  when  hypothesizing  a  shortest  tour.  The  algorithm  was 
encoded in Lisp and run on an artificial intelligence computer workstation. A set of experiments was
conducted  as  follows:  the  computer  mouse  was  moved  about  its  pad  on  the  desk,  which  caused  the 
cursor  to  move  about  the  monitor  screen  displaying  the  constellation  of  cities.  If  the  length  of  a  specific 
arrangement  of  cities  was  within  one  unit  of  that  of  another  arrangement,  a  black  dot  was  plotted  to 
the  screen  at  the  position  of  the  cursor.  This  action  provided  positive  feedback  to  the  author,  who 
dynamically  readjusted  the  position  of  the  mouse  to  obtain  "more  black  dots"  in  a  continuous  fashion, 
until  an  entire  quartic  curve  manifested  itself.  When  a  point  in  time  was  reached  in  which  it  became 
obvious  that  no  more  loci  were  forthcoming,  the  session  was  terminated,  and  another  pair  of  segments 
was selected for experimentation. In the set of graphics selected for exhibit here, one of the pair is
always  segment  d6-d5.  Also  note  that  what  in  the  text  was  referred  to  as  city  "d7"  is  here  called  city 
"d8".  It  should  also  be  pointed  out  that  in  exhibits  A-3,  A-4,  and  A-5,  the  quartic  plots  are  superimposed 
over  the  solution  to  the  one-city-in-a-hull  problem  discussed  in  the  text. 
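
The plotting experiment can be imitated without a mouse by scanning a grid. The sketch below is a simplified one-city analogue written in Python rather than the original Lisp: it marks grid points where attaching a single city to either of two segments costs nearly the same, using the assumed cost |pa| + |pb| - |ab|. The two-city topologies compared in the appendix are not reproduced, and all names are hypothetical.

```python
import math

def insertion_cost(p, a, b):
    """Extra tour length from routing segment a-b through p."""
    return math.dist(p, a) + math.dist(p, b) - math.dist(a, b)

def near_tie_points(seg1, seg2, xs, ys, tol=1.0):
    """Return grid points where the two attachment costs differ by less than tol."""
    dots = []
    for x in xs:
        for y in ys:
            p = (x, y)
            if abs(insertion_cost(p, *seg1) - insertion_cost(p, *seg2)) < tol:
                dots.append(p)          # the "black dot" of the experiment
    return dots
```

With a tolerance of one unit, as in the experiment, the dots form a band around the locus; shrinking the tolerance thins the band toward the curve itself.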
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Exhibit  A-4 .  The  quartic  interplay  between  segment  d6-d5 
and segment d2-d1, when attaching d8 and an arbitrary city
to  the  convex  hull  to  produce  the  shortest  Euclidean  tour. 
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Abstract 

This  paper  proposes  a  new  optimal  digital  redesign  technique  for  finding  a  dynamic 
digital control law from the available analog counterpart and simultaneously minimizing
a quadratic performance index. The proposed technique can be applied to a system
with  a  more  general  class  of  reference  inputs,  and  the  developed  digital  regulator  can  be 
implemented  via  low  cost  microcomputers. 
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1.  Introduction 

Many practical dynamic systems are described by continuous-time state equations for
which  a  state-feedback  gain  and  a  forward  gain  are  designed  based  upon  some  specific 
desired  goals.  Advances  in  digital  control  theory  and  industrial  electronics  have  made  a 
dramatic  extension  in  the  possibilities  of  replacing  these  analog  controllers  by  the  equivalent 
digital controllers so that they can be implemented via high performance, low cost microprocessors
and associated microelectronics. The conversion of the designed continuous-time
controller  (analog  controller)  to  an  equivalent  discrete-time  controller  (digital  controller) 
so  that  the  responses  of  the  redesigned  equivalent  digital  system  closely  match  those  of 
the  original  analog  system  for  the  same  input  and  initial  conditions  is  a  digital  redesign 
problem  [1].  The  digital  redesign  problem  can  be  described  as  follows. 

Consider  the  linear  controllable  continuous-time  system  described  by 

$$\dot{x}_c(t) = A x_c(t) + B u_c(t); \quad x_c(0) \tag{1}$$

where $x_c(t)$ and $u_c(t)$ are an $n \times 1$ state vector and an $m \times 1$ input vector, respectively, and
$A$ and $B$ are constant matrices of appropriate dimensions. Let the state-feedback control
law be

$$u_c(t) = -K_c x_c(t) + E_c r(t) \tag{2}$$

where $K_c$ is an $m \times n$ feedback gain, $E_c$ is an $m \times m$ forward gain, and $r(t)$ is an $m \times 1$
reference input. The resulting closed-loop system becomes

$$\dot{x}_c(t) = (A - BK_c) x_c(t) + B E_c r(t); \quad x_c(0) \tag{3}$$

Let  the  state  equation  of  a  continuous-time  system  which  contains  the  same  system  matrix 
A  and  input  matrix  B  of  the  system  in  (1),  with  a  different  input,  be  represented  by 

$$\dot{x}_d(t) = A x_d(t) + B u_d(t); \quad x_d(0) \tag{4a}$$

where $u_d(t)$ is an $m \times 1$ piecewise-constant input function,

$$u_d(t) = u_d(kT) \quad \text{for } kT \le t < (k+1)T \tag{4b}$$

and $T$ is the sampling period. A zero-order hold is utilized in (4). The solution of the
state equation in (4) is

$$x_d(t) = e^{A(t-kT)} x_d(kT) + \left[ \int_{kT}^{t} e^{A(t-\lambda)} B \, d\lambda \right] u_d(kT) \quad \text{for } kT \le t < kT + T \tag{5}$$

For $t = kT + T$, the equivalent discrete-time model of the continuous-time system in (4)
can be written as

$$x_d(kT+T) = G x_d(kT) + H u_d(kT); \quad x_d(0) \tag{6a}$$

where

$$G = e^{AT} \quad \text{and} \quad H = \int_0^T e^{A\lambda} B \, d\lambda = [G - I_n] A^{-1} B \tag{6b}$$
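The discretization in (6b) can be checked numerically. The following is a minimal sketch, not the authors' code: the plant matrices are hypothetical, the helper `expm` is a simple stand-in for a library matrix exponential, and the augmented-matrix construction is a standard way to obtain $G$ and $H$ from a single exponential. The closed form $[G - I_n]A^{-1}B$ is used only as a cross-check and assumes that $A$ is nonsingular.

```python
import numpy as np

def expm(M, terms=20):
    """Matrix exponential by scaling and squaring with a truncated Taylor series."""
    M = np.asarray(M, dtype=float)
    norm = np.linalg.norm(M, 1)
    # Halve the matrix until its 1-norm is at most 0.5 so the series converges fast.
    squarings = max(0, int(np.ceil(np.log2(norm))) + 1) if norm > 0 else 0
    scaled = M / (2.0 ** squarings)
    result = np.eye(M.shape[0])
    term = np.eye(M.shape[0])
    for k in range(1, terms + 1):
        term = term @ scaled / k
        result = result + term
    for _ in range(squarings):
        result = result @ result
    return result

def zoh_discretize(A, B, T):
    """Return (G, H) of (6b) from exp([[A, B], [0, 0]] T) = [[G, H], [0, I]]."""
    n, m = B.shape
    aug = np.zeros((n + m, n + m))
    aug[:n, :n] = A
    aug[:n, n:] = B
    Phi = expm(aug * T)
    return Phi[:n, :n], Phi[:n, n:]

# Hypothetical second-order plant (not the example treated in this paper).
A = np.array([[0.0, 1.0], [-2.0, -3.0]])
B = np.array([[0.0], [1.0]])
T = 0.5

G, H = zoh_discretize(A, B, T)
# Cross-check against the closed form in (6b), valid because this A is nonsingular.
H_closed = (G - np.eye(2)) @ np.linalg.solve(A, B)
```

The augmented-matrix route also works when $A$ is singular, where the closed form is not available.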

Let the discretized state-feedback control law for the system in (4) be

$$u_d(kT) = -K_d x_d(kT) + E_d r(kT) \tag{7}$$

where $K_d$ is an $m \times n$ digital feedback gain, $E_d$ is an $m \times m$ digital forward gain, and
$r(kT)$ is an $m \times 1$ discrete-time reference input. The resulting closed-loop system becomes

$$\dot{x}_d(t) = A x_d(t) - B K_d x_d(kT) + B E_d r(kT); \quad x_d(0) \quad \text{for } kT \le t < (k+1)T \tag{8}$$

Now, the static digital redesign problem reduces to finding the constant digital state-feedback
gain $K_d$ and forward gain $E_d$ in (7) from the continuous state-feedback gain $K_c$
and forward gain $E_c$ in (2) so that the states of the digital model in (8) are approximately
equal to the states of the analog system in (3) for $x_c(0) = x_d(0)$ and the same reference
input.

In Kuo's pioneering work [1], a discrete-state matching method was proposed to solve
the static digital redesign problem and was successfully applied to a simplified one-axis Skylab
satellite system. That work assumes that the continuous-time reference
input $r(t)$ in (2) can be closely approximated by the piecewise-constant input $r(kT)$ in (8),
and that the continuous-time state $x_c(t)$ in (3) can be closely matched by the continuous-time
state $x_d(t)$ in (8) at each sampling instant, $t = kT$, with a sufficiently small sampling
period $T$. The values of $x_c(t)$ in (3) and $x_d(t)$ in (8) between sampling instants are
not considered. In this paper, a new optimal digital redesign technique is
proposed for finding a dynamic digital control law, instead of the static digital control
law shown in (7), from the available analog control law in (2) for a continuous-time
reference input $r(t)$. It is optimal in the sense that the quadratic performance index of
the errors between $x_c(t)$ in (3) and the digitally redesigned state, controlled by a dynamic
digital control law, is minimized.

2.  Optimal  Digital  Redesign 

Consider the dynamic systems described in (3) and (4) with $x_c(0) = x_d(0)$. Let the
quadratic cost function be

$$J = \frac{1}{2} \int_0^{\infty} [x_d(t) - x_c(t)]^T Q \, [x_d(t) - x_c(t)] \, dt \tag{9}$$

where $Q \in \mathbb{R}^{n \times n}$ is a positive definite symmetric weighting matrix, $x_c(t)$ is the state
of the system in (3), and $x_d(t)$ is the state of the system in (4) to be redesigned. It is
desirable to find a dynamic digital control law for the system in (6a) such that $J$ in (9)
is minimized. The above optimization problem is slightly different from an optimal state
tracking problem [2] in the sense that the states of interest in (9) are those of the dynamic
systems in (3) and (4), which involve the same system matrix $A$ and input matrix $B$ with
different input functions.

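An alternative expression of $J$ in (9) is

$$J = \sum_{k=0}^{\infty} \frac{1}{2} \int_{kT}^{kT+T} [x_d(t) - x_c(t)]^T Q \, [x_d(t) - x_c(t)] \, dt = \sum_{k=0}^{\infty} J_k \tag{10a}$$

where

$$J_k = \frac{1}{2} \int_{kT}^{kT+T} [x_d(t) - x_c(t)]^T Q \, [x_d(t) - x_c(t)] \, dt \tag{10b}$$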


Assume  that  the  continuous-time  reference  input  r(t)  in  (3)  can  be  realized  via  zero-input 
state  equations  (an  example  of  this  method  is  shown  in  Appendix  7.1)  as  follows: 


$$\dot{y}_r(t) = A_r y_r(t); \quad y_r(0) \tag{11a}$$

$$r(t) = C_r y_r(t) \tag{11b}$$

where $y_r(t)$ is a $p_r \times 1$ state vector, $r(t)$ in (11b) is an $m \times 1$ output vector (the reference
input of the system in (3)), and $A_r$ and $C_r$ are constant matrices of appropriate dimensions.
Combining the state equations in (3) and (11) leads to


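$$\begin{bmatrix} \dot{x}_c(t) \\ \dot{y}_r(t) \end{bmatrix} = \begin{bmatrix} A_c & B E_c C_r \\ 0 & A_r \end{bmatrix} \begin{bmatrix} x_c(t) \\ y_r(t) \end{bmatrix}; \quad \begin{bmatrix} x_c(0) \\ y_r(0) \end{bmatrix} \in \mathbb{R}^{n_1 \times 1} \tag{12a}$$

or

$$\dot{q}(t) = A_1 q(t); \quad q(0) \tag{12b}$$

where $n_1 = n + p_r$, $A_c = A - BK_c$, and

$$A_1 = \begin{bmatrix} A_c & B E_c C_r \\ 0 & A_r \end{bmatrix} \in \mathbb{R}^{n_1 \times n_1}, \quad q(t) = \begin{bmatrix} x_c(t) \\ y_r(t) \end{bmatrix} \in \mathbb{R}^{n_1 \times 1}$$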


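The solution of the state equation in (12) is given by

$$q(t) = e^{A_1 (t - kT)} q(kT) \quad \text{for } kT \le t < kT + T \tag{13a}$$

The equivalent discrete-time model of the system in (13a) is

$$q(kT + T) = G_1 q(kT); \quad q(0) \tag{13b}$$

where

$$G_1 = e^{A_1 T} = \begin{bmatrix} G_c & H_c \\ 0 & G_r \end{bmatrix} \in \mathbb{R}^{n_1 \times n_1}, \quad q(kT) = \begin{bmatrix} x_c(kT) \\ y_r(kT) \end{bmatrix} \in \mathbb{R}^{n_1 \times 1}$$

with $G_c = e^{A_c T}$, $G_r = e^{A_r T}$, and $H_c = G_c \int_0^T e^{-A_c \lambda} B E_c C_r e^{A_r \lambda} \, d\lambda$. (Note that $H_c$ can
be solved using the method developed in Appendix 7.2.) Also, by combining the dynamic
systems in (4) and (12), we obtain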


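$$\begin{bmatrix} \dot{x}_d(t) \\ \dot{q}(t) \end{bmatrix} = \begin{bmatrix} A & 0 \\ 0 & A_1 \end{bmatrix} \begin{bmatrix} x_d(t) \\ q(t) \end{bmatrix} + \begin{bmatrix} B \\ 0 \end{bmatrix} u_d(kT) \tag{14a}$$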


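The solution of the state equation in (14a) can be obtained by combining the solutions in
(5) and (13a) as

$$\begin{bmatrix} x_d(t) \\ q(t) \end{bmatrix} = \begin{bmatrix} e^{A(t-kT)} & 0 \\ 0 & e^{A_1(t-kT)} \end{bmatrix} \begin{bmatrix} x_d(kT) \\ q(kT) \end{bmatrix} + \begin{bmatrix} \int_{kT}^{t} e^{A(t-\lambda)} B \, d\lambda \\ 0 \end{bmatrix} u_d(kT) \quad \text{for } kT \le t < kT + T \tag{14b}$$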


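For $t = kT + T$, the equivalent discrete-time model of the augmented system in (14)
becomes

$$z(kT + T) = \hat{G} z(kT) + \hat{H} u_d(kT); \quad z(0) \tag{15}$$

where

$$z(kT) = [x_d^T(kT), \; q^T(kT)]^T \in \mathbb{R}^{(n+n_1) \times 1}$$

$$\hat{G} = \text{block diag}[G, G_1] \in \mathbb{R}^{(n+n_1) \times (n+n_1)} \quad \text{with } G = e^{AT} \text{ and } G_1 = e^{A_1 T}$$

$$\hat{H} = [H^T, \; 0]^T \in \mathbb{R}^{(n+n_1) \times m} \quad \text{with } H = \int_0^T e^{A\lambda} B \, d\lambda = [G - I_n] A^{-1} B$$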

Note  that  the  augmented  system  in  (15)  contains  the  reference  subsystem  in  (11),  whereas 
the  cost  function  in  (9)  does  not  include  the  state  of  the  reference  subsystem  in  (11).  To 
include  the  state  of  the  reference  subsystem  in  (11)  into  the  cost  function  in  (9),  we  modify 
the  cost  function  in  (10)  as  follows: 

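$$J_k = \frac{1}{2} \int_{kT}^{kT+T} [x_d^T(t), \; x_c^T(t)] \begin{bmatrix} Q & -Q \\ -Q & Q \end{bmatrix} \begin{bmatrix} x_d(t) \\ x_c(t) \end{bmatrix} dt = \frac{1}{2} \int_{kT}^{kT+T} z^T(t) \bar{Q} z(t) \, dt \tag{16}$$

where

$$z(t) = [x_d^T(t), \; q^T(t)]^T = [x_d^T(t), \; x_c^T(t), \; y_r^T(t)]^T \in \mathbb{R}^{(n+n_1) \times 1}$$

$$\bar{Q} = \begin{bmatrix} Q & -Q & 0 \\ -Q & Q & 0 \\ 0 & 0 & 0 \end{bmatrix} \in \mathbb{R}^{(n+n_1) \times (n+n_1)}$$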

Substituting  (14b)  into  (16)  and  making  some  algebraic  simplifications  results  in 

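$$J_k = \frac{1}{2} z^T(kT) \hat{Q} z(kT) + z^T(kT) M u_d(kT) + \frac{1}{2} u_d^T(kT) R u_d(kT) \tag{17a}$$

where

$$\hat{Q} = \begin{bmatrix} Q_{11} & Q_{12} \\ Q_{12}^T & Q_{22} \end{bmatrix} \in \mathbb{R}^{(n+n_1) \times (n+n_1)}$$

$$Q_{11} = \int_0^T e^{A^T t} Q e^{A t} \, dt \in \mathbb{R}^{n \times n}$$

$$Q_{12} = \int_0^T e^{A^T t} [-Q, \; 0] e^{A_1 t} \, dt \in \mathbb{R}^{n \times n_1}$$

$$Q_{22} = \int_0^T e^{A_1^T t} \begin{bmatrix} Q & 0 \\ 0 & 0 \end{bmatrix} e^{A_1 t} \, dt \in \mathbb{R}^{n_1 \times n_1} \tag{17b}$$

and

$$M = \begin{bmatrix} M_1 \\ M_2 \end{bmatrix} \in \mathbb{R}^{(n+n_1) \times m} \tag{17c}$$

with

$$M_1 = \int_0^T \left\{ e^{A^T t} Q \int_0^t e^{A\lambda} B \, d\lambda \right\} dt \in \mathbb{R}^{n \times m}$$

$$M_2 = \int_0^T \left\{ e^{A_1^T t} [-Q, \; 0]^T \int_0^t e^{A\lambda} B \, d\lambda \right\} dt \in \mathbb{R}^{n_1 \times m}$$

and

$$R = \int_0^T \left\{ \left[ \int_0^t e^{A\lambda} B \, d\lambda \right]^T Q \left[ \int_0^t e^{A\lambda} B \, d\lambda \right] \right\} dt \in \mathbb{R}^{m \times m} \tag{17d}$$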


If the matrices $A$ and $A_1$ satisfy certain conditions (see Appendix 7.2), the weighting
matrices $\hat{Q}$, $M$, and $R$ can be solved from a set of Lyapunov equations. Thus, the quadratic
cost function in (10) can be rewritten as

$$J = \sum_{k=0}^{\infty} \left[ \frac{1}{2} z^T(kT) \hat{Q} z(kT) + z^T(kT) M u_d(kT) + \frac{1}{2} u_d^T(kT) R u_d(kT) \right] \tag{18}$$

Now, we can easily identify that the cost function in (18) and the dynamic equation
in (15) constitute a standard discrete-time optimal regulator problem [1,2]. The optimal
control law is given [1,2] by


$$u_d(kT) = -(R + \hat{H}^T P \hat{H})^{-1} (\hat{H}^T P \hat{G} + M^T) z(kT) \tag{19a}$$

where $P \in \mathbb{R}^{(n+n_1) \times (n+n_1)}$ is the positive definite symmetric solution of the discrete-time
Riccati equation:

$$P = \hat{G}^T P \hat{G} + \hat{Q} - (\hat{G}^T P \hat{H} + M)(R + \hat{H}^T P \hat{H})^{-1} (\hat{G}^T P \hat{H} + M)^T \tag{19b}$$

Since the augmented system in (15) is not completely controllable, it is not always possible
to find a positive semidefinite symmetric matrix $P$ from (19b). However, for a stable
subsystem matrix $G_1$, there exists a positive definite symmetric matrix $P$ [3], which can be
solved as follows.

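Define the matrix $P$ as

$$P = \begin{bmatrix} P_{11} & P_{12} \\ P_{12}^T & P_{22} \end{bmatrix} \tag{20a}$$

where $P_{11} \in \mathbb{R}^{n \times n}$, $P_{12} \in \mathbb{R}^{n \times n_1}$, and $P_{22} \in \mathbb{R}^{n_1 \times n_1}$. The Riccati equation in (19b) can
be partitioned into separate equations for $P_{11}$, $P_{12}$, and $P_{22}$:

$$P_{11} = G^T P_{11} G + Q_{11} - (G^T P_{11} H + M_1)(R + H^T P_{11} H)^{-1} (G^T P_{11} H + M_1)^T \tag{20b}$$

$$P_{12} = G^T P_{12} G_1 + Q_{12} - (G^T P_{11} H + M_1)(R + H^T P_{11} H)^{-1} (G_1^T P_{12}^T H + M_2)^T \tag{20c}$$

$$P_{22} = G_1^T P_{22} G_1 + Q_{22} - (G_1^T P_{12}^T H + M_2)(R + H^T P_{11} H)^{-1} (G_1^T P_{12}^T H + M_2)^T \tag{20d}$$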

Equation (20b) is a discrete-time algebraic Riccati equation and can be solved via the eigenvalue-eigenvector
approach [4] or the sign algorithm [5]. Once $P_{11}$ has been found, it is substituted
into (20c), which can be rearranged into the following Lyapunov equation:

$$[G - H(R + H^T P_{11} H)^{-1} (H^T P_{11} G + M_1^T)]^T P_{12} - P_{12} G_1^{-1} + [Q_{12} - (G^T P_{11} H + M_1)(R + H^T P_{11} H)^{-1} M_2^T] G_1^{-1} = 0 \tag{21}$$
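The rearrangement from (20c) to (21) can be verified in two steps. In the sketch below, $S$ is a shorthand introduced here only for illustration; it is not a symbol used elsewhere in the paper. The second step uses the fact that $G_1 = e^{A_1 T}$ is always invertible.

```latex
% Shorthand (assumption for this note only): S = R + H^T P_{11} H, symmetric.
\begin{align*}
P_{12} &= G^T P_{12} G_1 + Q_{12}
          - (G^T P_{11} H + M_1) S^{-1} (H^T P_{12} G_1 + M_2^T) \\
       &= \bigl[ G - H S^{-1} (H^T P_{11} G + M_1^T) \bigr]^T P_{12} G_1
          + Q_{12} - (G^T P_{11} H + M_1) S^{-1} M_2^T .
\end{align*}
% Postmultiplying by G_1^{-1} and moving all terms to one side gives (21):
\begin{equation*}
\bigl[ G - H S^{-1} (H^T P_{11} G + M_1^T) \bigr]^T P_{12} - P_{12} G_1^{-1}
+ \bigl[ Q_{12} - (G^T P_{11} H + M_1) S^{-1} M_2^T \bigr] G_1^{-1} = 0 .
\end{equation*}
```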

Equation (21) can be solved via a matrix direct-product method [6]. The desired optimal
digital control law in (19a) becomes

$$u_d(kT) = -K_d x_d(kT) + K_q q(kT) \tag{22}$$

where

$$K_d = (R + H^T P_{11} H)^{-1} (H^T P_{11} G + M_1^T)$$

$$K_q = -(R + H^T P_{11} H)^{-1} (H^T P_{12} G_1 + M_2^T)$$
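As an illustration of how $P_{11}$ and $K_d$ might be computed, the sketch below uses a hypothetical unstable second-order plant (not the example of Section 3), builds $Q_{11}$, $M_1$, and $R$ from the closed forms of Appendix 7.2, and then iterates (20b) to a fixed point. The fixed-point iteration is a simple substitute chosen for brevity; it is not the eigenvalue-eigenvector or sign-algorithm method cited in the text, and `expm` is a small stand-in for a library matrix exponential. The reference-dependent part ($P_{12}$, $K_q$) is omitted.

```python
import numpy as np

def expm(M, terms=20):
    """Matrix exponential by scaling and squaring with a truncated Taylor series."""
    M = np.asarray(M, dtype=float)
    norm = np.linalg.norm(M, 1)
    squarings = max(0, int(np.ceil(np.log2(norm))) + 1) if norm > 0 else 0
    scaled = M / (2.0 ** squarings)
    result = np.eye(M.shape[0])
    term = np.eye(M.shape[0])
    for k in range(1, terms + 1):
        term = term @ scaled / k
        result = result + term
    for _ in range(squarings):
        result = result @ result
    return result

def solve_lyapunov_kron(A, C):
    """Solve A^T X + X A = C with the Kronecker (direct-product) method."""
    n = A.shape[0]
    eye = np.eye(n)
    K = np.kron(eye, A.T) + np.kron(A.T, eye)
    return np.linalg.solve(K, C.flatten(order="F")).reshape((n, n), order="F")

# Hypothetical plant with eigenvalues 1 and -2 (nonsingular A), Q = I, T = 0.5.
A = np.array([[0.0, 1.0], [2.0, -1.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
T = 0.5
I_n = np.eye(2)

G = expm(A * T)
A_inv = np.linalg.inv(A)
H = (G - I_n) @ A_inv @ B                              # (6b)
Q11 = solve_lyapunov_kron(A, G.T @ Q @ G - Q)          # (A.6)
M1 = np.linalg.solve(A.T, G.T @ Q @ H - Q11 @ B)       # (A.12)
W = A_inv @ B
R = W.T @ (Q11 - (G - I_n).T @ A_inv.T @ Q
           - Q @ (G - I_n) @ A_inv + Q * T) @ W        # (A.15)

def riccati_step(P):
    """One application of the right-hand side of (20b)."""
    S = R + H.T @ P @ H
    L = G.T @ P @ H + M1
    return G.T @ P @ G + Q11 - L @ np.linalg.solve(S, L.T)

P11 = np.zeros((2, 2))
for _ in range(5000):
    P_next = riccati_step(P11)
    converged = np.max(np.abs(P_next - P11)) < 1e-12
    P11 = P_next
    if converged:
        break

K_d = np.linalg.solve(R + H.T @ P11 @ H, H.T @ P11 @ G + M1.T)
residual = np.max(np.abs(riccati_step(P11) - P11))
closed_loop_radius = np.max(np.abs(np.linalg.eigvals(G - H @ K_d)))
```

For larger or poorly conditioned problems, a dedicated Riccati solver would be the more reliable choice than this plain iteration.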

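Because $q(kT)$ in (22) is generated from the dynamic system in (12) or (13), i.e.,

$$q(kT + T) = \begin{bmatrix} x_c(kT + T) \\ y_r(kT + T) \end{bmatrix} = \begin{bmatrix} G_c & H_c \\ 0 & G_r \end{bmatrix} \begin{bmatrix} x_c(kT) \\ y_r(kT) \end{bmatrix}; \quad \begin{bmatrix} x_c(0) \\ y_r(0) \end{bmatrix} \tag{23}$$

we decompose the dynamic gain $K_q$ in (22) as $K_q = [\hat{K}_c, \; K_r]$, where $\hat{K}_c \in \mathbb{R}^{m \times n}$ (a partition of $K_q$, distinct from the analog gain $K_c$ in (2)) and
$K_r \in \mathbb{R}^{m \times p_r}$. Hence the desired optimal dynamic digital control law in (22) can be
rewritten as

$$u_d(kT) = -K_d x_d(kT) + \hat{K}_c x_c(kT) + K_r y_r(kT) \tag{24a}$$

where

$$x_c(kT) = G_c^k x_d(0) + \sum_{i=0}^{k-1} G_c^{k-i-1} H_c \, y_r(iT) \tag{24b}$$

and

$$y_r(kT) = G_r^k y_r(0) \tag{24c}$$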
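The closed-form expressions for $x_c(kT)$ and $y_r(kT)$ follow from unrolling the recursion in (23) with $x_c(0) = x_d(0)$. The short check below illustrates this with small hypothetical matrices; the numerical values are assumptions chosen only to exercise the algebra and have no connection to the example in Section 3.

```python
import numpy as np

# Hypothetical discrete-time blocks of G_1 in (13b): n = 2, p_r = 1.
G_c = np.array([[0.5, 0.1], [0.0, 0.3]])
H_c = np.array([[0.2], [0.1]])
G_r = np.array([[1.0]])
x0 = np.array([1.0, -1.0])   # x_c(0) = x_d(0)
y0 = np.array([1.0])         # y_r(0)

def y_r(i):
    """Reference state at sample i from the closed form y_r(iT) = G_r^i y_r(0)."""
    return np.linalg.matrix_power(G_r, i) @ y0

def x_c_closed_form(k):
    """x_c(kT) = G_c^k x_d(0) + sum_{i=0}^{k-1} G_c^(k-i-1) H_c y_r(iT)."""
    x = np.linalg.matrix_power(G_c, k) @ x0
    for i in range(k):
        x = x + np.linalg.matrix_power(G_c, k - i - 1) @ H_c @ y_r(i)
    return x

def x_c_recursive(k):
    """x_c(kT) obtained by stepping the recursion in (23) k times."""
    x, y = x0.copy(), y0.copy()
    for _ in range(k):
        x, y = G_c @ x + H_c @ y, G_r @ y
    return x

pairs = [(x_c_closed_form(k), x_c_recursive(k)) for k in range(8)]
```

In an implementation, the recursive form is the natural choice, since it needs only one matrix-vector update per sample.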


Thus, the digital redesigned system via the optimal dynamic digital controller in (24)
becomes

$$\dot{x}_d(t) = A x_d(t) - B K_d x_d(kT) + B \hat{K}_c x_c(kT) + B K_r y_r(kT); \quad x_d(0) \tag{25}$$


The  digital  redesigned  closed-loop  system  is  shown  in  Fig.  1. 

If $y_r(t)$ in (11) is measurable, or the initial vector $y_r(0)$ is available, the control law in
(24a) can be realized via a microcomputer. However, in practice, it is quite possible that
only an incoming signal $r(t)$ is available. In this case, an estimator can be constructed
with $r(t)$ as an input and the estimated state $\hat{y}_r(t)$ of $y_r(t)$ as an output [2], provided that
the pair $[A_r, C_r]$ is observable.

When $r(t)$ in (11) is a step function, $C_r = I_m$, $y_r(t) = r(t)$, and $y_r(kT) = r(kT)$.
The optimal dynamic digital control law in (24) reduces to

$$u_d(kT) = -K_d x_d(kT) + \hat{K}_c x_c(kT) + K_r r(kT) = -K_d x_d(kT) + \hat{K}_c G_c^k x_d(0) + \left[ \hat{K}_c \sum_{i=0}^{k-1} G_c^{k-i-1} H_c + K_r \right] r(kT) \tag{26}$$


3.  Illustrative  Example 

Consider an unstable system in (1) with

Using the optimal pole-placement method proposed in [7], the optimal state feedback
gain $K_c$ in (2) is found as

$$K_c = \begin{bmatrix} 7.871 & -0.563 & 3.255 & -0.137 & 0.754 \\ 1.625 & -1.247 & 1.297 & -1.003 & 0.182 \end{bmatrix} \tag{28}$$

Utilizing the feedback gain $K_c$, the eigenvalues of the closed-loop system in (3) are placed
within the common region of an open sector (with a sector angle of ±45° from the negative
real axis) and the left-hand side of a $-1.1$ vertical line on the negative real axis in the
complex $s$-plane, and $\sigma(A - BK_c) = \{-4.6789 \pm j4.6518, \; -1.8983 \pm j1.898, \; -8.0\}$.

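Assume $E_c = I_2$ in (2), and let the reference input $r(t)$ in (2) contain a sine
function $\sin(\omega t)$ with an angular frequency $\omega = 3.0$ and a unit-step function, that is

$$r(t) = \begin{bmatrix} r_1(t) \\ r_2(t) \end{bmatrix} = \begin{bmatrix} \sin(3.0t) \\ 1 \end{bmatrix}, \quad t \ge 0 \tag{29}$$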


The reference input $r(t)$ can be represented by a zero-input state equation in (11) with

$$A_r = \begin{bmatrix} 0.0 & 3.0 & 0.0 \\ -3.0 & 0.0 & 0.0 \\ 0.0 & 0.0 & 0.0 \end{bmatrix}$$

and an initial vector $y_r(0) = [0.0 \;\; 1.0 \;\; 1.0]^T$.
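The zero-input realization of the reference can be checked by sampling it. The sketch below is illustrative only: the output matrix `C_r` is an assumption (it is the selection of the first and third states that reproduces $r(t) = [\sin(3.0t), \; 1]^T$ from the stated initial vector), and $G_r = e^{A_r T}$ is written in closed form as a rotation block for the oscillator plus a unit entry for the step.

```python
import math
import numpy as np

omega, T = 3.0, 0.5

# Generator of the reference: a harmonic oscillator at omega plus an integrator at rest.
A_r = np.array([[0.0, omega, 0.0], [-omega, 0.0, 0.0], [0.0, 0.0, 0.0]])
# Assumed output matrix: picks sin(omega t) (state 1) and the constant (state 3).
C_r = np.array([[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]])
y_r0 = np.array([0.0, 1.0, 1.0])

# Closed form of G_r = exp(A_r T) for this block structure.
c, s = math.cos(omega * T), math.sin(omega * T)
G_r = np.array([[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]])

def reference_samples(num_samples):
    """Return r(kT) = C_r G_r^k y_r(0) for k = 0, ..., num_samples - 1."""
    y = y_r0.copy()
    samples = []
    for _ in range(num_samples):
        samples.append(C_r @ y)
        y = G_r @ y
    return np.array(samples)

samples = reference_samples(10)
expected = np.array([[math.sin(omega * k * T), 1.0] for k in range(10)])
```

With these choices the state is $y_r(t) = [\sin(3.0t), \; \cos(3.0t), \; 1]^T$, so only the initial vector has to be supplied to the digital controller.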

Using the method proposed in this paper, we obtain the dynamic digital control law
in (24) with a sampling period $T = 0.5$ sec as

$$u_d(kT) = -K_d x_d(kT) + \hat{K}_c x_c(kT) + K_r y_r(kT)$$

where
The simulation results of the closed-loop systems in (3) and (25) are shown in Fig. 2
for both $x_c(t)$ in (3) and $x_d(t)$ in (25), and those of the controls $u_c(t)$ in (2) and $u_d(t)$ in
(24) are shown in Fig. 3. The simulation results show that $x_d(t)$ is very close
to $x_c(t)$ even with a rather large sampling period (considering the dynamics of the given
system and the frequency of the reference input).
4. Conclusion

A new optimal digital redesign technique has been developed for finding a dynamic
digital control law in (24) from the available analog counterpart in (2) while simultaneously
minimizing a quadratic performance index in (9). First, an augmented system in (14a),
which consists of a reference model in (11), the original closed-loop system in (3), and the
digitally controlled original open-loop system in (4a), is constructed and converted into
a discrete-time dynamic system in (15) to be designed. Next, a quadratic performance
index in (18) is established from that in (9) using a set of weighting matrices in (17).
A set of Lyapunov equations has been developed in Appendix 7.2 to find the weighting
matrices for the quadratic performance index. Then, a standard discrete-time optimal
regulator in (22) is determined by solving a small-dimensional Riccati equation in (20b)
and a Lyapunov equation in (21). Finally, the desired digital state-feedback gain and
forward gain in (24a) can be computed from those in (22), (24b) and (24c). An illustrative
example has been presented to demonstrate the effectiveness of the proposed method. The
developed dynamic digital redesigned control law enables an optimally close matching of
the states of the digital redesigned closed-loop system to the states of the
original closed-loop system, and it can be implemented via low-cost microcomputers. The
proposed technique can be applied to a system with a more general class of reference inputs
and a relatively large sampling period.

5.  Acknowledgements 

This  work  was  supported  in  part  by  the  U.S.  Army  Research  Office  under  contract 
DAAL-03-87-K-0001. 

6. References

[1] KUO, B.C.: “Digital Control Systems” (Holt, Rinehart and Winston, 1980).

[2] ANDERSON, B.D.O., and Moore, J.B.: “Linear Optimal Control” (Prentice-Hall, Englewood
Cliffs, New Jersey, 1971).

[3] DRESSLER, R.M., and Larson, R.E.: “Computation of Optimal Control in Partially
Controlled Linear Systems”, Proceedings of the 1968 Joint Automatic Control Conference,
1968, pp. 711-715.

[4] PATEL, R.V., and Munro, N.: “Multivariable System Theory and Design” (Pergamon
Press, 1982).

[5] SHIEH, L.S., Wang, C.T., and Tsay, Y.T.: “Fast Suboptimal State-space Self-tuner for
Linear Stochastic Multivariable Systems”, IEE Proc. D, Control Theory and Appl.,
1983, (4), pp. 143-154.

[6] BELLMAN, R.: “Introduction to Matrix Analysis” (McGraw-Hill, 1970).

[7] SHIEH, L.S., Dib, H.M., and Ganesan, S.: “Continuous-time Quadratic Regulators
and Pseudo-continuous-time Quadratic Regulators with Pole Placement in a Specific
Region”, IEE Proc. D, 1987, (5), pp. 338-346.

[8] SHIEH, L.S., Chen, C.F., and Huang, C.J.: “On Identifying Transfer Functions and
State Equations for Linear Systems”, IEEE Trans. on Aerospace and Electronic
Systems, 1972, AES-8, pp. 811-820.

7.  Appendices 
Appendix  7.1 

Let an $m \times 1$ output rational function $R(s)$, which is the product of a transfer function
matrix and an input function, be represented by an irreducible left matrix fraction
description [4] as

$$R(s) = (I_m s^p + D_1 s^{p-1} + \cdots + D_p)^{-1} [N_1 s^{p-1} + \cdots + N_p] \tag{A.1}$$

where $D_i \in \mathbb{R}^{m \times m}$ and $N_i \in \mathbb{R}^{m \times m}$ for $i = 1, 2, \ldots, p$.

The left matrix fraction description can be realized by the following zero-input state
equation:

$$\dot{y}_r(t) = A_r y_r(t); \quad y_r(0) \tag{A.2}$$

$$r(t) = C_r y_r(t) \tag{A.3}$$

where $y_r(t) \in \mathbb{R}^{pm \times 1}$ and $r(t) \in \mathbb{R}^{m \times 1}$.

The above result can be obtained by following the method shown in [8].
Appendix 7.2

Some useful formulas for computing the matrices $Q_{11}$, $Q_{12}$, $M_1$, $M_2$, and $R$ are given
as follows.

Let the matrix $Q_{11}$ be defined as

$$Q_{11} \triangleq \int_0^T e^{A^T t} Q e^{A t} \, dt = \int_0^T F_Q(t) \, dt \tag{A.4a}$$

where

$$F_Q(t) = e^{A^T t} Q e^{A t} \tag{A.4b}$$

Taking the derivative of (A.4b) with respect to $t$ gives

$$\dot{F}_Q(t) = A^T e^{A^T t} Q e^{A t} + e^{A^T t} Q e^{A t} A = A^T F_Q(t) + F_Q(t) A \tag{A.5}$$

Integrating (A.5) on both sides from 0 to $T$ yields

$$\int_0^T \dot{F}_Q(t) \, dt = F_Q(t) \Big|_0^T = e^{A^T T} Q e^{A T} - Q = A^T \int_0^T F_Q(t) \, dt + \int_0^T F_Q(t) \, dt \, A$$

Thus

$$A^T Q_{11} + Q_{11} A = G^T Q G - Q \tag{A.6}$$

where $G = e^{AT}$.

When $A$ is nonsingular, the unique solution $Q_{11}$ can be solved from the Lyapunov
equation in (A.6) via the matrix direct-product method [6].
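A direct-product (Kronecker) solution of (A.6) can be sketched as follows and compared with a brute-force quadrature of (A.4a). The plant is hypothetical, `expm` is a small stand-in for a library matrix exponential, and the Simpson rule is used only as an independent check. The linear system below is uniquely solvable when no two eigenvalues of $A$ sum to zero, which holds for the matrix chosen here.

```python
import numpy as np

def expm(M, terms=20):
    """Matrix exponential by scaling and squaring with a truncated Taylor series."""
    M = np.asarray(M, dtype=float)
    norm = np.linalg.norm(M, 1)
    squarings = max(0, int(np.ceil(np.log2(norm))) + 1) if norm > 0 else 0
    scaled = M / (2.0 ** squarings)
    result = np.eye(M.shape[0])
    term = np.eye(M.shape[0])
    for k in range(1, terms + 1):
        term = term @ scaled / k
        result = result + term
    for _ in range(squarings):
        result = result @ result
    return result

def q11_direct_product(A, Q, T):
    """Solve A^T X + X A = G^T Q G - Q using vec(X) and Kronecker products."""
    n = A.shape[0]
    eye = np.eye(n)
    G = expm(A * T)
    rhs = G.T @ Q @ G - Q
    # With column-stacking vec: vec(A^T X) = (I kron A^T) vec(X), vec(X A) = (A^T kron I) vec(X).
    K = np.kron(eye, A.T) + np.kron(A.T, eye)
    return np.linalg.solve(K, rhs.flatten(order="F")).reshape((n, n), order="F")

def q11_simpson(A, Q, T, steps=200):
    """Composite Simpson approximation of the integral in (A.4a); steps must be even."""
    h = T / steps
    total = np.zeros_like(Q)
    for i in range(steps + 1):
        E = expm(A * (i * h))
        weight = 1.0 if i in (0, steps) else (4.0 if i % 2 else 2.0)
        total += weight * (E.T @ Q @ E)
    return total * h / 3.0

# Hypothetical stable plant with eigenvalues -1 and -2.
A = np.array([[0.0, 1.0], [-2.0, -3.0]])
Q = np.eye(2)
T = 0.5

Q11 = q11_direct_product(A, Q, T)
Q11_check = q11_simpson(A, Q, T)
```

The same routine, with $A_1$ in the second Kronecker term and the appropriate right-hand side, would apply to the cross-weighting equation for $Q_{12}$.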

Let $Q_{12}$ be defined as

$$Q_{12} = \int_0^T e^{A^T t} \tilde{Q} e^{A_1 t} \, dt \tag{A.7}$$

where $\tilde{Q} = [-Q, \; 0] \in \mathbb{R}^{n \times n_1}$. Similar to the above derivation, if $A$ and $A_1$ are nonsingular
and $\sigma_i(A) + \sigma_j(A_1) \ne 0$ for all $i, j$, where $\sigma(\cdot)$ denotes the eigenspectrum of $(\cdot)$, then the
unique solution $Q_{12}$ can be obtained from the following Lyapunov equation:

$$A^T Q_{12} + Q_{12} A_1 = G^T \tilde{Q} G_1 - \tilde{Q} \tag{A.8}$$

where $G = e^{AT}$ and $G_1 = e^{A_1 T}$.

Let $M_1$ be defined as

$$M_1 = \int_0^T F_M(t) \, dt \tag{A.9a}$$

where

$$F_M(t) = e^{A^T t} Q \int_0^t e^{A\lambda} B \, d\lambda \tag{A.9b}$$

Carrying out the differentiation of (A.9b) with respect to $t$, we obtain

$$\dot{F}_M(t) = A^T e^{A^T t} Q \int_0^t e^{A\lambda} B \, d\lambda + e^{A^T t} Q e^{A t} B = A^T F_M(t) + F_Q(t) B \tag{A.10}$$

Integrating both sides of (A.10) from 0 to $T$ gives

$$F_M(t) \Big|_0^T = e^{A^T T} Q \int_0^T e^{A\lambda} B \, d\lambda = G^T Q H = A^T \int_0^T F_M(t) \, dt + \int_0^T F_Q(t) \, dt \, B = A^T M_1 + Q_{11} B$$

where $H = \int_0^T e^{A\lambda} B \, d\lambda$. Thus,

$$A^T M_1 + Q_{11} B = G^T Q H \tag{A.11}$$

If $A$ is nonsingular, then

$$M_1 = (A^T)^{-1} [G^T Q H - Q_{11} B] \tag{A.12}$$

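Define

$$M_2 = \int_0^T \left\{ e^{A_1^T t} \tilde{Q}^T \left[ \int_0^t e^{A\lambda} B \, d\lambda \right] \right\} dt$$

where $\tilde{Q} = [-Q, \; 0]$. Similar to the derivations of (A.9) through (A.12), if $A_1$ is nonsingular, $M_2$ can be found
as

$$M_2 = (A_1^T)^{-1} [G_1^T \tilde{Q}^T H - Q_{12}^T B]$$

Let $R$ be defined as

$$R = \int_0^T \left\{ \left[ \int_0^t e^{A\lambda} B \, d\lambda \right]^T Q \left[ \int_0^t e^{A\lambda} B \, d\lambda \right] \right\} dt \tag{A.13}$$

If $A$ is nonsingular, then

$$\int_0^t e^{A\lambda} B \, d\lambda = [e^{A t} - I_n] A^{-1} B \tag{A.14}$$

Substituting (A.14) into (A.13), we have

$$R = (A^{-1} B)^T \int_0^T [e^{A^T t} Q e^{A t} - e^{A^T t} Q - Q e^{A t} + Q] \, dt \, (A^{-1} B) = (A^{-1} B)^T [Q_{11} - (G - I_n)^T (A^T)^{-1} Q - Q (G - I_n) A^{-1} + Q T] (A^{-1} B) \tag{A.15}$$

If $A$ and/or $A_1$ are singular, the matrices $Q_{11}$, $Q_{12}$, $M_1$, $M_2$, and $R$ can be computed
by any numerical integration method.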
Fig. 1 The digital redesigned closed-loop system.

Fig. 2 The responses of the original closed-loop system in (3) and the
digital redesigned closed-loop system in (25).
ABSTRACT

A methodology is developed to assess the effectiveness of a class
of smart munition systems. Following a formal analysis of the aggregate
problem and the characterization of the elements of the system, a
prototype mathematical model is formulated. This model describes the
temporal operation of the weapon system in the battlefield in the
presence of threats and countermeasures. Simultaneously, it captures
the uncertainty element typically arising in such problems. The
solution results derived may be used to develop cost-free and cost-related
measures of effectiveness to evaluate and select smart munitions
weapon systems.


I. INTRODUCTION

Over the last decade, various DOD agencies have been involved in
developing a family of weapon systems known as smart munitions (SM)
which could significantly enhance the U.S. capability in the
battlefield, while simultaneously improving mission survivability.

SMs have the autonomous capability to search, detect, acquire and
engage targets. They can be delivered by a variety of means, such as
rockets, guns, and dispensers, in large quantities over large
arrays of land-mobile targets. They can simultaneously engage multiple
targets and be accurately delivered on selected targets without
requiring an operator in the loop. The development of a methodology to
assess the effectiveness of this new weapon system in the battlefield,
while incorporating threats and countermeasures, becomes an important
problem to be analyzed and studied.


II.  OBJECTIVES  OF  THE  RESEARCH  EFFORT 

As  part  of  an  ongoing  program,  the  Analysis  Division  of  the  Air 
Force  Armament  Laboratory  at  Eglin  AFB  has  requested  its  Technology 
Assessment  Branch  to  provide  an  assessment  for  a  Smart  Submunition 
Technology  program. 

The objective of this program is to integrate advanced technologies
for the next generation of smart submunitions (SM). The technologies
would include advanced sensors, warheads, and maneuvering capabilities,
compatible with advanced aircraft and dispenser delivery systems, and
capable of providing substantial increases in effectiveness over current
weapons against ground mobile targets. The target set being considered
for the program spans the spectrum of ground mobile targets, and
includes heavy armor, softer vehicles ranging from air defense targets
to light armor, and rail transport.

The  technology  assessment  necessitated  the  development  of  an 
appropriate  methodology  for  evaluating  the  proposed  system.  As  a 
result,  the  following  tasks  had  to  be  undertaken: 

1.  Study  the  operational  characteristics  of  the  Smart  Submunition 
Weapon  Systems; 

2.  Analyze  the  system  by  identifying  the  various  components  in  the 
operation  of  the  system  as  well  as  the  targets; 

3.  Characterize  each  component.  Develop  appropriate  descriptive 
parameters  which  may  be  used  as  inputs  in  an  effectiveness  model; 

4. Formulate a mathematical model which describes the operation of the
weapon systems under combat conditions by incorporating threats and
by capturing the stochastic nature of the problem;

5. Provide a systematic methodology to solve the mathematical model;

6. Develop appropriate measures of effectiveness.

A brief review of items (1), (2) and (3) is provided. The emphasis
of the present report is

1) to formulate a stochastic model and to provide a method of
solution to a prototype model describing the operations of the weapon
system;

2) to provide selected measures of effectiveness of the operation
of the weapon system.


III. THE SMART SUBMUNITION WEAPON SYSTEMS (SSWS)

The effectiveness of an SSWS depends on the assumption
that the ultimate engagement in the battlefield is "many-on-many". A
number of smart submunitions (SS) are delivered in the proximity of an
area where several targets are located, such as tanks in a tank company.
Through its sensors, each submunition is capable of locating, detecting,
acquiring and engaging a target of a given type. The sensors have the
capability of identifying the kind of targets (e.g. tanks or APCs) that
they are to engage. In general, the design of an SS is governed by the
environment within which it will be deployed and the characteristics of
the targets. As many factors as possible are accounted for in
developing the configuration of the SS. Even then, the effectiveness of
the SS may be enhanced or degraded depending upon the mode by which the
weapons are delivered in the vicinity of the target. For example, a
parachute-suspended SS is typically not highly maneuverable and uses a
small search footprint. As such, it would not be very effective against
moving targets. A better design would be a parafoil-suspended SS or an
inflatable-wing SS, both of which are highly maneuverable and have a
larger footprint. They have the capability of guiding the SS more
rapidly towards the target, thus being more effective against moving
targets. In addition, a larger footprint increases the probability of
acquiring a target.

For the engagement to be successful, it is necessary to deliver a
large number of weapons over a given area, and to provide each weapon
with the capability to search for and locate a target with a high
probability of success. This requires that a large number of targets be
available within the footprint of the delivery weapon. The larger the
footprint, the more likely the weapon will acquire a target during its
search, assuming the same target density in the area. Once targets are acquired,
the discriminating sensors carried by the weapon will identify them,
select the ones to be attacked and thus pair each munition to a
target of a kind. An obvious disadvantage of this system is the
likelihood of more than one weapon attacking one particular target
unless specific algorithms are built into the weapon system to preclude
such situations. A second disadvantage is that in order to be
effective, the SS must be placed in the vicinity of the target by an SS
delivery system. Finally, in order to acquire the target and to guide precisely
towards it, it becomes necessary to slow down the search and
acquisition process. These disadvantages do not appear in most of the
existing weapons involving one-on-one engagement, in which the weapon is
delivered from a much longer distance at a very high speed.

So far, of these three disadvantages, a solution has been found
only to the second one, in which an unmanned carrier or dispenser
launched from a platform at a standoff position is used to place the
weapons in the vicinity of the target. This, however, requires the use of
a data link system which provides the platform with the necessary
information about target area coordinates and target movement so that
the carrier is launched and directed towards the vicinity of the target
area. Additional information may have to be continuously provided to
the dispenser regarding target location and the specific time at which
weapons are to be released. Aerial and ground sensors are typical means
to collect information on target location and movement. Aerial sensors
may be in the form of remotely piloted vehicles (RPV), unmanned air
vehicles (UAV), or AWACS. A combination of aerial and ground sensors may
be used. The information provided is transmitted to a C³I post which
then relays it to the launch platform. The launch platform's function is
to transport the dispenser and to utilize the relayed information from
the C³I post to aim and launch the dispenser from an appropriate
location at a given time. The dispenser transports the SS subpacks to a
given location and drops them so as to create a dispersal pattern which
results in the best engagement opportunities for the SS. It must be
noted that once the SS is dispensed, it depends solely on its own
seekers and sensors to guide it terminally towards the target.

In  detailing  the  effectiveness  of  the  aggregate  weapon  system  in 
the  context  of  the  mission  it  is  supposed  to  be  performing,  one  may  not 
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neglect  the  contribution  of  the  intelligence  gathering  system  used  in 

3 

support  of  the  mission  as  well  as  the  contribution  of  C  .  The 
reliability  of  the  mission  is  as  good  as  the  reliability  of  its 

3 

components  and  Cl  must  be  considered  as  an  integral  part  of  the  overall 
system. 

When developing the appropriate equations to compute the overall
mission reliability as a function of time, it may be assumed that the
system is made up of two subsystems. The first subsystem consists of
the C³I components providing the data link. The second subsystem
consists of the smart submunition weapon system (SSWS) whose components
are the platform, the dispenser, the parafoil and the submunition.
Assuming independence of operation of these two subsystems, the overall
mission reliability is the product of the reliability of the first
subsystem and the second subsystem. Mission reliability is sometimes
used as one of the measures of effectiveness.
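
The product rule above can be sketched in a few lines. This is only an illustration: the exponential reliability law, the failure rates and the function names are assumptions made for the example and do not come from the paper, which only states that the two subsystem reliabilities are multiplied under independence.

```python
import math


def exponential_reliability(failure_rate: float, t: float) -> float:
    """Reliability at time t for an assumed constant failure rate."""
    return math.exp(-failure_rate * t)


def mission_reliability(r_c3i: float, r_ssws: float) -> float:
    """Overall mission reliability: product of the two subsystem
    reliabilities, valid only if the subsystems operate independently."""
    return r_c3i * r_ssws


# Hypothetical failure rates (per minute), chosen only for illustration.
t = 10.0
r1 = exponential_reliability(0.01, t)   # C3I data link subsystem
r2 = exponential_reliability(0.03, t)   # SSWS subsystem
overall = mission_reliability(r1, r2)
```

Because each factor is at most one, the overall value can never exceed the reliability of the weaker subsystem.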

An important consideration at this stage is that time becomes a
parameter to be accounted for in the development of
appropriate measures of effectiveness. Time is insignificant when
studying the mission performance of traditional weapons since they rely
on their high speed for delivery and on the element of surprise when
attacking targets. However, in the case of SSWS, weapon delivery time
is much longer. This in turn eliminates the element of surprise and
gives the enemy significantly more time to react to the attacking
weapon system. Thus the enemy will have increased capability to perform
such actions as:

- maneuvering out of the range of incoming weapons;

- visually acquiring and destroying the guiding vehicle of the
weapon;

- initiating countermeasures to minimize or eliminate the
effectiveness of the weapon;

- securing positions by scattering or scrambling so as to decrease
the target density in the area of attack.

Typical  components  of  the  SSWS  are: 

1. The air platform;

2. The dispenser;

3. The parafoil;

4. The submunition warhead;

5. The submunition sensors.

Each  of  these  components  is  characterized  next.  Following  some 
remarks  concerning  the  operational  effectiveness  of  the  weapon  system, 
the  target  element  is  characterized. 

1.  The  Air  Platform 

In  the  delivery  of  smart  submunitions,  it  is  envisioned  that  an 
aircraft  will  carry  a  number  of  dispensers,  each  loaded  with  several 
subpacks  of  submunitions.  From  a  standoff  position,  and  following  a 
process  of  target  area  acquisition  and  location,  the  aircraft  will  fire 
the  dispensers  either  in  salvo  or  in  sequence  so  that  each  dispenser  is 
capable  of  maneuvering  towards  the  target  area. 

Descriptive  Parameters 

-  Range  and  speed; 

- Average time to release dispenser from the moment the aircraft enters
enemy territory;

- Intensity of threat encounter;

- Probability of aircraft being killed;

- Intensity of electronic jamming encounter;

- Probability of aircraft's communication and data link being jammed and
losing its mission capability.

2. The Dispenser

The dispenser is a container capable of self-propulsion. It can
either be preprogrammed to move from its launch platform towards the
target area or it can be directed towards the area through a data link.
Alternatively, it is conceivable that through its own sensors it has its
own capability of homing towards the target area (autopilot). Once
within the vicinity of the target area, it releases the submunition
subpacks either in salvo or in sequence through an intervalometer
setting. The release time of the subpacks, measured from the time of the
dispenser's release from the air platform, is a variable and is
constrained by the range of the dispenser. Once the subpacks are
released, the dispenser, not being programmed to be recovered, is
allowed to crash on landing and/or self-destruct.

The choice of the dispenser depends on the mission to be
accomplished. This will dictate its capacity, load, speed, range and
other characteristics. For example, a dispenser with a longer range is
desired if the weapons have to be carried from the air platform in a
distant position to second echelon supply lines in an interdiction role.
In addition the dispenser may have to generate a larger footprint.

Descriptive Parameters

- Range and speed;

- Average time from instant of dispenser release from air platform to
instant of subpack releases;

- Intensity of threat encounter;

- Probability of dispenser being killed;

- Intensity of electronic jamming encounter;

- Probability of dispenser's data link being jammed and losing its
mission capability.

3. The Parafoil

The parafoil is the element used for the indirect delivery of the
submunition over a widely dispersed area of target elements. It is a
wide-area target search and control system with its own guidance. It
conducts the search for a target over a relatively large footprint and,
once the target is acquired by the submunition sensors, it controls and
directs its motion towards the target so as to achieve a range from
which the warhead can be fired. The payload consists of the sensor
and the warhead. At the desired altitude, the subpacks in the dispenser
are ejected and the submunitions released. The parafoils are then
deployed and the search mode initiated. The wide area scan greatly
increases the search area to compensate for large delivery errors. Once
the fuze ignites the charge and an explosion is set up, the slug is
formed. Simultaneously the submunition sensors and the parafoil are
destroyed. The ability of a parafoil-controlled submunition to glide
provides an increased search area and control to target. With an
ability to change its horizontal to vertical velocity ratio and to brake,
the parafoil can be programmed to provide a simple terminal homing
capability.

Descriptive Parameters

-  Average  time  to  search  and  acquire  a  target. 

4.  The  Submunition  Warhead 

The objective of the warhead is to achieve a mobility kill in ground
mobile targets including heavy armor (tanks), softer vehicles (APCs, air
defense targets, light armor) and rail transport. The means of
attaining this objective is a modular Explosively Formed Penetrator
(EFP). The warheads are used in shoot-to-kill sensor fuzed
munitions (SFM).

In EFP warheads, the fuze ignites the explosive material (chemical
energy warheads). Some of the explosive energy is used to reshape the
liner and accelerate it towards the target.

5. The Submunition Sensors

The submunition sensors perform the following basic functions:

a.  A  search  function  involving  target  search,  detection, 
identification,  discrimination,  classification  and  acquisition; 

b.  A  target  location  function  involving  the  location,  relative  speeds, 
coordinates  and  other  dynamic  characteristics  of  the  target  with 
respect  to  the  submunition  for  target  engagement; 

c. A maneuvering function so that once a target is acquired, the
submunition will maneuver to an optimum lethal range and position
to fire a warhead at a predetermined aimpoint;

d. Auxiliary functions such as false target rejection, false alarm
elimination, warhead mode selection, etc.

Sensors perform their functions by receiving electromagnetic
radiation emitted by targets and their surrounding environment.
Variations in electrical pulses due to radiation changes are sent to a
signal processor which performs the above functions. Sensors are
characterized by their operating mode (passive, active or dual) and
their operating waveband (infrared, millimeter wave, etc.).

In a passive mode, sensors receive, through a receiver, radiation
emitted or reflected by objects on the battlefield. In an active mode,
sensors transmit radiation through a transmitter and receive the
associated reflections as well as radiation from other sources through a
receiver. Sensors operating in a dual mode typically include an active
mode for target acquisition and tracking and a passive mode for the
terminal phase.

We shall discuss three types of sensors: the infrared (IR)
sensors, the millimeter wave (MMW) sensors, and the electro-optical (EO)
sensors.

i. IR Sensors

IR sensors capture the radiant energy emitted by heated objects.
In their simplest type, IR sensors operating in the passive mode
optically scan the target area for IR radiation emitted in a single
waveband by all bodies. A more complex design involves IR sensors that
detect the target signature in two different wavebands, thus allowing
discrimination between a true target (e.g. tank) and a decoy (e.g.
flare). Finally, for high angular resolution in target detection and
tracking, imaging infrared (IIR), which yields image-like properties of
the target, may be used.

ii. MMW Sensors

MMW sensors capture the radiant energy reflected by metal objects.

iii. EO Sensors

EO  sensors  typically  integrate  optics  and  lasers  and  employ 
advanced  forward-looking  infrared  (FLIR)  sensors  and  image-processing 
computers  for 

-  automatic  target  recognition; 

-  intelligent  tracking; 

-  prioritization  of  multiple  targets; 

-  sensor  input  integration. 

They contain a laser designator for directing laser-guided weapons and
for helping to see at night and where smoke, dust, haze and smog are
present.

Descriptive Parameters of the Submunition

- Number of submunitions;

- Average time to search and detect a target;

- Probability of acquiring a target given that it is detected;

- Probability of acquiring a false target.

6. Remarks

i. The use of multi-mode (active and passive) and multispectral (MMW
and IR) systems presents several advantages. Such systems:

- provide greater accuracy in target hit;

- reduce false alarms;

- improve target detectability and acquisition;

- defeat enemy countermeasures.

ii. The final disposition of the submunition is a critical one,
particularly if it is not paired to a target and its explosive is
not activated. In such a case, if the submunition is designed to
self-explode at a given altitude, it can create a source of heat
which could capture other incoming live submunitions. If it is not
designed to self-explode or if the self-destruct mechanism fails to
be activated, it is liable to fall into enemy hands and thus be
technologically accessible for the development of countermeasures.

7. The Target

Primary targets are ground mobile targets including heavy armor
(tanks) as well as softer vehicles ranging from air defense targets to
light armor and rail transport.

The feature distinguishing smart munitions from other types of
anti-armor weapons is that they home on their targets and/or are
activated by them. They also attack targets at their most vulnerable
point, namely the top. Typically, a large number of smart munitions will
be needed to ensure defeat of a massive armored assault consisting of
many targets. A barrage of thousands of these munitions would blunt
an armored assault and its reinforcing columns.

Descriptive  Parameters 

-  Total  number  of  targets  (true  and  decoys); 

-  Proportion  of  decoys  to  total  number  of  targets; 

-  Probability  of  target  being  hit; 

-  Probability  of  target  being  killed  given  that  it  is  hit. 


IV. THE MATHEMATICAL MODEL

1. The Problem

The development of a mathematical model depends on several factors
governing the actual conditions under which the weapon system operates.
These may include, for example, environment, combat scenario, operation
sequence, mode of weapon delivery, weapon technology, etc. In
general, a model should be able to capture the uncertainty element
present in an actual combat situation together with the evolution of the
combat state at successive time epochs. Very often the objective is not
simply to win a battle, but to win it as quickly as possible. From
that point of view, time becomes an important parameter to be
incorporated in the model.

To  illustrate  the  methodology,  we  consider  a  situation  in  which  an 
aircraft  releases  a  single  dispenser  from  a  standoff  location  just 
before  penetrating  enemy  lines.  The  dispenser  carries  M  submunitions  to 
be released over an area A containing N targets (M > N).

2. Assumptions

We assume that:

a.  the  dispenser  carries  M  submunitions  to  be  released  over  an  area 
containing  N  targets; 

b.  the  number  of  targets  is  reduced  only  by  the  number  of  targets 
killed  by  the  submunitions.  No  other  weapons  are  fired  against 
these  targets  and  the  targets  are  considered  to  remain  within  the 
scanned  area; 

c.  if  a  submunition  does  not  acquire  a  target,  it  searches  for  another 
target; 

d.  once  a  submunition  acquires  a  target,  it  is  locked  onto  the  target 
to  be  shot  and  killed; 

e.  no  two  or  more  submunitions  may  acquire  the  same  target; 

f.  each  submunition  acts  independently  of  any  other  submunition; 

g.  the  submunition  sensors  are  not  subject  to  threats; 

h.  a  submunition  can  kill  only  one  target; 

i.  a  submunition  may  kill  only  the  target  that  it  acquires.  That 
means  that  a  target  cannot  be  killed  "by  mistake"; 

j.  a  submunition  may  not  acquire  a  false  or  dead  target; 

k. the dispenser releases the M submunitions in salvo.

If any of the above assumptions are changed, the mathematical model
will have to be modified accordingly. The present model may be viewed
as a prototype which may be used to construct other model variants.

The stochastic aspect of the problem is characterized by
$P(M-m, N-n, t)$, the probability that at time $t$ following release from
the dispenser, there are exactly $M-m$ remaining live submunitions
($m = 0, 1, \ldots, M$) and $N-n$ remaining live targets
($n = 0, 1, \ldots, N$). One can obtain $P(M-m, N-n, t)$ by developing
appropriate differential-difference equations subject to a set of
initial conditions. One way of obtaining the solution of these equations
is through a recursive approach. Once $P(M-m, N-n, t)$ is derived, one
can obtain such characteristics as the expected number of targets
killed, the expected number of submunitions to kill a given number of
targets, the mission reliability, as well as other measures of
effectiveness.

2. The Symbols

$\lambda\,dt + o(dt)$ = probability that the dispenser will release the
submunitions in salvo in the time interval $(t, t + dt)$;

$\nu\,dt + o(dt)$ = probability that the dispenser will encounter an enemy
threat in the time interval $(t, t + dt)$;

$p_1$ = probability that the dispenser will be killed given that
it encounters an enemy threat;

$\omega\,dt + o(dt)$ = probability that the dispenser will encounter enemy
countermeasures (e.g. jamming) in the time interval $(t, t + dt)$;

$p_2$ = probability that the dispenser will be neutralized by
countermeasures and will not be able to accomplish its mission;

$P(i, t)$ = probability that at time $t$ the dispenser is in state
$i$, $i = 0, 1$. State $i = 0$ corresponds to the state
"dispenser killed"; state $i = 1$ corresponds to the state
"dispenser not killed";

$\tau$ = time at which the dispenser releases the submunitions,
measured from the time origin at which the dispenser is ejected;

$M$ = number of submunitions released in salvo by the dispenser;

$N$ = total number of targets in footprint area (includes decoys);

N, = total number of decoys in footprint area;

$\mu\,dt + o(dt)$ = probability that a submunition will detect a target in
the time interval $(t, t + dt)$ ($\tau$ is the time origin);

$p_A$ = probability that a submunition will acquire a target once
detected;

$p_k$ = probability that a submunition will kill a target once
acquired;

$P(M-m, N-n, t)$ = probability that at time $t$ there are $M-m$ remaining
live submunitions and $N-n$ remaining live targets,
$m = 0, 1, \ldots, M$ and $n = 0, 1, \ldots, N$;

$E[\,\cdot\,]$ = the expectation operator.

3. The Model

i. The Dispenser

We have

$$P(1, t + dt) = P(1, t)(1 - \nu\,dt)(1 - \omega\,dt) + P(1, t)\,\nu\,dt\,(1 - \omega\,dt)(1 - p_1) + P(1, t)\,\omega\,dt\,(1 - \nu\,dt)(1 - p_2) + o(dt)$$

or

$$\frac{dP(1, t)}{dt} = -(\nu p_1 + \omega p_2)\,P(1, t)$$

subject to the initial condition $P(1, 0) = 1$. This yields

$$P(1, t) = e^{-(\nu p_1 + \omega p_2)t}.$$

The probability that the submunitions are released in the time interval
$(\tau, \tau + d\tau)$ following dispenser ejection at time origin is:

$$f(\tau)\,d\tau = P(1, \tau)\,e^{-\lambda\tau}\,\lambda\,d\tau = e^{-(\lambda + \nu p_1 + \omega p_2)\tau}\,\lambda\,d\tau \qquad (1)$$
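
As a hedged numerical illustration of the dispenser model, the sketch below evaluates the survival probability P(1, t) and the release density f of equation (1), then integrates f by the trapezoidal rule and compares the result with the elementary antiderivative of the exponential. The parameter values, function names and step count are assumptions chosen for the example; lam, nu and omega stand for the release, threat and countermeasure rates of the symbol list.

```python
import math


def dispenser_alive(t, nu, p1, omega, p2):
    """P(1, t): probability that the dispenser is still operating at time t."""
    return math.exp(-(nu * p1 + omega * p2) * t)


def release_density(tau, lam, nu, p1, omega, p2):
    """f(tau) of equation (1): the dispenser survives to tau, has not yet
    released, and releases in (tau, tau + d tau)."""
    return dispenser_alive(tau, nu, p1, omega, p2) * math.exp(-lam * tau) * lam


def release_probability(t, lam, nu, p1, omega, p2, steps=100_000):
    """Trapezoidal integral of f over [0, t]."""
    h = t / steps
    args = (lam, nu, p1, omega, p2)
    total = 0.5 * (release_density(0.0, *args) + release_density(t, *args))
    for k in range(1, steps):
        total += release_density(k * h, *args)
    return total * h


# Hypothetical parameter values, for illustration only.
lam, nu, p1, omega, p2 = 0.5, 0.2, 0.3, 0.1, 0.4
numeric = release_probability(5.0, lam, nu, p1, omega, p2)
rate = lam + nu * p1 + omega * p2
closed_form = lam / rate * (1.0 - math.exp(-rate * 5.0))
```

The two values agree to within the quadrature error, which gives a quick check on the exponent of equation (1).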

ii. The Submunitions

In general the total number $M$ of submunitions exceeds the total
number $N$ of targets. Let $P(M-m, N-n, t)$ denote the probability that at
time $t$ there are $(M-m)$ remaining live submunitions and $(N-n)$ remaining
live targets ($n$ targets killed). Note that always $m \ge n$. For
$t = 0$, $P(M, N, 0) = 1$.

Now for $0 \le m \le M$ and $0 \le n \le N$, $n \le m$,

$$P(M-m, N-n, t+dt) = [1 - (M-m)(N-n)\,\mu p_A\,dt]\,P(M-m, N-n, t) + (M-m+1)(N-n+1)\,\mu p_A p_k\,dt\,P(M-m+1, N-n+1, t) + (M-m+1)(N-n)\,\mu p_A (1-p_k)\,dt\,P(M-m+1, N-n, t)$$

By expanding the first term, bringing $P(M-m, N-n, t)$ to the left side,
dividing by $dt$ and then taking the limit as $dt \to 0$ we obtain:

$$\frac{dP(M-m, N-n, t)}{dt} = -(M-m)(N-n)\,\mu p_A\,P(M-m, N-n, t) + (M-m+1)(N-n+1)\,\mu p_A p_k\,P(M-m+1, N-n+1, t) + (M-m+1)(N-n)\,\mu p_A (1-p_k)\,P(M-m+1, N-n, t)$$

for $m = 0, 1, \ldots, M$, $n = 0, 1, \ldots, N$, where any term whose
arguments fall outside these ranges is taken to be zero.
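
The system above can also be integrated numerically, which is a convenient cross-check for small cases. The following is a minimal sketch, not the recursive solution method of the paper: it uses a classical fourth-order Runge-Kutta step, indexes the state by live submunitions a = M - m and live targets b = N - n, and writes c for the product of the detection rate and the acquisition probability. The function names, step count and example parameters are assumptions.

```python
def derivative(P, M, N, c, pk):
    """Right-hand side of the differential-difference equations.

    P[a][b] is the probability of a live submunitions and b live targets,
    i.e. a = M - m and b = N - n; c stands for mu * p_A.
    """
    dP = [[0.0] * (N + 1) for _ in range(M + 1)]
    for a in range(M + 1):
        for b in range(N + 1):
            value = -a * b * c * P[a][b]          # leave state (a, b)
            if a < M and b < N:                   # kill from (a+1, b+1)
                value += (a + 1) * (b + 1) * c * pk * P[a + 1][b + 1]
            if a < M:                             # miss from (a+1, b)
                value += (a + 1) * b * c * (1.0 - pk) * P[a + 1][b]
            dP[a][b] = value
    return dP


def axpy(P, K, s):
    """Return P + s * K elementwise."""
    return [[p + s * k for p, k in zip(rp, rk)] for rp, rk in zip(P, K)]


def solve(M, N, mu, pA, pk, t_end, steps=2000):
    """Integrate from P(M, N, 0) = 1 to t_end with classical RK4."""
    c = mu * pA
    P = [[0.0] * (N + 1) for _ in range(M + 1)]
    P[M][N] = 1.0
    h = t_end / steps
    for _ in range(steps):
        k1 = derivative(P, M, N, c, pk)
        k2 = derivative(axpy(P, k1, h / 2), M, N, c, pk)
        k3 = derivative(axpy(P, k2, h / 2), M, N, c, pk)
        k4 = derivative(axpy(P, k3, h), M, N, c, pk)
        incr = axpy(axpy(axpy(k1, k2, 2.0), k3, 2.0), k4, 1.0)  # k1+2k2+2k3+k4
        P = axpy(P, incr, h / 6)
    return P


# Hypothetical example: 2 submunitions against 1 target.
P = solve(M=2, N=1, mu=1.0, pA=0.5, pk=0.9, t_end=1.0)
total = sum(sum(row) for row in P)
```

In this two-on-one example the state with one live submunition and no live target can be reached only by a kill on the first acquisition, so its probability is p_k(1 - e^{-2ct}), which the numerical table reproduces; the entries also sum to one at every step.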

The solution to this system of differential-difference equations is

$$P(M-m, N-n, t) = \frac{M!\,N!}{(M-m)!\,(N-n)!}\,\frac{1}{(m-n)!}\,p_k^{\,n}(1-p_k)^{m-n} \sum_{i=0}^{m-n} (-1)^{m-n-i}\,\frac{(m-n)!}{(m-n-i)!\,i!} \sum_{j=0}^{n} \frac{e^{-(M-i-j)(N-j)\mu p_A t}}{\prod_{\ell=0,\,\ell\neq j}^{n}\left[(M-i-\ell)(N-\ell) - (M-i-j)(N-j)\right]}$$

Now as $t \to \infty$, we have, for $0 \le n \le m < M$ and $n < N$,

$$\lim_{t\to\infty} P(M-m, N-n, t) = 0.$$

Also, for $0 \le n \le m = M$ and $n < N$,

$$P_1(n) = \lim_{t\to\infty} P(0, N-n, t) = \frac{M!}{(M-n)!\,n!}\,p_k^{\,n}(1-p_k)^{M-n} = \binom{M}{n}\,p_k^{\,n}(1-p_k)^{M-n}.$$

For $n = N \le m \le M$,

$$P_2(m) = \lim_{t\to\infty} P(M-m, 0, t) = p_k^{\,N}(1-p_k)^{m-N} \sum_{i=0}^{m-N} (-1)^{m-N-i} \binom{M}{i}\binom{M-N-i}{M-m}.$$

Expressions for the probability that all targets are killed and the
expected number of targets killed are as follows:

1) for $M > N$

$$P\{\text{all targets killed}\} = 1 - \sum_{r=0}^{N-1} \binom{M}{r}\,p_k^{\,r}(1-p_k)^{M-r} = P_1(N)$$

$$\text{Expected number of targets killed} = \sum_{n=0}^{N} n\,P_1(n)$$

2) for $M = N$

$$P\{\text{all targets killed}\} = p_k^{\,N}$$

$$\text{Expected number of targets killed} = M p_k$$

3) for $M < N$

$$P\{\text{all targets killed}\} = 0$$

$$\text{Expected number of targets killed} = M p_k$$

Steady state results for the probability of kill and the expected number
of targets killed are presented in Figures 1 and 2 for $N = 5$, $p_k = 0.90$
and $M = 1, \ldots, 15$.

Figure 1: Probability of all targets killed as a function of
munitions used (N = 5)

Figure 2: Expected number of targets killed as a function of
munitions used (N = 5)
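
The steady-state expressions are simple enough to tabulate directly. The sketch below is an illustration under the three cases listed above (M > N, M = N, M < N); the function names are hypothetical, and the loop uses the parameter values quoted for Figures 1 and 2.

```python
from math import comb


def p_all_killed(M, N, pk):
    """Steady-state probability that all N targets are killed."""
    if M < N:
        return 0.0
    return 1.0 - sum(comb(M, r) * pk**r * (1 - pk)**(M - r) for r in range(N))


def expected_killed(M, N, pk):
    """Steady-state expected number of targets killed."""
    if M <= N:
        return M * pk
    partial = sum(n * comb(M, n) * pk**n * (1 - pk)**(M - n) for n in range(N))
    return partial + N * p_all_killed(M, N, pk)


# Parameters quoted for Figures 1 and 2: N = 5, p_k = 0.90, M = 1, ..., 15.
table = [(M, p_all_killed(M, 5, 0.90), expected_killed(M, 5, 0.90))
         for M in range(1, 16)]
```

Each row of `table` holds the number of submunitions, the probability that all targets are killed, and the expected number of targets killed.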


V. MEASURES OF EFFECTIVENESS

We develop five measures of effectiveness.

1. Expected number of targets killed at time $t$, $\hat{n}(t)$:

$$E[\hat{n}(t)] = \sum_{m=0}^{M} \sum_{n=0}^{\min(m,N)} n\,P(M-m, N-n, t)$$

2. Expected number of submunitions to kill $n$ targets at time $t$, $\hat{m}(t)$:

$$E[\hat{m}(t)] = \sum_{m=n}^{M} m\,P(M-m, N-n, t)$$

3. Probability that all targets are killed at time $t$:

$$\sum_{m=N}^{M} P(M-m, 0, t)$$
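
Given a table of state probabilities at a fixed time, the first three measures are plain sums. The helper functions below are a hypothetical sketch; they assume the table is stored as P[a][b], with a live submunitions (a = M - m) and b live targets (b = N - n), and the small example table is made up for illustration.

```python
def expected_targets_killed(P, M, N):
    """Measure 1: sum of n * P(M-m, N-n, t) over all states."""
    return sum((N - b) * P[a][b] for a in range(M + 1) for b in range(N + 1))


def submunitions_term(P, M, N, n):
    """Measure 2: sum over m = n, ..., M of m * P(M-m, N-n, t)."""
    return sum((M - a) * P[a][N - n] for a in range(M - n + 1))


def prob_all_targets_killed(P, M):
    """Measure 3: probability that no live target remains."""
    return sum(P[a][0] for a in range(M + 1))


# Made-up table for M = N = 1 at some fixed time (rows: live submunitions).
P = [[0.45, 0.05],   # a = 0: target killed / submunition spent without a kill
     [0.00, 0.50]]   # a = 1: unreachable state / nothing has happened yet
kills = expected_targets_killed(P, 1, 1)
all_killed = prob_all_targets_killed(P, 1)
used = submunitions_term(P, 1, 1, 1)
```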


4. Mission reliability for the dispenser

This is the probability that the dispenser successfully releases
all submunitions. Let

$\hat{P}(1, 0, t)$ = probability that at time $t$ the dispenser is operating
successfully and the submunitions are not released;

$\hat{P}(1, 1, t)$ = probability that at time $t$ the dispenser is operating
successfully and the submunitions are released.

It is then easy to verify that

$$\frac{d\hat{P}(1, 0, t)}{dt} = -(\lambda + \nu p_1 + \omega p_2)\,\hat{P}(1, 0, t)$$

and

$$\frac{d\hat{P}(1, 1, t)}{dt} = \lambda\,\hat{P}(1, 0, t)$$

subject to

$$\hat{P}(1, 0, 0) = 1 \quad \text{and} \quad \hat{P}(1, 1, 0) = 0.$$

From these two equations one obtains for mission reliability of the
dispenser

$$\hat{P}(1, 1, t) = \frac{\lambda}{\lambda + \nu p_1 + \omega p_2}\left[1 - e^{-(\lambda + \nu p_1 + \omega p_2)t}\right]$$
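
The integration behind the dispenser reliability can be written out in two steps. The lines below are a sketch in the notation of the symbol list (release rate $\lambda$, threat rate $\nu$ with kill probability $p_1$, countermeasure rate $\omega$ with neutralization probability $p_2$), with a hat marking the two dispenser state probabilities; they are a worked restatement, not a quotation from the original text.

```latex
\begin{align*}
\hat{P}(1,0,t) &= e^{-(\lambda + \nu p_1 + \omega p_2)t}, \\
\hat{P}(1,1,t) &= \lambda \int_0^t e^{-(\lambda + \nu p_1 + \omega p_2)s}\,ds
               = \frac{\lambda}{\lambda + \nu p_1 + \omega p_2}
                 \left[1 - e^{-(\lambda + \nu p_1 + \omega p_2)t}\right], \\
\lim_{t\to\infty} \hat{P}(1,1,t) &= \frac{\lambda}{\lambda + \nu p_1 + \omega p_2}.
\end{align*}
```

The limiting value is the probability that release occurs before the dispenser is killed or neutralized, which is the ratio of the release rate to the total rate of leaving the "operating, not released" state.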


5. Expected duration of the battle

Let $T$ be the random variable denoting the duration of the battle.
The battle will terminate if either all targets are killed or there are
no munitions left. Hence

$$P\{T > t\} = 1 - \sum_{m=N}^{M} P(M-m, N-N, t) - \sum_{n=0}^{N-1} P(M-M, N-n, t)$$

$$E[T] = \int_0^\infty P\{T > t\}\,dt$$

After some algebraic manipulation, one obtains

$$E[T] = -\Bigg\{ \sum_{m=N}^{M} \frac{M!\,N!}{(M-m)!}\,p_k^{\,N}(1-p_k)^{m-N} \sum_{i=0}^{m-N} \frac{(-1)^{m-N-i}}{(m-N-i)!\,i!} \sum_{j=0}^{N-1} \frac{1/[(M-i-j)(N-j)\mu p_A]}{\prod_{\ell=0,\,\ell\neq j}^{N}\left[(M-i-\ell)(N-\ell) - (M-i-j)(N-j)\right]}$$

$$\qquad + \sum_{n=0}^{N-1} \frac{M!\,N!}{(N-n)!}\,p_k^{\,n}(1-p_k)^{M-n} \sum_{i=0}^{M-n-1} \frac{(-1)^{M-n-i}}{(M-n-i)!\,i!} \sum_{j=0}^{n} \frac{1/[(M-i-j)(N-j)\mu p_A]}{\prod_{\ell=0,\,\ell\neq j}^{n}\left[(M-i-\ell)(N-\ell) - (M-i-j)(N-j)\right]}$$

$$\qquad + \sum_{n=0}^{N-1} \frac{M!\,N!}{(M-n)!\,(N-n)!}\,p_k^{\,n}(1-p_k)^{M-n} \sum_{j=0}^{n-1} \frac{1/[(n-j)(N-j)\mu p_A]}{\prod_{\ell=0,\,\ell\neq j}^{n}\left[(n-\ell)(N-\ell) - (n-j)(N-j)\right]} \Bigg\}$$
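
For small cases the expected battle duration can be cross-checked without the closed form. The sketch below uses first-step analysis on the state (live submunitions, live targets), which is an alternative technique and not the derivation used in the paper: in a state with a live submunitions and b live targets the next acquisition occurs at rate a·b·μ·p_A, after which a kill or a miss follows with probabilities p_k and 1 - p_k. The function name and example values are assumptions.

```python
from functools import lru_cache


def expected_duration(M, N, mu, pA, pk):
    """E[T] by first-step analysis on (live submunitions, live targets)."""
    c = mu * pA

    @lru_cache(maxsize=None)
    def E(a, b):
        if a == 0 or b == 0:          # battle over
            return 0.0
        hold = 1.0 / (a * b * c)      # mean time until the next acquisition
        return hold + pk * E(a - 1, b - 1) + (1.0 - pk) * E(a - 1, b)

    return E(M, N)


# Hypothetical example: 2 submunitions against 1 target.
example = expected_duration(2, 1, mu=1.0, pA=0.5, pk=0.9)
```

With two submunitions and one target this gives 1/(2c) + (1 - p_k)/c, where c = μ p_A: the mean wait for the first acquisition plus, after a miss, the mean wait for the second.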


Remark: Case When Decoys Are Present

Let

$N$ = total number of targets including decoys
$N_1$ = total number of true targets
$N - N_1$ = total number of decoys

$P(r|n)$ = probability that $r$ true targets are killed given that $n$ total
targets are killed

Then

$$P(r|n) = \frac{\binom{N_1}{r}\binom{N-N_1}{n-r}}{\binom{N}{n}}, \qquad r = 0, 1, 2, \ldots, n$$

The probability that $r$ true targets are killed, $n$ total targets are
killed and $m$ submunitions are used is $P(r|n)\,P(M-m, N-n, t)$. Using
this expression one obtains the following:

i. Expected number of true targets killed, $\hat{r}(t)$:

$$E[\hat{r}(t)] = \sum_{m=0}^{M} \sum_{n=0}^{\min(m,N)} \sum_{r=0}^{n} r\,P(r|n)\,P(M-m, N-n, t)$$

ii. Expected number of submunitions used to kill $r$ true targets, $\hat{s}(t)$:

$$E[\hat{s}(t) \mid r] = \sum_{m=r}^{M} \sum_{n=r}^{\min(m,N)} m\,P(r|n)\,P(M-m, N-n, t)$$
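
The conditional probability P(r|n) can be evaluated directly. The sketch below assumes it is the hypergeometric probability of finding r true targets among n killed targets when N1 of the N targets are true and the rest are decoys; the function names and the numbers in the example are hypothetical.

```python
from math import comb


def p_true_given_total(r, n, N, N1):
    """P(r | n): r of the n killed targets are true targets, when N1 of the
    N targets are true and the rest are decoys (hypergeometric)."""
    if r < 0 or r > min(n, N1) or n - r > N - N1:
        return 0.0
    return comb(N1, r) * comb(N - N1, n - r) / comb(N, n)


def expected_true_given_total(n, N, N1):
    """Mean number of true targets among n killed targets."""
    return sum(r * p_true_given_total(r, n, N, N1) for r in range(n + 1))


# Hypothetical example: 8 targets of which 5 are true, 3 targets killed.
probs = [p_true_given_total(r, 3, 8, 5) for r in range(4)]
mean_true = expected_true_given_total(3, 8, 5)
```

The mean equals n·N1/N, so on average the killed targets contain true targets in the same proportion as the target array.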


VI.  CONCLUSIONS 

The effective use of the smart submunition weapon system depends to
a considerable extent on the environment in which it will be operating,
whether such environment is natural or man-made. It thus becomes
imperative to identify all the elements which may exist in an actual
battlefield situation and which adversely affect the performance
of these weapons, either by neutralizing or degrading their
effectiveness. Among such elements, one has to consider the following:

1. Hostile threats against the carriage aircraft, the dispenser, the
submunitions and the supporting C³I.

2.  Adverse  weather  conditions,  particularly  rain,  snow,  sleet  and  fog. 

3.  Use  of  various  passive  and  active  countermeasures  on  the 
submunition  sensors,  such  as  smoke,  corner  reflectors,  IR  sources, 
decoys  and  camouflage. 

4. Use of ECM on the C³I system supporting the delivery of the
submunition. This includes air and/or ground sensors for target
area location and target movements, which transmit information
through data link to the air platform and/or the dispenser.

5. Use of reactive countermeasures by the enemy, particularly as a
result of:

i.  the  visual  acquisition  of  incoming  submunitions  delivered  by 
slow  moving  devices,  such  as  parafoils; 
ii.  the  activation  of  target  mounted  sensors  warning  of  laser,  radar 
and  IR  acquisition; 

iii.  the  possibility  of  acquisition  by  the  enemy  in  the  battlefield 
of  partly  damaged  dispensers  and  undetonated  submunitions. 

6.  The  potential  use  of  presently  unknown  types  of  countermeasures. 

7.  The  acquisition  by  submunition  sensors  of  false  targets. 

8. The acquisition by submunition sensors of dead targets.

9. The engagement of one target, single or dead, by several
submunitions.

Consideration needs to be given to IR/MMW submunition sensors
homing on targets which have been incapacitated by previous attacks.
Such a situation can considerably deter the effective engagement and
pairing of a given submunition with a given live target.

Second, it is possible that test results may have demonstrated an
acceptable performance level of the weapons under the condition that a
single target is present in the test area. However, the main
characteristic  of  a  smart  submunition  weapon  system  is  that  it  is  a 
many-on-many  engagement  system.  Therefore,  tests  must  be  conducted 
simulating  actual  combat  conditions  when  several  targets  are  present  and 
when  they  are  being  simultaneously  engaged  by  several  submunitions. 

Finally, when assessing the performance of a smart munition system
either mathematically or by simulation, it is essential:

1. to study and analyze the aggregate system including the carriage
aircraft, the dispenser, the submunitions and the supporting C³I;

2. to incorporate in the model all pertinent and significant factors
that bear on the performance of the system, including environment,
threats and countermeasures;

3.  to  define  likely  engagement  scenarios  and  to  perform  the  analysis 
for  each  scenario  in  order  to  assess  and  compare  their  performance. 
These  scenarios  should  be  the  ones  that  are  the  most  likely  to 
occur  in  a  realistic  combat  situation  using  existing  weapon 
technologies  for  target  acquisition,  discrimination  and 
classification.


ELECTROMAGNETISM AND GRAVITY


Richard  A.  Weiss 

U.  S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi  39180 


ABSTRACT. Space and time coordinates exhibit broken symmetries that are
carried over long distances by gravitation. Electromagnetism in space and time
with broken internal symmetries has electric and magnetic field vectors and
potentials that have internal phase angles. These phase angles enter the
calculation of the measurable electric and magnetic fields. Maxwell's equations
with broken internal symmetries are solved for simple physical cases and the
general form of the internal phase angles of the field vectors is determined.
The integral equation development of electromagnetism is formulated in
asymmetric space-time, and the Gauss theorem and Ampere's law with broken
symmetry are developed. The elementary calculations of electromagnetism are
examined to determine the effects of the broken symmetry of space and time on
electrostatics, magnetostatics, electric currents in wires and the propagation
of electromagnetic waves in matter. Over long distances the broken symmetry of
space and time is due to gravitation so that the elementary calculations and
measurements of electromagnetism are expected to depend on gravity. Simple
electromagnetic experiments are suggested to determine the internal phase
angles of space and time in the vicinity of the earth. The results of this
paper will have applications to the theory of the propagation of
electromagnetic waves in the neighborhood of the earth, the measurement of the
electromagnetic properties of matter, and high temperature superconductivity.

1. INTRODUCTION. Electromagnetism is a gauge field.¹⁻³ There are also
other gauge fields in nature such as gravity and the strong and weak nuclear
forces. All of these forces may individually represent various degrees of the
broken gauge symmetry of a single unified force. Gravity is described by a
metric tensor gauge theory as being the manifestation of the curvature of a
four dimensional spacetime which is determined by the mass distribution in
space.⁵ Electromagnetism is a vector gauge theory whose forces are mediated by
a massless spin one photon.¹⁻¹⁴ These apparently distinct forces may be related
through gauge field theory.⁶

Before the advent of gauge field theory it was already known that
gravitational forces interacted with electromagnetic forces. This was predicted
by Einstein's theory of general relativity in the concept of the gravitational
red shift of spectral lines from the sun, and in the bending of star light in
the gravitational field of the sun.⁷⁻¹⁰ The interaction of gravity with
electromagnetism is weak and it requires a body as massive as the sun to bend
star light sufficiently to be measured. The corresponding effect due to the
earth's gravity is negligible and has no practical effects.

This paper considers another way in which gravity affects electromagnetism.
This occurs through the broken symmetry of space and time that is associated
with a pressure field in matter.¹¹,¹² Ultimately the pressure field in the
solid earth, ocean and atmosphere is due to gravity, and therefore the dominant
effect producing the internal phase angles of space and time in the vicinity of
the earth is due to gravity. The method of determining the internal phase
angles of the space coordinates from the internal phase angle of the pressure
has been presented for gravitating stars and planets.¹² It involves the
solution of a series of coupled differential equations. Numerical values of the
internal phase angle associated with the radial coordinate can be determined
experimentally through the apparent deviation from Newtonian gravity in the
earth, and from the Pound-Rebka-Snider photon red shift experiment.¹² It has
already been shown that the field vectors of the electromagnetic field have a
broken symmetry which is represented by internal phase angles.¹²

The  internal  phase  angles  of  the  electric  and  magnetic  field  vectors  and 
potentials  will  be  shown  in  this  paper  to  depend  on  the  internal  phase  angles 
of  the  space  and  time  coordinates.  For  a  relatively  weak  electromagnetic  field 
in  the  vicinity  of  the  earth,  the  internal  phase  angles  of  the  electromagnetic 
field  vectors  will  depend  on  the  gravity  induced  internal  phase  angles  of  the 
space and time coordinates. In this way gravity will have a direct and
measurable effect on the character of an electromagnetic field.


The broken symmetry of space and time requires that coordinates be complex
numbers, which for cartesian coordinates take the form12

$\bar{x} = x e^{j\theta_x}$
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and  for  spherical  coordinates 
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The  internal  phase  angles  of  the  coordinates  depend  on  the  local  energy  density 
and pressure.12


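The broken symmetry sine and cosine functions are

$\sin\bar\psi = S_\psi e^{j\theta_{s\psi}}$

$\cos\bar\psi = C_\psi e^{j\theta_{c\psi}}$

where

$S_\psi = [\sin^2(\psi\cos\theta_\psi) + \sinh^2(\psi\sin\theta_\psi)]^{1/2}$

$C_\psi = [\cos^2(\psi\cos\theta_\psi) + \sinh^2(\psi\sin\theta_\psi)]^{1/2}$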
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(8) 


$\tan\theta_{s\psi} = \cot(\psi\cos\theta_\psi)\tanh(\psi\sin\theta_\psi)$

$\tan\theta_{c\psi} = \tan(\psi\cos\theta_\psi)\tanh(\psi\sin\theta_\psi)$ (9)
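The magnitude and phase of the broken symmetry sine can be checked numerically. The sketch below assumes that the complex angle is $\bar\psi = \psi e^{j\theta_\psi}$ and compares the closed forms for $S_\psi$, $C_\psi$ and $\theta_{s\psi}$ with a direct complex evaluation. The function names and sample values are illustrative only and do not come from the paper.

```python
import cmath
import math


def broken_symmetry_sine(psi, theta_psi):
    """Return (S, theta_s) for sin(psi * exp(j * theta_psi))."""
    a = psi * math.cos(theta_psi)  # real part of the complex angle
    b = psi * math.sin(theta_psi)  # imaginary part of the complex angle
    magnitude = math.sqrt(math.sin(a) ** 2 + math.sinh(b) ** 2)
    # atan2 form of tan(theta_s) = cot(a) * tanh(b); it keeps the quadrant
    phase = math.atan2(math.cos(a) * math.sinh(b), math.sin(a) * math.cosh(b))
    return magnitude, phase


def broken_symmetry_cosine_magnitude(psi, theta_psi):
    """Return C for cos(psi * exp(j * theta_psi))."""
    a = psi * math.cos(theta_psi)
    b = psi * math.sin(theta_psi)
    return math.sqrt(math.cos(a) ** 2 + math.sinh(b) ** 2)


psi, theta_psi = 0.7, 0.05  # sample values, chosen arbitrarily
z = psi * cmath.exp(1j * theta_psi)
S, theta_s = broken_symmetry_sine(psi, theta_psi)
C = broken_symmetry_cosine_magnitude(psi, theta_psi)
direct_sine = cmath.sin(z)
direct_cosine = cmath.cos(z)
```

Comparing `S` with `abs(direct_sine)` and `theta_s` with `cmath.phase(direct_sine)` confirms the sine relations for the sample values.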

The  following  angles  are  used  to  describe  the  variation  of  the  internal  phase 
angles  with  the  magnitudes  of  the  corresponding  coordinates 
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The  corresponding  measured  values  of  these  coordinates  are  given  by 
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From  these  equations  it  follows  immediately  that 
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The  skewed  nature  of  the  coordinates  is  due  to  the  skewed  nature  of  the 
pressure12

$\bar{P} = P e^{j\theta_P}$ (25)

Whenever  an  internal  phase  angle  appears  in  the  electromagnetic  calculations  of 
this  paper  it  represents  mainly  the  effects  of  gravitationally  induced  broken 
symmetry  associated  with  the  pressure  and  spacetime  coordinates.  The  measured 
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value  of  the  pressure  is  given  by 


$P_m = P\cos\theta_P$ (26)
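In this formalism a measured value is the magnitude multiplied by the cosine of the internal phase angle, which is the real part of the complex quantity. A minimal sketch of that rule follows; the helper name `measured_value` and the numbers are hypothetical.

```python
import cmath
import math


def measured_value(magnitude, phase):
    """Measured value of a broken symmetry quantity: magnitude * cos(phase)."""
    return magnitude * math.cos(phase)


P, theta_P = 2.0, 0.1  # sample magnitude and internal phase angle
P_bar = P * cmath.exp(1j * theta_P)  # complex pressure, eq. (25)
P_m = measured_value(P, theta_P)     # measured pressure, eq. (26)
```

Here `P_m` coincides with `P_bar.real`, so the same helper applies to any of the complex quantities introduced in this paper.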


Time also has a broken symmetry represented by an internal phase angle, so that
time must be written as a complex number as follows

$\bar{t} = t e^{j\theta_t}$ (27)


The internal phase angle of time depends on the local energy density and
pressure, $\theta_t = \theta_t(E,P)$. The components of particle velocity can
then be written as12

$\bar v_\alpha = v_\alpha e^{j\theta_{v\alpha}} = d\bar\alpha/d\bar t$ (28)

where $v_\alpha$ and $\theta_{v\alpha}$ are calculated in Reference 12 for
$\alpha = x, y, z$.


This paper determines the electric and magnetic field vectors for space and
time that have broken internal symmetries. The vacuum has intrinsically broken
space and time coordinate symmetries.12 Thus the electromagnetic field has
intrinsically broken symmetry associated with the field vectors.12 But in the
presence of matter it is predominantly gravity (with its infinite range) that
determines the broken symmetry of the coordinates and the electromagnetic field
vectors. Section 2 treats asymmetric gravity. Section 3 considers Maxwell's
equations with broken internal symmetries. Section 4 deals with broken symmetry
electrostatics, Section 5 studies asymmetric magnetostatics, and Section 6
treats electromagnetic waves in a gravitational field.


2. ASYMMETRIC GRAVITY. The force of gravity for radially symmetric stars
or planets composed of matter with broken internal symmetries is described by
the following two equations12

$-2\theta_r + \pi = \theta_P + \beta_{Pr}$ (29)

$\cos\beta_{rr}\,\frac{1}{r^2}\frac{\partial}{\partial r}\left(\frac{r^2}{\rho}\frac{\partial P}{\partial r}\cos\beta_{rr}\sec\beta_{Pr}\right) = -4\pi G\rho$ (30)

where $\rho$ = density magnitude (not the measured density) and G = Newtonian
gravitation constant. It is easy to show that equations (29) and (30) are equivalent

Thus $\theta_r$ is related to the pressure field in matter, which over
macroscopic distances is produced by gravity. In general
$\theta_P = \theta_P(P)$. For geometrically asymmetric planets, such as the
earth, the internal phase angles $\theta_\phi$ and $\theta_\psi$ also enter the
calculation of the gravitational field.12 Note that $\beta_{Pr}$ is in the third
quadrant and can be written as $\beta_{Pr} = \pi + \beta'_{Pr}$, where
$\beta'_{Pr}$ is in the first quadrant; then equation (29) can be written as

$-2\theta_r = \theta_P + \beta'_{Pr}$,  $\tan\beta_{Pr} = \tan\beta'_{Pr}$ (35A)

For small angles equations (31) and (35) give

$-2\theta_r^{G} \approx \theta_P + (P\,\partial\theta_P/\partial r)/(\partial P/\partial r)$


This is the value of $\theta_r$ that is associated with a gravitating body. For
a particle falling, or in a circular orbit, in a gravitational field the
acceleration phase angle condition $\theta_r - 2\theta_t \approx -2\theta_r$ gives
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=  -  3/4  [ep  +  (P30p/3r) / (3P/3r) J 

For  an  elliptical  orbit  the  situation  is  more  complicated.12 
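The step from the acceleration phase angle condition to the internal phase angle of time can be written out as follows. This is a sketch under the reading adopted here, in which the acceleration phase angle is $\theta_r - 2\theta_t$, the force phase angle is $-2\theta_r$, and the small angle result for $-2\theta_r^{G}$ is substituted in the last step.

```latex
\begin{align*}
\theta_r - 2\theta_t \approx -2\theta_r
  \;&\Longrightarrow\; \theta_t \approx \tfrac{3}{2}\,\theta_r , \\
-2\theta_r \approx \theta_P + \frac{P\,\partial\theta_P/\partial r}{\partial P/\partial r}
  \;&\Longrightarrow\;
  \theta_t \approx -\tfrac{3}{4}\left[\theta_P
  + \frac{P\,\partial\theta_P/\partial r}{\partial P/\partial r}\right]
\end{align*}
```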

In general the pressure at a point in matter is due to other forces in
addition to gravitation, such as the strong and weak nuclear forces and
electromagnetism. The total radial coordinate internal phase angle can be
written as

$\theta_r = \theta_r^{G} + \theta_r^{SN} + \theta_r^{WN} + \theta_r^{EM} + \theta_r^{V}$ (37)


where  9, 
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,  9^*  and  9^  =  total,  gravitational,  strong  nuclear, 
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weak  nuclear,  electromagnetic  and  vacuum  values  respectively  of  the  internal 
phase angle of the radial coordinate. Over long distances in matter one has
approximately

$\theta_r \approx \theta_r^{G} + \theta_r^{EMR} + \theta_r^{V}$ (38)

where $\theta_r^{EMR}$ = internal phase angle for electromagnetic radiation.
Static electric fields are shielded in matter and rapidly attenuate with
distance. For matter in a star or planet where gravitation is the dominant
force equation (38) becomes

$\theta_r \approx \theta_r^{G}$ (39)


In  tenuous  isolated  matter  one  expects 
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In  empty  space  0r  =  0r  .  The  approximation  in  equation  (39)  is  expected  to  be 
valid  in  the  vicinity  of  the  earth.  It  should  be  mentioned  that  rest  mass  is 
considered  to  be  a  scalar  having  zero  internal  phase  angle.12  For  the  internal 
phase  angle  of  time  the  same  general  argument  gives 


0  -  0G  +  0SN  +  0™  +  0^ 
t  t  t  t  t 


(41) 


Similar  results  hold  for  the  internal  phase  angles  of  the  angular  coordinates. 
A.  Density  and  Mass 

The definition of the mass density in broken symmetry space for spherical
polar coordinates described by equation (3) is12

$\bar\rho = d^3M/(\bar r^2\sin\bar\psi\,d\bar\psi\,d\bar\phi\,d\bar r)$
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where  and  0S^  are  given  by  equations  (6)  and  (8)  respectively.  For  spherical 
symmetry 


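$\bar\rho^{1} = (4\pi\bar r^2\bar L_s)^{-1}\,dM/d\bar r$ (45)

$\rho^{1} = \cos\beta_{rr}\,(4\pi r^2 L_s)^{-1}\,dM/dr$ (46)

$\theta_\rho^{1} = -3\theta_r - \theta_{Ls} - \beta_{rr}$ (47)

where12

$\bar L_s = L_s e^{j\theta_{Ls}} = \tfrac{1}{2}\,e^{j\langle\theta_\phi\rangle}\,[1 - \cos(\pi e^{j\langle\theta_\psi\rangle})]$ (48)

is the complex number angular factor for spherical geometry defined by

$\bar L_s = \frac{1}{4\pi}\oint \sin\bar\psi\,d\bar\psi\,d\bar\phi$ (49)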
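The angular factor of equation (48) can be evaluated directly with complex arithmetic. The sketch below assumes the two average internal phase angles are supplied as plain numbers; the function name `spherical_angular_factor` and the sample angles are illustrative only.

```python
import cmath
import math


def spherical_angular_factor(theta_phi_avg, theta_psi_avg):
    """Complex angular factor of eq. (48) for given average phase angles."""
    upper_limit = math.pi * cmath.exp(1j * theta_psi_avg)  # complex value of pi
    return 0.5 * cmath.exp(1j * theta_phi_avg) * (1.0 - cmath.cos(upper_limit))


# With no broken symmetry the factor reduces to the conventional value 1.
L_conventional = spherical_angular_factor(0.0, 0.0)

# Small sample angles give a magnitude L_s and a phase theta_Ls.
L_s, theta_Ls = cmath.polar(spherical_angular_factor(0.02, 0.01))
```

The magnitude and phase returned by `cmath.polar` play the role of $L_s$ and $\theta_{Ls}$ in equation (48).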

The average values of internal phase angles are given by
$\langle\theta_\psi\rangle = \theta_\psi(\pi)$ and
$\langle\theta_\phi\rangle = \theta_\phi(2\pi)$. The real and imaginary parts of
equation (48) determine $L_s$ and $\theta_{Ls}$. For applications of the
divergence form of Gauss's law (subsection B) it is necessary to have a density
independent of the angular factor $\bar L_s$ because this factor cancels in the
definition of the divergence. Therefore an often used definition of complex
number density for spherical symmetry will be
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(50) 
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The  measured  mass  density  is  given  by 


$\rho_m = \rho\cos\theta_\rho = f_{3r}\,\rho_c$ (53)


where  the  conventionally  calculated  mass  density  is  given  by 

P. 
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and  combining  equations  (51)  and  (53)  gives 
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where  drm/dr  is  given  by  equation  (22).  The  mass  is  calculated  as  follows 
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where  p  is  given  by  equation  (51). 


For a cylindrical mass distribution the complex number density is given by

$\bar\rho = \rho e^{j\theta_\rho} = d^3M/(\bar r\,d\bar\phi\,d\bar r\,d\bar z)$ (57)
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where  the  average  value  <0  >  =  0  (2it)  .  Then 
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p1  =>  (27rrLc)  ^^/(drdl) 
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As  will  be  described  in  subsection  B,  the  density  that  appears  in  the  divergence 
form of Gauss's law for axial symmetry does not contain the internal phase
factor $\langle\theta_\phi\rangle$, and for this usage the following density is
used for a mass distribution with cylindrical symmetry

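$\bar\rho = (2\pi\bar r)^{-1}\,d^2M/(d\bar r\,d\bar z)$ (64)

$\;\;= (2\pi\bar r)^{-1}\,d\bar\rho_\ell/d\bar r$ (65)

$\rho = \cos\beta_{rr}\cos\beta_{zz}\,(2\pi r)^{-1}\,d^2M/(dr\,dz)$ (66)

$\theta_\rho = -2\theta_r - \theta_z - \beta_{rr} - \beta_{zz}$ (67)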

where  the  mass  per  unit  length  is  given  by 
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The  measured  mass  density  is  given  by 


$\rho_m = \rho\cos\theta_\rho = f_{2r}\,\rho_c$ (70)

where the conventionally calculated mass density is

$\rho_c = (2\pi r_m)^{-1}\,d^2M/(dr_m\,dz_m)$ (71)

$\;\;= (2\pi r_m)^{-1}\,d\rho_{\ell c}/dr_m$ (72)

where

$\rho_{\ell c} = dM/dz_m$ (73)

and where combining equations (66) and (71) gives

$f_{2r} = \cos\beta_{rr}\cos\beta_{zz}\cos\theta_r\cos\theta_\rho\,(dz_m/dz)(dr_m/dr)$ (74)
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The  mass  is  obtained  from  equation  (66),  (71)  and  (72)  as 


$M = 2\pi\iint \rho\,\sec\beta_{rr}\sec\beta_{zz}\,r\,dr\,dz$ (75)

$\;\;= 2\pi\iint \rho_c\,r_m\,dr_m\,dz_m$

$\;\;= 2\pi\iint (\rho_m/f_{2r})\,r_m\,dr_m\,dz_m$


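For cartesian coordinates the mass density is given by12

$\bar\rho = d^3M/(d\bar x\,d\bar y\,d\bar z)$ (76)

$\rho = \cos\beta_{xx}\cos\beta_{yy}\cos\beta_{zz}\,d^3M/(dx\,dy\,dz)$ (77)

$\theta_\rho = -\theta_x - \theta_y - \theta_z - \beta_{xx} - \beta_{yy} - \beta_{zz}$ (78)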


where $\beta_{\alpha\alpha}$ is given by equations (10) through (12). The
measured mass density is

$\rho_m = \rho\cos\theta_\rho = f_{3xyz}\,\rho_c$ (79)

where the conventionally calculated mass density is

$\rho_c = d^3M/(dx_m\,dy_m\,dz_m)$ (80)

and where

$f_{3xyz} = \cos\theta_\rho\cos\beta_{xx}\cos\beta_{yy}\cos\beta_{zz}\,(dx_m/dx)(dy_m/dy)(dz_m/dz)$ (81)

where $d\alpha_m/d\alpha$ is given by equations (19) through (21), and where

$(\rho_c - \rho_m)/\rho_c = 1 - f_{3xyz}$ (82)

Finally, the mass is given by

$M = \iiint \rho\,\sec\beta_{xx}\sec\beta_{yy}\sec\beta_{zz}\,dx\,dy\,dz$ (83)

$\;\;= \iiint \rho_c\,dx_m\,dy_m\,dz_m$

$\;\;= \iiint (\rho_m/f_{3xyz})\,dx_m\,dy_m\,dz_m$

B.  Newton's  Gravitation  Law  for  Asymmetric  Matter 


The gravity field of a radially symmetric spherical mass composed of matter
with broken internal symmetry can be obtained from Gauss's law and the
divergence theorem, which is given by the following broken symmetry
generalization of the standard result13

$\bar\nabla\cdot\bar{\mathbf F} = \bar\nabla_r\bar F_r = -\bar\rho\,G$ (84)

where $\bar F_r$ = broken symmetry gravity force on a unit mass = gravitational
acceleration, and $\bar\rho$ = complex number mass density as given in equations
(50), (65) or (76). These densities do not contain $\langle\theta_\phi\rangle$
and $\langle\theta_\psi\rangle$ because the factors $\bar L_s$ and $\bar L_c$
given by equations (48) and (61) respectively drop out of the calculation of
the divergence and so must not appear in the right hand side of equation (84).
For a system with spherical symmetry equation (84) becomes the following
generalization of the standard scalar result13-17

$\frac{1}{\bar r^2}\frac{d}{d\bar r}(\bar r^2\bar F_r) = -\bar\rho\,G$ (85)

Combining equations (50) and (85) gives

$\bar F_r = F_r e^{j\theta_{Fr}} = -GM/(4\pi\bar r^2) = -GM/(4\pi r^2)\,e^{-2j\theta_r}$ (86)

$F_r = -GM/(4\pi r^2)$,  $\theta_{Fr} = -2\theta_r$ (87)

The broken symmetry gravity potential is given by

$\bar W_r = W_r e^{j\theta_{Wr}} = -GM/(4\pi\bar r) = -GM/(4\pi r)\,e^{-j\theta_r}$ (88)

$W_r = -GM/(4\pi r)$,  $\theta_{Wr} = -\theta_r$ (89)

The measured gravity force and potential for spherical symmetry are given by12

$F_{rm} = F_r\cos\theta_{Fr} = -GM/(4\pi r^2)\cos(2\theta_r)$ (90)

$\;\;= -GM/(4\pi r_m^2)\cos^2\theta_r\cos(2\theta_r)$

$W_{rm} = W_r\cos\theta_{Wr} = -GM/(4\pi r)\cos\theta_r$ (91)

$\;\;= -GM/(4\pi r_m)\cos^2\theta_r$

where $\theta_r$ is generally a function of the radial coordinate magnitude.
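The spherically symmetric results can be reproduced by evaluating the conventional expressions at a complex radius and taking real parts. The sketch below treats $\theta_r$ as a constant and uses $r_m = r\cos\theta_r$ for the measured radius; the function name and the sample values are illustrative assumptions.

```python
import cmath
import math


def spherical_gravity(GM, r, theta_r):
    """Complex force and potential, eqs. (86) and (88), at radius r*exp(j*theta_r)."""
    r_bar = r * cmath.exp(1j * theta_r)
    force = -GM / (4.0 * math.pi * r_bar ** 2)
    potential = -GM / (4.0 * math.pi * r_bar)
    return force, potential


GM, r, theta_r = 1.0, 2.0, 0.02  # sample values in arbitrary units
force, potential = spherical_gravity(GM, r, theta_r)

# Measured values from the second forms of eqs. (90) and (91)
r_m = r * math.cos(theta_r)
force_measured = (-GM / (4.0 * math.pi * r_m ** 2)
                  * math.cos(theta_r) ** 2 * math.cos(2.0 * theta_r))
potential_measured = -GM / (4.0 * math.pi * r_m) * math.cos(theta_r) ** 2
```

The real parts of `force` and `potential` agree with `force_measured` and `potential_measured`, which is the content of equations (90) and (91).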

For the two dimensional axisymmetric case the following is the broken
symmetry generalization of the standard scalar result13-17

$\frac{1}{\bar r}\frac{d}{d\bar r}(\bar r\bar F_r) = -G\bar\rho$ (92)

Using equations (65) and (92) gives

$\bar F_r = F_r e^{j\theta_{Fr}} = -G\bar\rho_\ell/(2\pi\bar r)$ (93)

$F_r = -G\rho_\ell/(2\pi r)$

$\theta_{Fr} = -\theta_r - \theta_z - \beta_{zz}$

$\bar W_r = W_r e^{j\theta_{Wr}} = \frac{G}{2\pi}\,\bar\rho_\ell\ln\bar r = \frac{G}{2\pi}\,\bar\rho_\ell\,(\ln r + j\theta_r)$

$W_r = \frac{G}{2\pi}\,\rho_\ell\,(\ln^2 r + \theta_r^2)^{1/2}$ (97)

$\theta_{Wr} = \theta_{\rho\ell} + \tan^{-1}(\theta_r/\ln r)$ (98)

where $\bar\rho_\ell$, $\rho_\ell$ and $\theta_{\rho\ell}$ are given by equations
(68) and (69). The measured gravity force is given by

$F_{rm} = F_r\cos\theta_{Fr} = -G\rho_\ell/(2\pi r)\cos(\theta_r + \theta_z + \beta_{zz})$ (99)

$\;\;= -G\rho_{\ell m}/(2\pi r_m)\,\cos\theta_r\cos(\theta_r + \theta_z + \beta_{zz})/\cos(\theta_z + \beta_{zz})$


The measured gravity potential is given by

$W_{rm} = W_r\cos\theta_{Wr}$ (100)

where $W_r$ and $\theta_{Wr}$ are given by equations (97) and (98) respectively.
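The magnitude and phase of the axisymmetric potential follow from the complex logarithm. The sketch below assumes constant phase angles, a radius magnitude greater than one (so that $\ln r > 0$ and the inverse tangent of equation (98) needs no quadrant correction), and arbitrary units; the function name is hypothetical.

```python
import cmath
import math


def line_mass_potential(G, rho_l, theta_rho_l, r, theta_r):
    """Complex potential (G / 2 pi) * rho_l_bar * ln(r_bar) of a line mass."""
    rho_bar = rho_l * cmath.exp(1j * theta_rho_l)
    r_bar = r * cmath.exp(1j * theta_r)
    return G / (2.0 * math.pi) * rho_bar * cmath.log(r_bar)


G, rho_l, theta_rho_l, r, theta_r = 1.0, 3.0, -0.01, 5.0, 0.03  # sample values
W_bar = line_mass_potential(G, rho_l, theta_rho_l, r, theta_r)

# Closed forms of eqs. (97) and (98)
W_r = G / (2.0 * math.pi) * rho_l * math.hypot(math.log(r), theta_r)
theta_Wr = theta_rho_l + math.atan(theta_r / math.log(r))

# Measured potential, eq. (100)
W_rm = W_r * math.cos(theta_Wr)
```

The measured potential `W_rm` is the real part of `W_bar`, in line with equation (100).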


The  case  of  constant  density  (p  =  constant  and  0p  =  constant)  is  easily 
treated  and  gives  the  following  generalization  of  the  standard  scalar  result  for 
a  sphere  of  radius  R 


Wr  =  p/6(3R 


2  2\  -J0r 

-  r  )e  J  r 


r  ^  R 


(101) 


=  p/6(3R2  -  r2e~2j0r)e'j9r 


where  9r  ~  9r  =  constant.  The  gravitational  force  is  then  given  by 


$\bar F_r = -\partial\bar W_r/\partial\bar r = -\tfrac{1}{3}\,\rho\,r\,e^{-2j\theta_r}$ (102)


Note that for constant density $\bar\rho$, equations (50) through (52) give
$\rho$ = constant, $\theta_r$ = constant, $\beta_{rr} = 0$ and

$M(r) = \tfrac{4\pi}{3}\,r^3\rho$,  $M(R) = \tfrac{4\pi}{3}\,R^3\rho$ (103)

$\bar F_r = -M(R)\,r/(4\pi R^3)\,e^{-2j\theta_r}$,  $r < R$ (104)

$F_r = -\tfrac{1}{3}\,\rho\,r$ (105)

$\theta_{Fr} = -2\theta_r = -2\theta_r(R) = -2\theta_r(0)$ (106)

$F_{rm} = F_r\cos\theta_{Fr} = -\tfrac{1}{3}\,\rho\,r\cos(2\theta_r)$,  $r < R$ (107)

$\;\;= -\tfrac{1}{3}\,\rho_m r_m\cos(2\theta_r)/[\cos\theta_r\cos(3\theta_r)]$

$\rho_m = \rho\cos(\theta_\rho) = \rho\cos(3\theta_r)$,  $r < R$ (108)
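The constant density results can be checked with a few lines of complex arithmetic. The sketch below takes $\theta_\rho = -3\theta_r$ and $\beta_{rr} = 0$ as stated for constant density, and $r_m = r\cos\theta_r$ for the measured radius; the function name and the numbers are illustrative.

```python
import cmath
import math


def interior_force(rho, r, theta_r):
    """Complex force -(1/3) * rho_bar * r_bar inside a constant density sphere."""
    rho_bar = rho * cmath.exp(-3j * theta_r)  # theta_rho = -3 * theta_r
    r_bar = r * cmath.exp(1j * theta_r)
    return -rho_bar * r_bar / 3.0


rho, r, theta_r = 2.5, 0.8, 0.03  # sample values in arbitrary units
F_bar = interior_force(rho, r, theta_r)

# Measured quantities entering the second form of eq. (107)
rho_m = rho * math.cos(3.0 * theta_r)  # eq. (108)
r_m = r * math.cos(theta_r)
F_measured = (-rho_m * r_m / 3.0 * math.cos(2.0 * theta_r)
              / (math.cos(theta_r) * math.cos(3.0 * theta_r)))
```

The phase of `F_bar` reproduces the factor $e^{-2j\theta_r}$ of equation (102), and its real part equals `F_measured`.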


C.  Gauss  Law  for  Broken  Symmetry  Matter 


The form of Gauss's law for matter with broken internal symmetry is obtained
from equation (84) to be12

$\oint \bar{\mathbf F}\cdot\bar{\mathbf n}\,d\bar A = -MG\bar L$ (109)

where

$\bar L = \bar L_s$ for spherical coordinates (110)

$\bar L = \bar L_c$ for cylindrical coordinates (111)

where $\bar L_s$ and $\bar L_c$ are given respectively by equations (48) and
(61). For broken symmetry space a factor $\bar L(\theta_\phi, \theta_\psi)$ must
be included in the right hand side of the Gauss law given by equation (109).

3. MAXWELL'S EQUATIONS. Maxwell's equations are a set of vector differential
equations that characterize the electromagnetic field.14-28 For broken
symmetry space and time the coordinates, differential operators and the field
vectors have internal phase angles, and Maxwell's equations must be written as12

$\bar\nabla\times\bar{\mathbf E} = -\partial\bar{\mathbf B}/\partial\bar t$ (112)

$\bar\nabla\times\bar{\mathbf H} = \bar{\mathbf j} + \partial\bar{\mathbf D}/\partial\bar t$ (113)

$\bar\nabla\cdot\bar{\mathbf B} = 0$ (114)

$\bar\nabla\cdot\bar{\mathbf D} = \bar\rho_q$ (115)

where $\bar{\mathbf E}$ = renormalized complex number electric field vector,
$\bar{\mathbf B}$ = renormalized complex number magnetic induction vector,
$\bar{\mathbf H}$ = renormalized complex number magnetic field vector,
$\bar{\mathbf j}$ = renormalized complex number current density vector,
$\bar{\mathbf D}$ = renormalized complex number electric displacement vector,
and $\bar\rho_q$ = complex number electric charge density. The components of
the electric and magnetic fields can be written as


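$\bar E_\alpha = E_\alpha e^{j\theta_{E\alpha}}$,  $\theta_{E\alpha} = \theta_\alpha - 2\theta_t + 2(\beta_{\alpha\alpha} - \beta_{tt})$

$\bar D_\alpha = D_\alpha e^{j\theta_{D\alpha}}$,  $\theta_{D\alpha} = \theta_\varepsilon + \theta_{E\alpha}$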


20-0 
t  x 


0-0 

y  z 


(116) 

(117) 


20k  -  30 
t  r 
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(118) 


H  -  H  e^0Ha 

0„  » 

a  a 

Ha 

B  *  B  e-'0Ba 

9_  - 

a  a 

Ba 

where  a  and  6  ■  x,  y. 

z  or  r 

-0-6  -  9P 

C  tt  6 


Ha 


9u  “  6r  “  (6x  +  9y  +  9z)/3  (ll9) 


4>  >  41  and  6  may  or  may  not  be  equal  to  a 


The phase angles in equations (116) and (118) are the simplest forms. In fact
$\theta_{H\alpha}$ and $\theta_{E\alpha}$ are mutually involved because they are
determined from equations (112) and (113). For instance, this is shown in
equations (448), (452), (458) and (462) for electromagnetic waves. The measured
field vectors are


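$E_{\alpha m} = E_\alpha\cos\theta_{E\alpha}$,  $D_{\alpha m} = D_\alpha\cos\theta_{D\alpha}$ (120)

$H_{\alpha m} = H_\alpha\cos\theta_{H\alpha}$,  $B_{\alpha m} = B_\alpha\cos\theta_{B\alpha}$ (121)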


A.  Measured  Coordinates  and  Field  Vectors  in  Maxwell's  Equations. 


The component equations corresponding to Maxwell's equations (112) through
(115) appear in Reference 12. For instance, the x component equations
corresponding to equation (112) are


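$\begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Ezy}\,\cos\beta_{yy}\sec\beta_{Ezy}\,\frac{\partial E_z}{\partial y} - \begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Eyz}\,\cos\beta_{zz}\sec\beta_{Eyz}\,\frac{\partial E_y}{\partial z} = -\begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Bxt}\,\cos\beta_{tt}\sec\beta_{Bxt}\,\frac{\partial B_x}{\partial t}$ (122)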


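with similar equations for the y and z vector components, where

$\Phi_{Ezy} = \theta_{Ez} + \beta_{Ezy} - \theta_y - \beta_{yy}$ (123)

$\Phi_{Eyz} = \theta_{Ey} + \beta_{Eyz} - \theta_z - \beta_{zz}$ (124)

$\Phi_{Bxt} = \theta_{Bx} + \beta_{Bxt} - \theta_t - \beta_{tt}$ (125)

$\tan\beta_{Ezy} = (E_z\,\partial\theta_{Ez}/\partial y)/(\partial E_z/\partial y)$ (126)

$\tan\beta_{Eyz} = (E_y\,\partial\theta_{Ey}/\partial z)/(\partial E_y/\partial z)$ (127)

$\tan\beta_{Bxt} = (B_x\,\partial\theta_{Bx}/\partial t)/(\partial B_x/\partial t)$ (128)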

From  equations  (16)  through  (24)  and  equations  (120)  and  (121)  it  follows  that 
the  derivatives  in  equation  (122)  can  be  written  in  terms  of  measured  quantities 
by 


$\partial E_z/\partial y = g_{Ezy}\,\partial E_{zm}/\partial y_m + h_{Ezy}\,E_{zm}$ (129)

$\partial E_y/\partial z = g_{Eyz}\,\partial E_{ym}/\partial z_m + h_{Eyz}\,E_{ym}$ (130)

$\partial B_x/\partial t = g_{Bxt}\,\partial B_{xm}/\partial t_m + h_{Bxt}\,B_{xm}$ (131)

where 


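$g_{Ezy} = \sec\theta_{Ez}\,\partial y_m/\partial y$ (132)

$g_{Eyz} = \sec\theta_{Ey}\,\partial z_m/\partial z$ (133)

$g_{Bxt} = \sec\theta_{Bx}\,\partial t_m/\partial t$ (134)

$h_{Ezy} = \sec\theta_{Ez}\tan\theta_{Ez}\,\partial\theta_{Ez}/\partial y$ (135)

$h_{Eyz} = \sec\theta_{Ey}\tan\theta_{Ey}\,\partial\theta_{Ey}/\partial z$ (136)

$h_{Bxt} = \sec\theta_{Bx}\tan\theta_{Bx}\,\partial\theta_{Bx}/\partial t$ (137)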

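In this way equation (122) can be written in terms of measured quantities

$(a^x_c\,\partial/\partial y_m + d^x_c)\,E_{zm} - (b^x_c\,\partial/\partial z_m + e^x_c)\,E_{ym} = -(f^x_c\,\partial/\partial t_m + k^x_c)\,B_{xm}$ (137A)

$(a^x_s\,\partial/\partial y_m + d^x_s)\,E_{zm} - (b^x_s\,\partial/\partial z_m + e^x_s)\,E_{ym} = -(f^x_s\,\partial/\partial t_m + k^x_s)\,B_{xm}$ (137B)

where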


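$a^x_{c,s} = \begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Ezy}\,\cos\beta_{yy}\sec\beta_{Ezy}\sec\theta_{Ez}\,\partial y_m/\partial y$ (137C)

$b^x_{c,s} = \begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Eyz}\,\cos\beta_{zz}\sec\beta_{Eyz}\sec\theta_{Ey}\,\partial z_m/\partial z$ (137D)

$d^x_{c,s} = \begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Ezy}\,\cos\beta_{yy}\sec\beta_{Ezy}\sec\theta_{Ez}\tan\theta_{Ez}\,\partial\theta_{Ez}/\partial y$ (137E)

$e^x_{c,s} = \begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Eyz}\,\cos\beta_{zz}\sec\beta_{Eyz}\sec\theta_{Ey}\tan\theta_{Ey}\,\partial\theta_{Ey}/\partial z$ (137F)

$f^x_{c,s} = \begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Bxt}\,\cos\beta_{tt}\sec\beta_{Bxt}\sec\theta_{Bx}\,\partial t_m/\partial t$ (137G)

$k^x_{c,s} = \begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Bxt}\,\cos\beta_{tt}\sec\beta_{Bxt}\sec\theta_{Bx}\tan\theta_{Bx}\,\partial\theta_{Bx}/\partial t$ (137H)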

Equations  (137A)  and  (137B)  represent  the  x-component  of  Faraday's  induction  law 
equation  (112)  in  terms  of  measured  field  components  and  measured  space  and  time 
coordinates.  The  conventional  Maxwell  equation  corresponding  to  equations  (137A) 
and  (137B)  is 


$\partial E_{zc}/\partial y_m - \partial E_{yc}/\partial z_m = -\partial B_{xc}/\partial t_m$ (137I)


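The remainder of Maxwell's equations are handled in a similar way. For
instance, equation (115) becomes the following two equations

$\begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Dxx}\,\cos\beta_{xx}\sec\beta_{Dxx}\,\frac{\partial D_x}{\partial x} + \begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Dyy}\,\cos\beta_{yy}\sec\beta_{Dyy}\,\frac{\partial D_y}{\partial y} + \begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\Phi_{Dzz}\,\cos\beta_{zz}\sec\beta_{Dzz}\,\frac{\partial D_z}{\partial z} = \rho_q\begin{Bmatrix}\cos\\ \sin\end{Bmatrix}\theta_{\rho q}$ (138)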

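where

$\Phi_{Dxx} = \theta_{Dx} + \beta_{Dxx} - \theta_x - \beta_{xx}$ (139)

$\Phi_{Dyy} = \theta_{Dy} + \beta_{Dyy} - \theta_y - \beta_{yy}$ (140)

$\Phi_{Dzz} = \theta_{Dz} + \beta_{Dzz} - \theta_z - \beta_{zz}$ (141)

$\tan\beta_{Dxx} = (D_x\,\partial\theta_{Dx}/\partial x)/(\partial D_x/\partial x)$ (142)

$\tan\beta_{Dyy} = (D_y\,\partial\theta_{Dy}/\partial y)/(\partial D_y/\partial y)$ (143)

$\tan\beta_{Dzz} = (D_z\,\partial\theta_{Dz}/\partial z)/(\partial D_z/\partial z)$ (144)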


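The derivatives appearing in equation (138) can be rewritten in terms of
measured quantities as

$\partial D_x/\partial x = g_{Dxx}\,\partial D_{xm}/\partial x_m + h_{Dxx}\,D_{xm}$ (145)

$\partial D_y/\partial y = g_{Dyy}\,\partial D_{ym}/\partial y_m + h_{Dyy}\,D_{ym}$ (146)

$\partial D_z/\partial z = g_{Dzz}\,\partial D_{zm}/\partial z_m + h_{Dzz}\,D_{zm}$ (147)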


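where

$g_{Dxx} = \sec\theta_{Dx}\,\partial x_m/\partial x$ (148)

$g_{Dyy} = \sec\theta_{Dy}\,\partial y_m/\partial y$ (149)

$g_{Dzz} = \sec\theta_{Dz}\,\partial z_m/\partial z$ (150)

$h_{Dxx} = \sec\theta_{Dx}\tan\theta_{Dx}\,\partial\theta_{Dx}/\partial x$ (151)

$h_{Dyy} = \sec\theta_{Dy}\tan\theta_{Dy}\,\partial\theta_{Dy}/\partial y$ (152)

$h_{Dzz} = \sec\theta_{Dz}\tan\theta_{Dz}\,\partial\theta_{Dz}/\partial z$ (153)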


Placing  equations  (145)  through  (147)  into  equations  (138)  gives  the  divergence 
equation  in  terms  of  measured  electric  displacement  and  measured  coordinates 
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(153A) 


m  c  xm 


s 
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pq 
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(153B) 


c  ,s 

S56 
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<t> 

Dxx 
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6xx 

sec 

6Dxx 

sec 

6Dx 

3x  /9x 
m 

(153C) 
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c  ,s 
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cos 

sin 

<j> 

Dyy 

cos 

6yy 

sec 

8Dyy 

sec 
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3y  /3y 

01 

(153D) 
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8zz 
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8Dzz 

sec 
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3z  /3z 
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(153E) 
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cos 
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Dxx 
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8xx 

sec 

8Dxx 

sec 
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Dx 

88dx/3* 

(153F) 
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c,s 

s 
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6yy 

sec 

SDyy 

sec 
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tan  0_ 
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C  ,s 
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cos 

sin 

Dzz 

cos 

8zz 

sec 

8Dzz 

sec 

6Dz 

tan  0n 
Dz 

3V/32 

(153H) 

Equations  (153A)  and  (153B)  represent  the  divergence  form  of  the  Gauss  law  equation 
(115)  expressed  in  terms  of  the  measured  field  components  and  coordinates.  The 
conventional  Maxwell  equation  corresponding  to  equations  (153A)  and  (153B)  is 


$\partial D_{xc}/\partial x_m + \partial D_{yc}/\partial y_m + \partial D_{zc}/\partial z_m = \rho_{qm}$ (153I)

The Maxwell equations (113) and (114) are handled in a similar way.

B. Charge Density

The complex number charge density is given by

$\bar\rho_q = \rho_q e^{j\theta_{\rho q}} = dQ/d\bar V$ (154)


where  charge  is  taken  to  be  a  scalar  having  a  zero  internal  phase  angle.  From 
equation  (154)  it  follows  that 


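$\rho_q = dQ/|d\bar V| = \cos\beta_{VV}\,dQ/dV$ (155)

$\theta_{\rho q} = -\theta_V - \beta_{VV}$ (156)

$\tan\beta_{VV} = V\,\partial\theta_V/\partial V$ (157)

$Q = \int \rho_q\,|d\bar V| = \int \rho_q\sec\beta_{VV}\,dV$ (158)

Case 1: Spherical Coordinates

$\bar\rho_q = d^3Q/(\bar r^2\sin\bar\psi\,d\bar\psi\,d\bar\phi\,d\bar r)$ (159)

$\rho_q = \cos\beta_{rr}\cos\beta_{\phi\phi}\cos\beta_{\psi\psi}\,d^3Q/(S_\psi r^2\,d\psi\,d\phi\,dr)$ (160)

$\theta_{\rho q} = -3\theta_r - \theta_\phi - \theta_\psi - \theta_{s\psi} - \beta_{rr} - \beta_{\phi\phi} - \beta_{\psi\psi}$ (161)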


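where $S_\psi$ and $\theta_{s\psi}$ are given by equations (6) and (8)
respectively, and $\beta_{\alpha\alpha}$ are given by equations (13) through
(15). For spherical symmetry

$\bar\rho_q^{1} = (4\pi\bar r^2\bar L_s)^{-1}\,dQ/d\bar r$ (162)

$\rho_q^{1} = \cos\beta_{rr}\,(4\pi r^2 L_s)^{-1}\,dQ/dr$ (163)

$\theta_{\rho q}^{1} = -3\theta_r - \theta_{Ls} - \beta_{rr}$ (164)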


where $\bar L_s$ is defined in equation (48). The charge density that is used
in the right hand side of the divergence form of Gauss's law given in equation
(115) does not contain the factor $\bar L_s$ because this factor cancels from
the numerator and denominator of the surface and volume integrals that define
the divergence of a vector. For use in the divergence equation (115) the
following definition of charge density for spherical symmetry is used


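$\bar\rho_q = (4\pi\bar r^2)^{-1}\,dQ/d\bar r$ (165)

$\rho_q = \cos\beta_{rr}\,(4\pi r^2)^{-1}\,dQ/dr$ (166)

$\theta_{\rho q} = -3\theta_r - \beta_{rr}$ (167)

$\rho_{qm} = \rho_q\cos\theta_{\rho q} = f_{3r}\,\rho_{qc}$ (168)

where $\rho_{qc}$ = conventionally calculated charge density given by

$\rho_{qc} = (4\pi r_m^2)^{-1}\,dQ/dr_m$ (169)

and where $f_{3r}$ is given by equation (55). From equation (168) it follows that

$(\rho_{qc} - \rho_{qm})/\rho_{qc} = 1 - f_{3r}$ (170)

Case 2: Cylindrical Coordinates

$\bar\rho_q = d^3Q/(\bar r\,d\bar\phi\,d\bar r\,d\bar z) = \rho_q e^{j\theta_{\rho q}}$ (171)

$\rho_q = \cos\beta_{\phi\phi}\cos\beta_{rr}\cos\beta_{zz}\,d^3Q/(r\,d\phi\,dr\,dz)$ (172)

$\theta_{\rho q} = -2\theta_r - \theta_z - \theta_\phi - \beta_{rr} - \beta_{zz} - \beta_{\phi\phi}$ (173)

For cylindrical symmetry

$\bar\rho_q^{1} = (2\pi\bar r\bar L_c)^{-1}\,d^2Q/(d\bar r\,d\bar z)$ (174)

$\rho_q^{1} = \cos\beta_{rr}\cos\beta_{zz}\,(2\pi r)^{-1}\,d^2Q/(dr\,dz)$ (175)

$\theta_{\rho q}^{1} = -2\theta_r - \theta_z - \langle\theta_\phi\rangle - \beta_{rr} - \beta_{zz}$ (176)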


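For cylindrical symmetry the charge density that appears in the divergence
equation (115) does not contain $\bar L_c$ and must be written in the following
form

$\bar\rho_q = (2\pi\bar r)^{-1}\,d^2Q/(d\bar r\,d\bar z) = (2\pi\bar r)^{-1}\,d\bar\rho_{q\ell}/d\bar r$ (177)

where

$\rho_q = \cos\beta_{rr}\cos\beta_{zz}\,(2\pi r)^{-1}\,d^2Q/(dr\,dz)$,  $\theta_{\rho q} = -2\theta_r - \theta_z - \beta_{rr} - \beta_{zz}$ (178)

$\bar\rho_{q\ell} = dQ/d\bar z$ (179)

$\rho_{q\ell} = \cos\beta_{zz}\,dQ/dz$ (180)

$\theta_{\rho q\ell} = -\theta_z - \beta_{zz}$ (181)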


where $\bar\rho_{q\ell}$ = complex number line charge per unit length. The
measured charge density is given by

$\rho_{qm} = \rho_q\cos\theta_{\rho q} = f_{2r}\,\rho_{qc}$ (182)

where $\rho_{qc}$ = conventionally calculated charge density given by

$\rho_{qc} = (2\pi r_m)^{-1}\,d^2Q/(dr_m\,dz_m)$ (183)

where $f_{2r}$ is given by equation (74). The measured line charge density is
given by

$\rho_{q\ell m} = \rho_{q\ell}\cos\theta_{\rho q\ell} = \rho_{q\ell c}\,(dz_m/dz)\cos\beta_{zz}\cos(\theta_z + \beta_{zz})$ (184)

where the conventional line charge density is

$\rho_{q\ell c} = dQ/dz_m$ (185)

Case 3: Rectangular Coordinates

$\bar\rho_q = d^3Q/(d\bar x\,d\bar y\,d\bar z) = \rho_q e^{j\theta_{\rho q}}$ (186)

$\rho_q = \cos\beta_{xx}\cos\beta_{yy}\cos\beta_{zz}\,d^3Q/(dx\,dy\,dz)$ (187)

$\theta_{\rho q} = -\theta_x - \theta_y - \theta_z - \beta_{xx} - \beta_{yy} - \beta_{zz}$ (188)

$\rho_{qm} = \rho_q\cos\theta_{\rho q} = f_{3xyz}\,\rho_{qc}$ (189)


where the conventionally calculated charge density is

$\rho_{qc} = d^3Q/(dx_m\,dy_m\,dz_m)$ (190)

and

$f_{3xyz} = \cos\theta_{\rho q}\cos\beta_{xx}\cos\beta_{yy}\cos\beta_{zz}\,(dx_m/dx)(dy_m/dy)(dz_m/dz)$ (191)

where $d\alpha_m/d\alpha$ is given by equations (19) through (21).

In general the following relationships are valid

$\bar\rho_q = q\bar n$,  $\rho_q = qn$,  $\theta_{\rho q} = \theta_n$ (192)

where q = particle charge and $\bar n$ = complex particle number density. For
use in Maxwell's equations $\bar\rho_q$ in equation (192) does not contain the
internal angular factors $\bar L_s$ and $\bar L_c$, and $\bar\rho_q$ is defined
by equations (165) and (177). For other uses (subsection D) the definition of
$\bar\rho_q$ given in equations (162) and (174) must be used in equation (192),
and $\bar n$ does contain the internal angular factors $\bar L_s$ and
$\bar L_c$ for spherical and cylindrical systems respectively.

C.  Electric  Current 


The electric current for broken symmetry spacetime is given by

$\bar I = I e^{j\theta_I} = dQ/d\bar t$ (193)

$I = \cos\beta_{tt}\,dQ/dt$,  $\theta_I = -\theta_t - \beta_{tt}$ (194)

The current density is given by the following vector component equations

$\bar j_\alpha = d\bar I/d\bar A_\alpha = d^2Q/(d\bar t\,d\bar A_\alpha) = \bar\rho_q\bar v_\alpha$ (195)


where $\alpha$ = x, y, z; r, $\phi$, $\psi$ or r, $\phi$, z; $\bar A_\alpha$ =
broken symmetry area normal to the $\alpha$ direction, and $\bar v_\alpha$ =
broken symmetry electron velocity along the $\alpha$ axis. Equation (195) can
be written in vector form as

$\bar{\mathbf j} = \bar\rho_q\bar{\mathbf v}$ (196)

Writing the current density component as

$\bar j_\alpha = j_\alpha e^{j\theta_{j\alpha}}$ (197)

gives

$j_\alpha = \rho_q v_\alpha$ (198)

$\theta_{j\alpha} = \theta_{\rho q} + \theta_{v\alpha} = \theta_n + \theta_{v\alpha}$ (199)

where $v_\alpha$ and $\theta_{v\alpha}$ are given in equation (28).

For a rectangular geometry equation (195) gives

$\bar j_z = d^2\bar I/(d\bar x\,d\bar y)$ (200)

$j_z = \cos\beta_{xx}\cos\beta_{yy}\,d^2I/(dx\,dy)$ (201)

$\theta_{jz} = -\theta_t - \theta_x - \theta_y - \beta_{tt} - \beta_{xx} - \beta_{yy} \approx -\theta_t - \theta_x - \theta_y$ (202)

which combined with equation (199) gives

$\theta_{vz} = \theta_z + \beta_{zz} - \theta_t - \beta_{tt}$ (203)


which  is  the  result  obtained  in  Reference  12  by  kinematical  reasoning.  It  is 
easy  to  see  that  for  the  current  in  the  x  and  y  directions 


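$j_x = \cos\beta_{yy}\cos\beta_{zz}\,d^2I/(dy\,dz)$ (204)

$\theta_{jx} = -\theta_t - \theta_y - \theta_z - \beta_{tt} - \beta_{yy} - \beta_{zz}$ (205)

$j_y = \cos\beta_{xx}\cos\beta_{zz}\,d^2I/(dx\,dz)$ (206)

$\theta_{jy} = -\theta_t - \theta_x - \theta_z - \beta_{tt} - \beta_{xx} - \beta_{zz}$ (207)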
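The velocity phase angle of equation (203) follows from simple bookkeeping of the internal phase angles. The sketch below subtracts the rectangular charge density phase angle of equation (188) from the z component current density phase angle, using $\theta_{jz} = \theta_{\rho q} + \theta_{vz}$ from equation (199).

```latex
\begin{align*}
\theta_{vz} &= \theta_{jz} - \theta_{\rho q} \\
  &= \left(-\theta_t - \theta_x - \theta_y - \beta_{tt} - \beta_{xx} - \beta_{yy}\right)
   - \left(-\theta_x - \theta_y - \theta_z - \beta_{xx} - \beta_{yy} - \beta_{zz}\right) \\
  &= \theta_z + \beta_{zz} - \theta_t - \beta_{tt}
\end{align*}
```

The x and y terms cancel, leaving only the phase angles of the z coordinate and of time, as expected for a velocity $d\bar z/d\bar t$.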


For spherical systems of current flow the current density is

$\bar j = d^2\bar I/(\bar r^2\sin\bar\psi\,d\bar\phi\,d\bar\psi)$ (208)

$j = \cos\beta_{\phi\phi}\cos\beta_{\psi\psi}\,d^2I/(r^2 S_\psi\,d\phi\,d\psi)$ (209)

$\theta_j = \theta_I - 2\theta_r - \theta_{s\psi} - \theta_\phi - \theta_\psi - \beta_{\psi\psi} - \beta_{\phi\phi}$ (210)

where $\theta_I$ is given in equation (194). For spherically symmetric systems
equation (208) becomes

$\bar j^{1} = \bar I/(4\pi\bar r^2\bar L_s)$ (211)

$j^{1} = I/(4\pi r^2 L_s)$,  $\theta_j^{1} = \theta_I - 2\theta_r - \theta_{Ls}$ (212)


But the current that appears in the curl equation (113) does not contain the
internal angular factor $\bar L_s$ because this factor drops out of the
definition of the curl of a vector in broken symmetry space. Therefore for
applications to Maxwell's equations the following definition of radially
symmetric current density must be used

$\bar j = \bar I/(4\pi\bar r^2)$ (213)

$j = I/(4\pi r^2)$,  $\theta_j = \theta_I - 2\theta_r$ (214)

where $\theta_I$ is given by equation (194).

Similarly, for a cylindrical geometry the current density is given by

    $\bar{j} = d^2\bar{I}/(\bar{r}\, d\bar{\phi}\, d\bar{r})$    (215)

    $j = \cos\beta_{\phi\phi} \cos\beta_{rr}\, d^2I/(r\, d\phi\, dr)$,  $\theta_j = \theta_I - 2\theta_r - \theta_\phi - \beta_{rr} - \beta_{\phi\phi}$    (216)

For axial symmetry this becomes

    $\bar{j}' = d\bar{I}/(2\pi \bar{r} \bar{L}_c\, d\bar{r})$    (217)

    $j' = dI/(2\pi r\, dr)$    (218)

    $\theta'_j = \theta_I - 2\theta_r - \beta_{rr} - \langle\theta_\phi\rangle$    (219)

where $\theta_I$ is given by equation (194). Again, this is not the current that
appears in Maxwell's equation (113), which does not contain the internal angle
$\langle\theta_\phi\rangle$. For application in equation (113) the following current is used for
axial symmetry.

    $\bar{j} = d\bar{I}/(2\pi \bar{r}\, d\bar{r})$    (220)

    $j = dI/(2\pi r\, dr)$,  $\theta_j = \theta_I - 2\theta_r - \beta_{rr}$    (221)

D.  Asymmetric  Constitutive  Equations 

Constitutive equations relate $\bar{D}$ to $\bar{E}$ for electric fields in broken symmetry
matter and $\bar{B}$ to $\bar{H}$ for magnetic fields located in broken symmetry matter.

Case 1. Electric Fields

The generalization of the standard scalar result is (References 18-28)

    $\bar{D} = \epsilon_0 \bar{E} + \bar{P} = \bar{\epsilon} \bar{E}$    (222)

where $\epsilon_0$ = permittivity of the vacuum, $\bar{P}$ = complex number polarization vector
for broken symmetry matter, and $\bar{\epsilon}$ = complex number permittivity for broken
symmetry matter. The complex number polarization vector is given by

    $\bar{P} = \epsilon_0 \bar{\chi}_e \bar{E}$    (223)

where $\bar{\chi}_e$ = complex number electric susceptibility. Then

    $\bar{P}_a = P_a e^{j\theta_{Pa}} = \epsilon_0 \bar{\chi}_e \bar{E}_a$    (224)

    $P_a = \epsilon_0 \chi_e E_a$    (225)

    $\theta_{Pa} = \theta_{\chi e} + \theta_{Ea}$    (226)

where a = x, y, z. From equations (222) and (223) it follows that

    $\bar{\epsilon} = \epsilon_0 (1 + \bar{\chi}_e)$    (227)

Writing

    $\bar{\epsilon} = \epsilon e^{j\theta_\epsilon}$    (228)

    $\bar{\chi}_e = \chi_e e^{j\theta_{\chi e}}$    (229)

allows equation (227) to be written as

    $\epsilon \cos\theta_\epsilon = \epsilon_0 + \epsilon_0 \chi_e \cos\theta_{\chi e}$    (230)

    $\epsilon \sin\theta_\epsilon = \epsilon_0 \chi_e \sin\theta_{\chi e}$    (231)

Equations (230) and (231) give

    $\tan\theta_\epsilon = (\chi_e \sin\theta_{\chi e})/(1 + \chi_e \cos\theta_{\chi e})$    (232)

    $(\epsilon/\epsilon_0)^2 = 1 + \chi_e^2 + 2\chi_e \cos\theta_{\chi e}$    (233)

In general $\theta_\epsilon \ne \theta_{\chi e} \ne 0$.
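
Equations (227) through (233) can be checked numerically. The following Python sketch is not part of the original analysis; the function names and the sample values of $\chi_e$ and $\theta_{\chi e}$ are illustrative assumptions. It forms the complex permittivity directly from equation (227) and compares its magnitude and phase with the closed forms (233) and (232).

```python
import cmath
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m


def permittivity_direct(chi_e, theta_chi):
    """Magnitude and phase of eps0 * (1 + chi_e * exp(j*theta_chi)), equation (227)."""
    eps_bar = EPS0 * (1.0 + cmath.rect(chi_e, theta_chi))
    return abs(eps_bar), cmath.phase(eps_bar)


def permittivity_closed_form(chi_e, theta_chi):
    """Magnitude from equation (233) and phase from equation (232)."""
    eps = EPS0 * math.sqrt(1.0 + chi_e**2 + 2.0 * chi_e * math.cos(theta_chi))
    theta_eps = math.atan2(chi_e * math.sin(theta_chi),
                           1.0 + chi_e * math.cos(theta_chi))
    return eps, theta_eps


# Hypothetical sample values, chosen only for illustration.
chi_e, theta_chi = 2.0, 0.3
eps_a, theta_a = permittivity_direct(chi_e, theta_chi)
eps_b, theta_b = permittivity_closed_form(chi_e, theta_chi)
```

For these sample values the phase of the permittivity comes out smaller than the phase of the susceptibility, which illustrates the statement that the two angles differ in general.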

In order to understand why $\theta_\epsilon \ne \theta_{\chi e}$ one can use the fact that the
polarization vector is equal to the dipole moment per unit volume as follows for
broken symmetry matter

    $\bar{P}_a = \bar{n} q \bar{d}_a = \bar{\rho}_q \bar{d}_a$    (234)

where $\bar{d}_a$ = broken symmetry value of the a component of the atomic dipole
distance. Combining equations (224) and (234) gives

    $\epsilon_0 \chi_e E_a = n q d_a$    (235)

    $\theta_{\chi e} + \theta_{Ea} = \theta_{\rho q} + \theta_{da}$    (236)

Combining equations (188) and (236) and choosing the dipole to be oriented in
the z direction gives the following approximation

    $\theta_{\chi e} \approx -\theta_{Ez} - \theta_x - \theta_y + (\theta_{dz} - \theta_z)$    (237)

From the theory of the simple capacitor the following simple generalization
holds for broken symmetry matter (References 18-28)

    $\bar{D}_z = \bar{\epsilon} \bar{E}_z = d^2Q/(d\bar{x}\, d\bar{y})$    (238)

from which

    $D_z = \epsilon E_z = \cos\beta_{xx} \cos\beta_{yy}\, d^2Q/(dx\, dy)$    (239)

    $\theta_{Dz} = \theta_\epsilon + \theta_{Ez} = -\theta_x - \theta_y - \beta_{xx} - \beta_{yy} \approx -\theta_x - \theta_y$    (240)

Combining equations (237) and (240) gives

    $\theta_{\chi e} \approx \theta_\epsilon + (\theta_{dz} - \theta_z)$    (241)

Therefore because $\theta_{dz}$ on the atomic scale (due to electromagnetic forces)
differs from $\theta_z$ on the macroscopic scale (due to gravity) it follows that
$\theta_{\chi e} \ne \theta_\epsilon$.

Case  2.  Magnetic  Fields 


The generalization of the scalar constitutive equation for magnetic fields
in broken symmetry matter can be written as (References 18-28)

    $\bar{B} = \mu_0 (\bar{H} + \bar{M}) = \bar{\mu} \bar{H}$    (242)

where $\mu_0$ = magnetic permeability for the vacuum, $\bar{M}$ = magnetic polarization
vector for broken symmetry matter, and $\bar{\mu}$ = complex number magnetic permeability
for broken symmetry matter. The complex number magnetization vector is given
by (References 18-28)

    $\bar{M} = \bar{\chi}_M \bar{H}$    (243)

where $\bar{\chi}_M$ = complex number magnetic susceptibility. Equation (243) can be
written as

    $\bar{M}_a = M_a e^{j\theta_{Ma}} = \chi_M H_a e^{j(\theta_{\chi M} + \theta_{Ha})}$    (244)

    $M_a = \chi_M H_a$    (245)

    $\theta_{Ma} = \theta_{\chi M} + \theta_{Ha}$    (246)

where $a = r, \phi, z$ for cylindrical coordinates. From equations (242) and
(243) it follows that

    $\bar{\mu} = \mu_0 (1 + \bar{\chi}_M)$    (247)

Writing $\bar{\mu}$ and $\bar{\chi}_M$ in complex form as

    $\bar{\mu} = \mu e^{j\theta_\mu}$    (248)

    $\bar{\chi}_M = \chi_M e^{j\theta_{\chi M}}$    (249)

allows equation (247) to be written as

    $\tan\theta_\mu = (\chi_M \sin\theta_{\chi M})/(1 + \chi_M \cos\theta_{\chi M})$    (250)

    $(\mu/\mu_0)^2 = 1 + \chi_M^2 + 2\chi_M \cos\theta_{\chi M}$    (251)

From equation (242) it follows that

    $B_a = \mu H_a$    (252)

    $\theta_{Ba} = \theta_\mu + \theta_{Ha}$    (253)

Equation (250) shows that $\theta_\mu \ne \theta_{\chi M}$. This can be understood from an atomic
basis by considering the fact that the magnetization vector is equal to the
magnetic dipole moment per unit volume, or in component form

    $\bar{M}_a = \bar{n} \bar{m}^d_a = \bar{\chi}_M \bar{H}_a$    (254)

where $\bar{m}^d_a$ = components of the atomic magnetic dipole moment $\bar{m}^d$, and where
$\bar{n}$ = complex number of dipole moments per unit volume, where now the angular
factors $\langle\theta_\phi\rangle$ and $\langle\theta_\psi\rangle$ are included for spherically symmetric systems and
the factor $\langle\theta_\phi\rangle$ is included for axially symmetric systems. Therefore for
spherically symmetric systems the $\bar{n}$ that appears in equation (254) is given by

    $\bar{n}' = n' e^{j\theta'_n} = dN/d\bar{V} = (4\pi \bar{r}^2 \bar{L}_s)^{-1}\, dN/d\bar{r}$    (255)

    $n' = (4\pi r^2 L_s)^{-1} \cos\beta_{rr}\, dN/dr$    (256)

    $\theta'_n = -3\theta_r - \theta_{Ls} - \beta_{rr}$    (257)


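and for axial symmetry by

    $\bar{n}' = dN/d\bar{V} = (2\pi \bar{r} \bar{L}_c)^{-1}\, d^2N/(d\bar{r}\, d\bar{z})$    (258)

    $n' = (2\pi r)^{-1} \cos\beta_{rr} \cos\beta_{zz}\, d^2N/(dr\, dz)$    (259)

    $\theta'_n = -2\theta_r - \theta_z - \langle\theta_\phi\rangle - \beta_{rr} - \beta_{zz}$    (260)

The general case for the number density of a spherical system is

    $\bar{n} = d^3N/(\bar{r}^2 \sin\bar{\psi}\, d\bar{\psi}\, d\bar{\phi}\, d\bar{r})$    (261)

    $n = \cos\beta_{\phi\phi} \cos\beta_{\psi\psi} \cos\beta_{rr}\, d^3N/(r^2 S_\psi\, d\psi\, d\phi\, dr)$    (262)

    $\theta_n = -3\theta_r - \theta_{s\psi} - \theta_\psi - \theta_\phi - \beta_{\psi\psi} - \beta_{\phi\phi} - \beta_{rr}$    (263)

while for cylindrical coordinates

    $\bar{n} = d^3N/(\bar{r}\, d\bar{\phi}\, d\bar{r}\, d\bar{z})$    (264)

    $n = \cos\beta_{\phi\phi} \cos\beta_{rr} \cos\beta_{zz}\, d^3N/(r\, d\phi\, dr\, dz)$    (265)

    $\theta_n = -2\theta_r - \theta_z - \theta_\phi - \beta_{\phi\phi} - \beta_{zz} - \beta_{rr}$    (266)

and for rectangular coordinates

    $\bar{n} = d^3N/(d\bar{x}\, d\bar{y}\, d\bar{z})$    (267)

    $n = \cos\beta_{xx} \cos\beta_{yy} \cos\beta_{zz}\, d^3N/(dx\, dy\, dz)$    (268)

    $\theta_n = -\theta_x - \theta_y - \theta_z - \beta_{xx} - \beta_{yy} - \beta_{zz}$    (269)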


Consider a current in a wire situated in the z direction of a broken symmetry
spacetime. The magnetic field is in the azimuthal direction and equation
(254) can be written as

    $\bar{M}_\phi = M_\phi e^{j\theta_{M\phi}} = \bar{n}' \bar{m}^d_\phi$    (270)

where the atomic magnetic moment is given by the following generalization of
the standard result (References 18-28)

    $\bar{m}^d_\phi = \pi \bar{r}_d^2 \bar{L}_{cd} \bar{I}_d = \pi \bar{r}_d^2 \bar{I}_d e^{j\langle\theta^d_\phi\rangle}$    (271)

where $\bar{r}_d$ = complex number atomic radius, $\bar{L}_{cd}$ = internal angular factor for atomic
current loop given by equation (61), $\bar{I}_d$ = complex number atomic electron current,
and $\langle\theta^d_\phi\rangle$ = average value of the internal phase angle of the broken symmetry
azimuthal angle of the atomic current loop. Equation (271) is based on the fact
that the broken symmetry area of the atomic electron orbit is (Reference 12)

    $\bar{A}_d = \pi \bar{r}_d^2 \bar{L}_{cd} = \pi \bar{r}_d^2 e^{j\langle\theta^d_\phi\rangle}$    (272)

From equations (270) and (271) it follows that

    $M_\phi = \chi_M H_\phi = \pi r_d^2 n' I_d$    (273)

    $\theta_{M\phi} = \theta_{\chi M} + \theta_{H\phi} = \theta'_n + 2\theta^d_r + \theta^d_I + \langle\theta^d_\phi\rangle$    (274)

where the following relations were used

    $\bar{r}_d = r_d e^{j\theta^d_r}$    (275)

    $\bar{I}_d = I_d e^{j\theta^d_I}$    (276)

and where from equation (194) $\theta^d_I = -\theta^d_t$ where $\theta^d_t$ = internal phase angle of the
atomic time. Combining equations (273) and (274) with equations (260), (407)
and (408) gives

    $\chi_M = \pi r_d^2 n' (2\pi r) I_d/I_z$    (277)

    $\theta_{\chi M} = \theta_{M\phi} - \theta_{H\phi} \approx (2\theta^d_r - \theta_r - \theta_z) + (\langle\theta^d_\phi\rangle - \langle\theta_\phi\rangle) + (\theta_t - \theta^d_t)$    (278)

where the following results derived later in equations (407) and (408) were used

    $H_\phi = I_z/(2\pi r)$    (279)

    $\theta_{H\phi} \approx -\theta_t - \theta_r$    (280)


From equation (278) it is clear that $\theta_{\chi M} \ne 0$ only if the internal phase angles
of the coordinates at the atomic scale are different from the internal phase
angles at macroscopic distances. At the atomic scale $\theta^d_r$, $\langle\theta^d_\phi\rangle$ and $\theta^d_t$ are
determined mainly by electric forces, while at macroscopic distances $\theta_r$, $\langle\theta_\phi\rangle$ and
$\theta_t$ are determined by gravitation. Note that for superconductors it is expected that
the internal phase angles over macroscopic distances $\theta_r$, $\langle\theta_\phi\rangle$ and $\theta_t$ are
determined mainly by coherent electrical forces and $\theta_{\chi M} \approx 0$. The analysis in this
section shows the importance of axially symmetric and spherically symmetric
densities that appear in equations (162), (174), (255) and (258). The measured
value of $\chi_M$ is given generally by $\chi_{Mm} = \chi_M \cos\theta_{\chi M}$.

E.  Broken  Symmetry  Form  of  Ohm's  Law.


The current in a wire is driven by a potential difference (as produced by
a battery for instance) which is related to the electric field by the following
broken symmetry generalization of the standard scalar result (References 18-28)

    $\bar{W} = \bar{W}_a = \int \bar{\mathbf{E}} \cdot d\bar{\mathbf{s}} = \bar{E}_a \bar{a}$    (281)

for the case of a constant electric field in the a direction. From equation
(281) it follows that

    $W_a = E_a a$

    $\theta_{Wa} = \theta_{Ea} + \theta_a$    (282)
    $\quad = [\theta_a - 2\theta_t + 2(\beta_{aa} - \beta_{tt})] + \theta_a$
    $\quad = 2(\theta_a + \beta_{aa} - \theta_t - \beta_{tt})$
    $\quad \approx 2(\theta_a - \theta_t)$

The current density is then given by the broken symmetry form of Ohm's law

    $\bar{j}_a = \bar{\sigma}_a \bar{E}_a$    (283)

where the complex number electric conductivity is

    $\bar{\sigma}_a = \sigma_a e^{j\theta_{\sigma a}}$    (284)

so that

    $j_a = \sigma_a E_a$    (285)

    $\theta_{ja} = \theta_{\sigma a} + \theta_{Ea} \approx \theta_{\sigma a} + \theta_a - 2\theta_t$    (286)

Combining equations (199) and (286) gives

    $\theta_{\sigma z} = \theta_{jz} - \theta_{Ez} = \theta_n + \theta_{vz} - \theta_{Ez}$    (287)

for the current in the z direction. Combining equations (203), (266), (269),
(282) and (287) gives the internal phase angle of the conductivity in the z
direction as

    $\theta_{\sigma z} \approx -\theta_t + \theta_z - \theta_x - \theta_y - \theta_{Wz}$    (288)
    $\quad \approx \theta_t - \theta_x - \theta_y - \theta_z$
    $\quad = \theta_t + \theta_n$

where $\beta_{aa}$ has been neglected. For current in the x and y directions respectively

    $\theta_{\sigma x} \approx -\theta_t + \theta_x - \theta_y - \theta_z - \theta_{Wx}$    (289)
    $\quad \approx \theta_t - \theta_x - \theta_y - \theta_z$
    $\quad = \theta_t + \theta_n$

    $\theta_{\sigma y} \approx -\theta_t + \theta_y - \theta_x - \theta_z - \theta_{Wy}$    (290)
    $\quad \approx \theta_t - \theta_x - \theta_y - \theta_z$
    $\quad = \theta_t + \theta_n$

Therefore $\theta_{\sigma x} = \theta_{\sigma y} = \theta_{\sigma z}$ and their common value can be called $\theta_\sigma$. The
measured voltage, current, current density and conductivity are given respectively
by

    $W_{am} = W_a \cos\theta_{Wa} \approx W_a \cos[2(\theta_a - \theta_t)]$    (291)

    $I_{am} = I_a \cos\theta_{Ia} = I_a \cos\theta_t$    (292)

    $j_{zm} = j_z \cos\theta_{jz}$    (293)

    $\sigma_{am} = \sigma_a \cos\theta_{\sigma a}$    (294)

where $\theta_{jz}$ and $\theta_{\sigma a}$ are given by equations (202) and (288) through (290)
respectively.
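
The cancellation that makes the conductivity phase angle independent of direction can be traced numerically. The sketch below is an illustration added here, not part of the original derivation: it evaluates equation (287) for each axis with all $\beta$ terms neglected, and it assumes that equation (203) carries over to the x and y directions by symmetry. The phase-angle values are hypothetical.

```python
def conductivity_phase(axis, theta, theta_t):
    """Internal phase angle of the conductivity along `axis`, equations (287)-(290).

    `theta` maps 'x', 'y', 'z' to coordinate phase angles (radians);
    all beta terms are neglected, as in the text.
    """
    theta_n = -(theta["x"] + theta["y"] + theta["z"])  # equation (269)
    theta_v = theta[axis] - theta_t                    # equation (203), assumed to hold for x and y too
    theta_w = 2.0 * (theta[axis] - theta_t)            # equation (282)
    theta_e = theta_w - theta[axis]                    # from theta_W = theta_E + theta_a
    return theta_n + theta_v - theta_e                 # equation (287)


# Hypothetical phase angles in radians, for illustration only.
theta = {"x": 0.02, "y": -0.05, "z": -0.10}
theta_t = -0.15
phases = {axis: conductivity_phase(axis, theta, theta_t) for axis in "xyz"}
expected = theta_t - sum(theta.values())  # theta_t + theta_n
```

All three entries of `phases` reduce to $\theta_t + \theta_n$, because the axis-dependent terms in $\theta_{va}$ and $\theta_{Ea}$ cancel.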


The resistance of a wire located in a broken symmetry spacetime is given
by the following generalization of the standard result

    $\bar{R}_a = R_a e^{j\theta_{Ra}} = \bar{L}_a/(\bar{\sigma}_a \bar{A}_a) = L_a/(\sigma_a A_a)\, e^{j(\theta_{La} - \theta_{\sigma a} - \theta_{Aa})}$    (295)

where $\bar{R}_a$ = complex number resistance of a wire of complex number length $\bar{L}_a$
situated in the a direction, and complex number cross sectional area $\bar{A}_a$
perpendicular to the a direction. Equation (295) is equivalent to

    $R_a = L_a/(\sigma_a A_a)$    (296)

    $\theta_{Ra} = \theta_{La} - \theta_{\sigma a} - \theta_{Aa}$    (297)

and the measured value of the resistance of a wire situated in the a direction
is given by

    $R_{am} = R_a \cos\theta_{Ra}$    (298)

Using equations (288) through (290) and the following simple results

    $\theta_{Lx} = \theta_x$,  $\theta_{Ly} = \theta_y$,  $\theta_{Lz} = \theta_z$    (299)

    $\theta_{Ax} = \theta_y + \theta_z$,  $\theta_{Ay} = \theta_x + \theta_z$,  $\theta_{Az} = \theta_x + \theta_y$    (300)

in equation (297) shows that

    $\theta_{Ra} = \theta_{Wa} + \theta_t \approx 2(\theta_a - \theta_t) + \theta_t = 2\theta_a - \theta_t$    (301)

for a = x, y, z. Equation (301) becomes clear when it is realized that in
general

    $\bar{R}_a = \bar{W}_a/\bar{I}_a$    (302)

so that

    $R_a = W_a/I_a$,  $\theta_{Ra} = \theta_{Wa} - \theta_{Ia} = \theta_{Wa} + \theta_t \approx 2\theta_a - \theta_t$    (303)

The measured resistance is given by equations (291), (292) and (303) to be

    $R_{am} = R_a \cos\theta_{Ra} = (W_a/I_a) \cos(\theta_{Wa} + \theta_t)$    (304)
    $\quad = (W_{am}/I_{am})(1 - \tan\theta_t \tan\theta_{Wa}) \cos^2\theta_t$
    $\quad = (W_{am}/I_{am})\{1 - \tan\theta_t \tan[2(\theta_a - \theta_t)]\} \cos^2\theta_t$

where $W_{am}$ and $I_{am}$ are given by equations (291) and (292). Equation (304) is a
broken symmetry form of Ohm's law.


The conventionally calculated resistance is given by

    $R_{ac} = W_{am}/I_{am}$    (305)

and therefore a comparison of the measured and conventionally calculated
resistance is given by

    $\delta_a = (R_{ac} - R_{am})/R_{ac} = \sin\theta_t \cos\theta_t (\tan\theta_t + \tan\theta_{Wa})$    (306)
    $\quad \approx \theta_t(\theta_t + \theta_{Wa}) = \theta_t \theta_{Ra} = \theta_t(2\theta_a - \theta_t)$    (small angles)

A measurement of $\delta_a$ may possibly be used to determine $\theta_t$ and $\theta_a$ for the local
spacetime in the vicinity of the experiment. The internal phase angle of a
battery voltage $\theta_{Wa}$ for the potential difference $W_a$ applied in the a direction
is associated with the operation of a battery in matter with a gravitational
field present. The measured resistance given in equation (304) depends on $\theta_t$
and $\theta_a$, and therefore the measured resistance will depend on the orientation
of the wire relative to a gravitational field even when $W_{am}/I_{am}$ have the same
values for a = x, y, z. For gravity at the earth's surface, the Pound-Rebka-
Snider experiment gives the values $\theta_z = -5.7°$ and $\theta_t = -3/2\,(5.7°) = -8.55°$ (Reference 12).
Accurate measurements of $R_{am}$, $W_{am}$ and $I_{am}$ for various orientations of a wire
relative to the direction of the earth's gravity field may possibly be used to
determine $\theta_a$ and $\theta_t$ through the application of equation (304) and the relation
$\theta_t = (3/2)\theta_r$ which holds for a massive particle falling in a gravitational field.
In fact even the measured voltage of a battery itself determined by equation
(291) for various inclinations relative to the earth's gravity field may be used
to determine $\theta_a$ and $\theta_t$. On the other hand the measured conductivities given
by equation (294) will have the same values irrespective of the orientation of
the wire with respect to the gravity field if $\sigma_x = \sigma_y = \sigma_z$ because their
internal phase angles satisfy the relation $\theta_{\sigma x} = \theta_{\sigma y} = \theta_{\sigma z}$.
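
To give a sense of the size of the effect implied by equations (304) through (306), the following Python sketch evaluates the fractional resistance difference for a vertical wire using the phase angles quoted above ($\theta_z = -5.7°$, $\theta_t = -8.55°$). It is an illustrative calculation only; the function names are assumptions, and $\theta_{Wa}$ is taken from the approximate form of equation (282).

```python
import math


def theta_w(theta_a, theta_t):
    """Voltage phase angle, approximate form of equation (282)."""
    return 2.0 * (theta_a - theta_t)


def resistance_ratio(theta_a, theta_t):
    """R_am / R_ac from equations (304) and (305)."""
    tw = theta_w(theta_a, theta_t)
    return (1.0 - math.tan(theta_t) * math.tan(tw)) * math.cos(theta_t) ** 2


def delta(theta_a, theta_t):
    """Fractional difference (R_ac - R_am) / R_ac, exact form of equation (306)."""
    tw = theta_w(theta_a, theta_t)
    return math.sin(theta_t) * math.cos(theta_t) * (math.tan(theta_t) + math.tan(tw))


def delta_small_angle(theta_a, theta_t):
    """Small-angle form of equation (306)."""
    return theta_t * (2.0 * theta_a - theta_t)


# Phase angles quoted in the text for gravity at the earth's surface.
theta_z = math.radians(-5.7)
theta_t = math.radians(-8.55)
delta_vertical = delta(theta_z, theta_t)
delta_vertical_approx = delta_small_angle(theta_z, theta_t)
```

The exact and small-angle forms agree closely for these angles, and `1 - resistance_ratio(theta_z, theta_t)` reproduces `delta_vertical`, which confirms that equation (306) follows from (304) and (305).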

An examination of equation (304) shows that it is possible for the measured
resistance to have a zero value resulting in a superconducting state. This
occurs when $R_{am} = 0$ in equation (304) and

    $\tan\theta_t = \cot\theta_{Wa}$,  $\theta_t = \pi/2 - \theta_{Wa}$    (306A)

    $2\theta_a - \theta_t = \pi/2$,  $\theta_t = \pi/2 - 2(\theta_a - \theta_t)$

Therefore in a broken symmetry state the degree of coherency of time in matter
may be sufficiently large so as to allow equation (306A) to be satisfied. A
form of superconductivity is therefore possible if time in matter can form a
coherent state. This may occur for all matter when the gravitational field is
so large, as in the case of compact stars such as white dwarfs and neutron
stars, that a coherent time state described by equation (306A) can form. For
a vertical wire, the result of combining equation (36A) and (306A) gives the
following values of the internal phase angles for gravitationally induced
superconductivity.

    $\theta_r = -\pi$,  $\theta_t = -3\pi/2$    (306B)

On the other hand, even at ordinary pressures and densities it may be possible
for the atomic and molecular structure to be of such a form that the space and
time fields are coherent. For this case the charge carrying Cooper pairs
are essentially free particles with $\theta_a = 2\theta_t$, and this combined with equation
(306A) yields the following condition for structurally induced superconductivity

    $\theta_a = \pi/3$,  $\theta_t = \pi/6$    (306C)

The planar structure of the copper oxides that exhibit high temperature
superconductivity may in fact be a structure that allows a large value of $\theta_t$ to
arise from electromagnetic fields as in equation (41). In this case, time may
exhibit a coherent state due to electrical forces and equation (306C) may
describe high temperature superconductivity if $\theta_t$ is temperature dependent and
becomes large below a transition temperature. For this coherent time state an
increment of time would correspond to an internal phase rotation of the form
$\Delta t = t\theta_t$. The two time scales $t$ and $t\theta_t$ correspond to different energy scales.
The relative energies of superconductors are described by the normalized
superconductivity energy gap

    $\Delta' = 2\Delta/(kT_c)$    (306D)

where $\Delta$ = superconductivity gap corresponding to the transition temperature $T_c$.
Then a simple analysis using the Heisenberg uncertainty principle shows that the
relationship between the normalized superconductivity energy gap for the coherent
time state $\Delta'_{ct}$ and the conventional normalized energy gap for incoherent time
$\Delta'_{it}$ is given by

    $\Delta'_{ct} = \Delta'_{it}/\theta_t = (6/\pi)\Delta'_{it} \approx 1.91\,\Delta'_{it}$    (306E)

corresponding to $\theta_t = \pi/6$ for superconductivity in a collection of free
particles or holes. The result in equation (306E) agrees with measured
superconductivity gaps for the planar cuprous oxide high $T_c$ structures (Reference 30). It should be
noted that the normalized energy gap for a gravitationally induced superconductor
is much smaller and is obtained from equation (306B) to be

    $\Delta'_{ct} = 2/(3\pi)\,\Delta'_{it} \approx 0.21\,\Delta'_{it}$    (306F)

Equations (304) and (305) show that conventional superconductivity occurs when $R_{ac} = 0$.
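
The zero-resistance conditions (306B) and (306C) and the gap ratios (306E) and (306F) involve only elementary arithmetic, which the following Python sketch reproduces. It is an illustrative check, not part of the original text; the variable names are assumptions, and the gap ratio is taken as $1/|\theta_t|$, which is how equation (306F) appears to follow from (306B).

```python
import math


def cos_theta_r(theta_a, theta_t):
    """cos(theta_Ra) with theta_Ra = 2*theta_a - theta_t, equations (298) and (301)."""
    return math.cos(2.0 * theta_a - theta_t)


# Structurally induced case, equation (306C): theta_a = 2*theta_t.
theta_t_struct = math.pi / 6.0
theta_a_struct = 2.0 * theta_t_struct

# Gravitationally induced case for a vertical wire, equation (306B).
theta_r_grav = -math.pi
theta_t_grav = -1.5 * math.pi

# Gap ratios Delta'_ct / Delta'_it, equations (306E) and (306F).
# Assumption: the magnitude of theta_t is used in the gravitational case.
gap_ratio_struct = 1.0 / abs(theta_t_struct)
gap_ratio_grav = 1.0 / abs(theta_t_grav)
```

In both cases $\cos\theta_{Ra}$ vanishes to rounding error, so the measured resistance $R_{am} = R_a \cos\theta_{Ra}$ of equation (298) is zero.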
F.  Scalar  and  Vector  Potentials. 

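The complex number scalar potential $\bar{\phi}$ and the complex number vector
potential $\bar{A}$ are defined by the following generalizations

    $\bar{B} = \bar{\nabla} \times \bar{A}$    (307)

    $\bar{E} = -\bar{\nabla}\bar{\phi} - \partial\bar{A}/\partial\bar{t}$    (308)

where

    $\bar{A}_a = A_a e^{j\theta_{Aa}}$    (309)

For cartesian coordinates equation (307) is written as

    $\bar{B}_x = \partial\bar{A}_z/\partial\bar{y} - \partial\bar{A}_y/\partial\bar{z}$    (310)

    $\bar{B}_y = \partial\bar{A}_x/\partial\bar{z} - \partial\bar{A}_z/\partial\bar{x}$    (311)

    $\bar{B}_z = \partial\bar{A}_y/\partial\bar{x} - \partial\bar{A}_x/\partial\bar{y}$    (312)

The real and imaginary parts of equation (310) are written as

    $B_x \cos\theta_{Bx} = \cos\phi_{Azy} \cos\beta_{yy} \sec\beta_{Azy}\, \partial A_z/\partial y - \cos\phi_{Ayz} \cos\beta_{zz} \sec\beta_{Ayz}\, \partial A_y/\partial z$    (313)

    $B_x \sin\theta_{Bx} = \sin\phi_{Azy} \cos\beta_{yy} \sec\beta_{Azy}\, \partial A_z/\partial y - \sin\phi_{Ayz} \cos\beta_{zz} \sec\beta_{Ayz}\, \partial A_y/\partial z$    (314)

    $\phi_{Azy} = \theta_{Az} + \beta_{Azy} - \theta_y - \beta_{yy}$    (315)

    $\phi_{Ayz} = \theta_{Ay} + \beta_{Ayz} - \theta_z - \beta_{zz}$    (316)

    $\tan\beta_{Azy} = (A_z\, \partial\theta_{Az}/\partial y)/(\partial A_z/\partial y)$    (317)

    $\tan\beta_{Ayz} = (A_y\, \partial\theta_{Ay}/\partial z)/(\partial A_y/\partial z)$    (318)

Equations (311) and (312) can be handled in a similar manner. From equation
(308) it follows that

    $\bar{E}_a = -\partial\bar{\phi}/\partial\bar{a} - \partial\bar{A}_a/\partial\bar{t}$    (319)

where a = x, y, z. For a = x, the real and imaginary parts of equation (319)
are

    $E_x \cos\theta_{Ex} = -\cos\phi_{\phi x} \cos\beta_{xx} \sec\beta_{\phi x}\, \partial\phi/\partial x - \cos\phi_{Axt} \cos\beta_{tt} \sec\beta_{Axt}\, \partial A_x/\partial t$    (320)

    $E_x \sin\theta_{Ex} = -\sin\phi_{\phi x} \cos\beta_{xx} \sec\beta_{\phi x}\, \partial\phi/\partial x - \sin\phi_{Axt} \cos\beta_{tt} \sec\beta_{Axt}\, \partial A_x/\partial t$    (321)

where

    $\phi_{\phi x} = \theta_\phi + \beta_{\phi x} - \theta_x - \beta_{xx}$    (322)

    $\phi_{Axt} = \theta_{Ax} + \beta_{Axt} - \theta_t - \beta_{tt}$    (323)

    $\tan\beta_{\phi x} = (\phi\, \partial\theta_\phi/\partial x)/(\partial\phi/\partial x)$    (324)

    $\tan\beta_{Axt} = (A_x\, \partial\theta_{Ax}/\partial t)/(\partial A_x/\partial t)$    (325)

The y and z components of equation (319) can be handled in a similar fashion.
Equations (307) through (325) are the equations of the electromagnetic field
in a broken symmetry spacetime such as may be induced by a gravitational field.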

The scalar and vector potentials for broken symmetry spacetime can also
be written as the following modifications of the standard results (References 18-28)


4  “  (tt-nz L)  *  Jp’/jr  -  r'|e  **  (r  ~  r  ^dv' 

(326) 

l  -  5/<4„l)Jj7|!  -?,)dv' 

(327) 

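where

    $\bar{L} = \bar{L}_s$    spherically symmetric system    (328)

    $\bar{L} = \bar{L}_c$    axially symmetric system    (329)

where $\bar{L}_s$ and $\bar{L}_c$ are given by equations (48) and (61) respectively in terms of
$\langle\theta_\phi\rangle$ and $\langle\theta_\psi\rangle$. These average values are defined as follows

    $\bar{L}_c = e^{j\langle\theta_\phi\rangle} = \frac{1}{2\pi} \int_0^{2\pi} d\bar{\phi} = e^{j\theta_\phi(2\pi)}$    (330)
    $\quad = \frac{1}{2\pi} \int_0^{2\pi} \sec\beta_{\phi\phi}\, e^{j(\theta_\phi + \beta_{\phi\phi})}\, d\phi$

and therefore

    $\langle\theta_\phi\rangle = \theta_\phi(2\pi)$    (331)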


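for cylindrical coordinates. For spherical coordinates it follows that

    $\bar{L}_s = \frac{1}{4\pi} \int_0^{2\pi} \int_0^{\pi} \sin\bar{\psi}\, d\bar{\phi}\, d\bar{\psi}$    (332)
    $\quad = \frac{1}{4\pi} \int_0^{2\pi} \int_0^{\pi} S_\psi \sec\beta_{\phi\phi} \sec\beta_{\psi\psi}\, e^{j\theta_{\psi\phi}}\, d\phi\, d\psi$
    $\quad = \frac{1}{2}\, e^{j\theta_\phi(2\pi)} [1 - \cos(\pi e^{j\theta_\psi(\pi)})]$
    $\quad = \frac{1}{2}\, e^{j\langle\theta_\phi\rangle} [1 - \cos(\pi e^{j\langle\theta_\psi\rangle})]$

where

    $\theta_{\psi\phi} = \theta_{s\psi} + \theta_\phi + \theta_\psi + \beta_{\phi\phi} + \beta_{\psi\psi}$    (333)

    $\langle\theta_\psi\rangle = \theta_\psi(\pi)$    (334)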
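
The two forms of the angular factor in equation (330) can be compared numerically. The sketch below is an added illustration under stated assumptions: the profile `theta_phi` is hypothetical, the complex azimuthal coordinate is taken as $\bar{\phi} = \phi e^{j\theta_\phi}$, and $\tan\beta_{\phi\phi} = \phi\, \partial\theta_\phi/\partial\phi$ is assumed by analogy with equation (317). It integrates the second form of (330) by the midpoint rule and compares the result with $e^{j\theta_\phi(2\pi)}$.

```python
import cmath
import math


def theta_phi(phi):
    """Hypothetical internal phase angle of the azimuthal coordinate (illustration only)."""
    return 0.05 * math.sin(phi / 4.0)


def dtheta_phi(phi):
    """Derivative of theta_phi with respect to phi."""
    return 0.0125 * math.cos(phi / 4.0)


def l_c_numeric(n=20000):
    """Midpoint-rule estimate of (1/2pi) * integral of sec(beta) exp(j(theta + beta)) dphi."""
    h = 2.0 * math.pi / n
    total = 0.0 + 0.0j
    for k in range(n):
        phi = (k + 0.5) * h
        beta = math.atan(phi * dtheta_phi(phi))  # assumed: tan(beta) = phi * dtheta/dphi
        total += cmath.exp(1j * (theta_phi(phi) + beta)) / math.cos(beta) * h
    return total / (2.0 * math.pi)


l_c = l_c_numeric()
l_c_exact = cmath.exp(1j * theta_phi(2.0 * math.pi))  # first form of equation (330)
```

The integrand is the derivative of $\phi e^{j\theta_\phi}$ with respect to $\phi$, so the integral depends only on the endpoint value $\theta_\phi(2\pi)$, which is the content of equation (331).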


The integration over the volume elements dv' produces the factor $\bar{L}$ so that this
factor cancels in the numerator and denominator in equations (326) and (327),
and $\bar{\phi}$ and $\bar{A}$ are independent of $\langle\theta_\psi\rangle$ and $\langle\theta_\phi\rangle$ (Reference 12). This means that the
electric and magnetic fields are also independent of $\langle\theta_\psi\rangle$ and $\langle\theta_\phi\rangle$. This will be
examined in more detail in Sections 4 and 5.

G.  Boundary  Conditions  in  Broken  Symmetry  Spacetime.


For  electromagnetic  fields  in  matter  located  in  a  gravitational  field  the 
standard  four  boundary  conditions  become  complex  number  equations  due  to  the 
internal  phase  angles  of  the  field  vectors,  current  density  and  charge  density. 

Case  1.  Tangential  Components  of  the  Electric  Fields. 

The  tangential  component  of  the  electric  field  vector  is  continuous  at 
the  boundary  between  two  materials 


    $\bar{E}_{t1} = \bar{E}_{t2}$    (335)

which gives

    $E_{t1} = E_{t2}$,  $\theta_{Et1} = \theta_{Et2}$    (336)


Thus  the  magnitude  and  the  internal  phase  angle  of  the  tangential  component  of 
the  electric  field  vector  are  continuous  across  a  boundary. 

Case  2.  Tangential  Components  of  the  Magnetic  Field. 

The  tangential  components  of  the  magnetic  field  vector  on  either  side  of 
a  boundary  are  related  to  the  complex  number  surface  current  density  by 


    $\bar{J}_s = \bar{H}_{1t} - \bar{H}_{2t}$    (337)

which gives

    $J_s \cos\theta_{Js} = H_{1t} \cos\theta_{H1t} - H_{2t} \cos\theta_{H2t}$    (338)

    $J_s \sin\theta_{Js} = H_{1t} \sin\theta_{H1t} - H_{2t} \sin\theta_{H2t}$    (339)

    $J_s^2 = H_{1t}^2 + H_{2t}^2 - 2 H_{1t} H_{2t} \cos(\theta_{H2t} - \theta_{H1t})$    (340)

    $\tan\theta_{Js} = (H_{1t} \sin\theta_{H1t} - H_{2t} \sin\theta_{H2t})/(H_{1t} \cos\theta_{H1t} - H_{2t} \cos\theta_{H2t})$    (341)

In a gravitational field, the internal angles of the field vector must be
considered in the boundary conditions. Equation (338) shows that the measured
surface current density is equal to the difference of the measured values of
the tangential components of the magnetic field vectors.
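
Equations (338) through (341) are the polar form of the complex subtraction in equation (337), which the following Python sketch makes explicit. It is an illustration only; the function name and the sample field values are assumptions.

```python
import cmath
import math


def surface_current(h1, theta1, h2, theta2):
    """Magnitude and internal phase angle of J_s = H_1t - H_2t, equation (337)."""
    j_bar = cmath.rect(h1, theta1) - cmath.rect(h2, theta2)
    return abs(j_bar), cmath.phase(j_bar)


# Hypothetical tangential field magnitudes and phase angles (radians).
h1, th1, h2, th2 = 3.0, 0.10, 1.2, -0.25
js, th_js = surface_current(h1, th1, h2, th2)

# Closed form for the magnitude, equation (340).
js_sq_closed = h1**2 + h2**2 - 2.0 * h1 * h2 * math.cos(th2 - th1)

# Measured (real-part) values, equation (338).
js_measured = js * math.cos(th_js)
h_measured_diff = h1 * math.cos(th1) - h2 * math.cos(th2)
```

The same function applies unchanged to the surface charge condition of Case 4, with the normal components of the electric displacement in place of the tangential magnetic fields.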

Case  3.  Normal  Components  of  the  Magnetic  Induction  Vector. 

The  normal  component  of  the  magnetic  induction  vector  is  continuous  across 
a  boundary  between  two  materials 


    $\bar{B}_{n1} = \bar{B}_{n2}$    (342)

and therefore

    $B_{n1} = B_{n2}$,  $\theta_{Bn1} = \theta_{Bn2}$    (343)

Both magnitude and internal phase angle are continuous across a boundary.


Case  4.  Normal  Components  of  the  Electric  Displacement  Vector. 


The  normal  components  of  the  electric  displacement  vector  are  related  to 
the  complex  number  surface  electric  charge  density  as  follows 


    $\bar{\rho}_s = \bar{D}_{n1} - \bar{D}_{n2}$    (344)

    $\rho_s \cos\theta_{\rho s} = D_{n1} \cos\theta_{Dn1} - D_{n2} \cos\theta_{Dn2}$    (344A)

    $\rho_s \sin\theta_{\rho s} = D_{n1} \sin\theta_{Dn1} - D_{n2} \sin\theta_{Dn2}$    (344B)

    $\rho_s^2 = D_{n1}^2 + D_{n2}^2 - 2 D_{n1} D_{n2} \cos(\theta_{Dn1} - \theta_{Dn2})$    (345)

    $\tan\theta_{\rho s} = (D_{n1} \sin\theta_{Dn1} - D_{n2} \sin\theta_{Dn2})/(D_{n1} \cos\theta_{Dn1} - D_{n2} \cos\theta_{Dn2})$    (346)

Equation (344A) states that the measured surface charge density is equal to the
difference of the measured values of the normal components of the electric
displacement vector.

4.  ELECTROSTATICS  WITH  BROKEN  INTERNAL  SYMMETRIES.  The preceding section
suggested that gravitation will have an effect on electromagnetism through the
internal phase angles of space and time coordinates. This section considers
the effect of a gravity induced broken spacetime symmetry on the elementary
calculations of electrostatics.

A.  Gauss  Law  for  Broken  Symmetry  Spacetime. 

The differential form of Gauss' law equation (115) can be written in terms
of a potential function by

    $\bar{E} = -\bar{\nabla}\bar{W}$    (347)

where $\bar{W}$ = potential function with an internal phase angle. Then equation (115)
becomes

    $\bar{\nabla}^2 \bar{W} = -\bar{\rho}_q/\bar{\epsilon}$    (348)

It will be shown that $\bar{\rho}_q$ in equation (348) does not contain the angular factor
$\bar{L}$. On the other hand, the integral form of the Gauss law for a broken
symmetry spacetime is

    $\oint \bar{D} \cdot \bar{n}\, d\bar{S} = \bar{L} Q$    (349)

where $\bar{L}$ is given by equations (48) or (332) for spherical polar coordinates
and by equations (61) or (330) for cylindrical polar coordinates. The factor
$\bar{L}$ occurs in equation (349) on account of the surface integral. In general the
factor $\bar{L}$ occurs because of an integration over a surface or a volume of a sphere
or cylinder with broken symmetry coordinates. In fact

    spherical coordinates
    $\oint d\bar{V} = \bar{L}_s (4/3)\pi \bar{r}^3$    (350)
    $\oint d\bar{S} = \bar{L}_s 4\pi \bar{r}^2$    (351)

    cylindrical coordinates
    $\oint d\bar{V} = \bar{L}_c \pi \bar{r}^2 \bar{h}$    (352)
    $\oint d\bar{S} = \bar{L}_c 2\pi \bar{r} \bar{h}$    (353)
    $\oint d\bar{s} = \bar{L}_c 2\pi \bar{r}$    (354)

where $\bar{L}_s$ and $\bar{L}_c$ are given by equations (332) and (330) respectively, and where
$\bar{h}$ = complex number height of a cylinder. Because $\bar{L}$ occurs on both sides of
equation (349) it follows that $\langle\theta_\phi\rangle$ and $\langle\theta_\psi\rangle$ do not enter the final expressions
for the complex number electric field and electric displacement vectors. This
conclusion also follows from the definition of the divergence of a vector with
broken internal symmetry which is

    $\bar{\nabla} \cdot \bar{D} = \lim_{V \to 0} \oint \bar{D} \cdot \bar{n}\, d\bar{S} / \oint d\bar{V} = \bar{\rho}_q$    (355)

and because $\bar{L}$ appears in both the integrals over $d\bar{S}$ and $d\bar{V}$ it cancels in the
numerator and denominator making $\bar{D}$, $\bar{E}$ and $\bar{\rho}_q$ independent of $\bar{L}$. Note that in
all calculations the charge is a scalar, so that the measured charge, the
conventional charge and the fundamental charge are all equal

    $Q_m = Q_c = Q$    (356)

In this respect charge and rest mass are similar.

B.  Spherically  Symmetric  Constant  Charge  Distribution.


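For spherical symmetry equation (115) becomes

    $(1/\bar{r}^2)\, d/d\bar{r}\,(\bar{r}^2 \bar{D}_r) = \bar{\rho}_q$    (357)

    $d\bar{D}_r/d\bar{r} + (2/\bar{r}) \bar{D}_r = \bar{\rho}_q$    (358)

For $\bar{\rho}_q$ = constant ($\rho_q$ = constant, $\theta_{\rho q}$ = constant)

    $\bar{D}_r = \frac{1}{3} \bar{\rho}_q \bar{r} = \frac{1}{3} \rho_q r\, e^{-2j\theta_r}$    (359)

which gives

    $D_r = \frac{1}{3} \rho_q r$,  $\theta_{Dr} = -2\theta_r$    (360)

where equation (167) was used with $\theta_r$ = constant and $\beta_{rr} = 0$ because $\theta_{\rho q}$ =
constant. The corresponding complex number potential is given by

    $-\partial\bar{W}_r/\partial\bar{r} = \frac{1}{3} \bar{\rho}_q \bar{r}$    (361)

    $\bar{W}_r = -\frac{1}{6} \bar{\rho}_q \bar{r}^2 + \bar{c}$    (362)

where the constant $\bar{c}$ is determined from the condition

    $\bar{W}_r(\bar{a}) = Q(a)/(4\pi \bar{a}) = \frac{1}{3} \rho_q a^2 e^{-j\theta_a}$    (363)

where $\bar{a}$ = complex number radius of the spherical charge. This gives

    $\bar{c} = \frac{1}{2} \rho_q a^2 e^{-j\theta_a}$    (364)

and therefore

    $\bar{W}_r = (\rho_q/6)(3a^2 - r^2)\, e^{-j\theta_a}$,  $r \le a$    (365)

where $\theta_r = \theta_a$ = constant. From equation (166) and $\beta_{rr} = 0$ it follows that

    $Q = \frac{4}{3} \pi a^3 \rho_q$    (366)

    $\bar{W}_r = Q/(8\pi a^3)\,(3a^2 - r^2)\, e^{-j\theta_a}$    (367)

    $W_r = Q/(8\pi a^3)\,(3a^2 - r^2)$,  $\theta_{Wr} = -\theta_a$,  $r \le a$    (368)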


From equation (359) it follows that

    $D_r = Qr/(4\pi a^3)$,  $\theta_{Dr} = -2\theta_a$    (369)

The measured value of the electric displacement for the case of constant charge
density and for $r \le a$ is

    $D_{rm} = D_r \cos\theta_{Dr} = Qr/(4\pi a^3)\, \cos(2\theta_a)$    (370)

and the measured potential is for $r \le a$

    $W_{rm} = W_r \cos\theta_{Wr} = Q/(8\pi a^3)\,(3a^2 - r^2) \cos\theta_a$,  $r \le a$    (371)

The conventional calculation of electric displacement and potential is given for
constant density and for $r \le a$ by

    $D_{rc} = Q r_m/(4\pi a_m^3) = Qr/(4\pi a^3)\, \cos^{-2}\theta_a$    (372)

    $W_{rc} = Q/(8\pi a_m^3)\,(3a_m^2 - r_m^2) = Q/(8\pi a^3)\,(3a^2 - r^2)\, \cos^{-1}\theta_a$    (373)

and therefore

    $(D_{rc} - D_{rm})/D_{rc} = 1 - \cos^2\theta_a \cos(2\theta_a)$    (374)

    $(W_{rc} - W_{rm})/W_{rc} = \sin^2\theta_a$    (375)

where the fact that $\theta_r = \theta_a$ for $r \le a$ was used.
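
The fractional differences (374) and (375) follow from equations (370) through (373) once the measured lengths are written as $a_m = a \cos\theta_a$ and $r_m = r \cos\theta_a$. The following Python sketch carries out that substitution numerically; it is an added illustration, and the function name and sample values are assumptions.

```python
import math


def interior_differences(q, a, r, theta_a):
    """Fractional differences between conventional and measured D_r and W_r for r <= a."""
    # Measured values, equations (370) and (371).
    d_m = q * r / (4.0 * math.pi * a**3) * math.cos(2.0 * theta_a)
    w_m = q * (3.0 * a**2 - r**2) / (8.0 * math.pi * a**3) * math.cos(theta_a)
    # Conventional values use the measured lengths, equations (372) and (373).
    a_m = a * math.cos(theta_a)
    r_m = r * math.cos(theta_a)
    d_c = q * r_m / (4.0 * math.pi * a_m**3)
    w_c = q * (3.0 * a_m**2 - r_m**2) / (8.0 * math.pi * a_m**3)
    return (d_c - d_m) / d_c, (w_c - w_m) / w_c


# Hypothetical charge, radius, field point and phase angle.
q, a, r, theta_a = 1.0e-9, 0.5, 0.2, math.radians(-5.7)
diff_d, diff_w = interior_differences(q, a, r, theta_a)
```

Both differences are independent of $Q$, $a$ and $r$ and depend only on $\theta_a$, as equations (374) and (375) state.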


The  general  case  of  any  spherically  symmetric  charge  distribution  is  ob¬ 
tained  from  equations  (115)  and  (165)  to  be 


l/f23/3f(r2Dr)  *  (41rr2)“1dQ/dr 

(376) 

which  gives  immediately 

=  (4n?2)~lQ(r)  =  (4irr2)  1Q(r)e  2j0r 

(377) 

so  that  in  general 

Dr  =  (4Trr2)'1Q(r)  0DR  =  '  ™ T 

(378) 

For  r  ^  a  equations  (377)  and  (378)  give 

=  Q/(4irf2)  =  Q/(4ur2)e 

r  ^  a 

(379) 

Dr  =  Q/ (4nr2)  0Qr  =  -  20r 

r  ^  a 

(380) 

and 

W  =  Q/ (4irr)  =  Q/(4irr)e 

(381) 

=»  Q/(4irr)  ®W  =  ~  9r 

r  >  a 

(382) 

The measured electric displacement and potential for $r \ge a$ are

$$D_{rm} = \frac{Q}{4\pi r^2}\cos(2\theta_r) \qquad r \ge a \tag{383}$$

$$W_{rm} = \frac{Q}{4\pi r}\cos\theta_r \qquad r \ge a \tag{384}$$

The corresponding conventional calculations give

$$D_{rc} = \frac{Q}{4\pi r_m^2} = \frac{Q}{4\pi r^2\cos^2\theta_r} \qquad r \ge a \tag{385}$$

$$W_{rc} = \frac{Q}{4\pi r_m} = \frac{Q}{4\pi r\cos\theta_r} \qquad r \ge a \tag{386}$$

and therefore

$$\frac{D_{rc} - D_{rm}}{D_{rc}} = 1 - \cos^2\theta_r\cos(2\theta_r) \qquad r \ge a \tag{387}$$

$$\frac{W_{rc} - W_{rm}}{W_{rc}} = \sin^2\theta_r \qquad r \ge a \tag{388}$$

Equations (379) through (388) agree with equations (367) through (375) for $r = a$. For constant density $Q(r) = \tfrac{4}{3}\pi r^3\rho_q$ and $\theta_r$ = constant, and equations (377) and (378) become the constant density case described in equations (359) and (360) for $r \le a$.
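
As a numerical illustration of the exterior results, equations (383) through (388), the following Python sketch evaluates the complex displacement and potential for a complex radius and compares the measured values (real parts) with the conventional values built from $r_m = r\cos\theta_r$. The function name and the sample numbers are assumptions made for illustration only; Python's `1j` stands in for the internal imaginary unit $j$, and $Q$ is taken to be real.

```python
import cmath
import math

def point_charge_discrepancies(theta_r, r=2.0, Q=1.0):
    """Return the fractional differences of equations (387) and (388).

    theta_r is the internal phase angle of the radius (radians).
    r and Q are hypothetical sample values with Q assumed real.
    """
    r_bar = r * cmath.exp(1j * theta_r)           # complex radius
    D_bar = Q / (4 * math.pi * r_bar ** 2)        # complex displacement
    W_bar = Q / (4 * math.pi * r_bar)             # complex potential
    D_rm = D_bar.real                             # measured = magnitude * cos(phase)
    W_rm = W_bar.real
    r_m = r * math.cos(theta_r)                   # measured radius
    D_rc = Q / (4 * math.pi * r_m ** 2)           # conventional displacement
    W_rc = Q / (4 * math.pi * r_m)                # conventional potential
    return (D_rc - D_rm) / D_rc, (W_rc - W_rm) / W_rc

if __name__ == "__main__":
    dD, dW = point_charge_discrepancies(0.1)
    print(dD, 1 - math.cos(0.1) ** 2 * math.cos(0.2))
    print(dW, math.sin(0.1) ** 2)
```

Both discrepancies vanish when the internal phase angle is zero, as expected from the closed forms.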


C.  Line  Charge  in  Broken  Symmetry  Space. 


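The line charge problem in broken symmetry space can be described by equation (115) written in cylindrical coordinates as

$$\frac{1}{\bar r}\frac{\partial}{\partial\bar r}(\bar r\bar D_r) = \bar\rho_q = \frac{d\bar\rho_{q\ell}}{2\pi\bar r\,d\bar r} \tag{389}$$

where $\bar\rho_q$ is given by equation (177) and $\bar\rho_{q\ell}$ by equation (180). From equation (389) it follows that for a line charge of infinite length

$$\bar D_r = \frac{\bar\rho_{q\ell}}{2\pi\bar r} \qquad D_r = \frac{\rho_{q\ell}}{2\pi r} \tag{390}$$

$$D_{rm} = D_r\cos\theta_{Dr} \qquad \theta_{Dr} = -\theta_z - \beta_{zz} - \theta_r \tag{391}$$

The radial electric field is given by

$$\bar E_r = \frac{\bar\rho_{q\ell}}{2\pi\bar r\bar\epsilon} \qquad E_r = \frac{\rho_{q\ell}}{2\pi r\epsilon} \tag{392}$$

$$E_{rm} = E_r\cos\theta_{Er} \qquad \theta_{Er} = -\theta_z - \beta_{zz} - \theta_r - \theta_\epsilon \tag{393}$$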


If $\bar\rho_{q\ell}$ = constant then from equation (181) it follows that $\rho_{q\ell} = Q/z$ = constant, $\theta_{\rho q\ell} = -\theta_z$ = constant and $\beta_{zz} = 0$. For a line charge of finite length the electric displacement vector is given by the following generalization of the well known result$^{27}$

$$\bar D_r = \frac{\bar\rho_{q\ell}}{2\pi\bar r}\sin\bar\alpha \tag{394}$$

where $\tan\bar\alpha = \bar h/\bar r$, $2\bar h$ = complex number length of the line charge, and $\sin\bar\alpha$ can be written as in equation (4). Equation (394) reduces to equation (390) for an infinite line of charge by noting that

$$S_\alpha(\pi/2) = 1 \qquad \theta_{s\alpha}(\pi/2) = 0 \tag{395}$$

In Reference 12 it is noted that only for the angles $\alpha = \pm\pi/2$ is $\theta_\alpha = 0$ and $\theta_{s\alpha} = 0$.


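For a charged cylinder of infinite length a simple analysis gives the electric displacement for $r < a$ as

$$\bar D_r = \frac{\bar\rho_{q\ell}\bar r}{2\pi\bar a^2} = \frac{\bar\rho_q\bar r}{2} \qquad D_{rm} = D_r\cos\theta_{Dr} \tag{396}$$

$$D_r = \frac{\rho_{q\ell}r}{2\pi a^2} \qquad \theta_{Dr} = \theta_r - 2\theta_a - \theta_z - \beta_{zz} \tag{397}$$

where $\bar a$ = complex radius of cylinder and $a_m = a\cos\theta_a$, $r_m = r\cos\theta_r$ with $\theta_a \ne \theta_r$.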


D.  Broken  Symmetry  Capacitance. 

The generalization of the standard definition of capacitance to broken symmetry space is$^{18-28}$

$$\bar C_a = Q/\bar W_a \tag{398}$$

$$C_a = Q/W_a \qquad \theta_{Ca} = -\theta_{Wa} = 2(\theta_t - \theta_a) \tag{399}$$


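For a parallel plate condenser this becomes$^{18-28}$

$$\bar C_z = \frac{Q}{\bar E\bar z} \qquad \bar D = \bar\epsilon\bar E = \frac{Q}{\bar A_{xy}} \tag{400}$$

and therefore

$$\bar C_z = \bar\epsilon\bar A_{xy}/\bar z \tag{401}$$

$$C_z = \epsilon A_{xy}/z \qquad \theta_{Cz} = \theta_\epsilon + \theta_x + \theta_y - \theta_z = 2(\theta_t - \theta_z) \tag{402}$$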

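with corresponding expressions for $\bar C_x$ and $\bar C_y$. For a coaxial cylinder of length $\bar h$, outer radius $\bar b$ and inner radius $\bar a$ the capacitance is$^{18-28}$

$$\bar C = \frac{2\pi\bar\epsilon\bar h}{\ln(\bar b/\bar a)} \tag{403}$$

$$\bar C = \frac{2\pi\epsilon h\,e^{j(\theta_\epsilon + \theta_h)}}{\ln(b/a) + j(\theta_b - \theta_a)} \tag{404}$$

The magnitude $C$ in this case depends on $\theta_b - \theta_a$ because

$$C = \frac{2\pi\epsilon h}{[\ln^2(b/a) + (\theta_b - \theta_a)^2]^{1/2}} \qquad \theta_C = \theta_\epsilon + \theta_h - \tan^{-1}\!\left[\frac{\theta_b - \theta_a}{\ln(b/a)}\right] \tag{405}$$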
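
The step from equation (403) to equation (405) is ordinary complex arithmetic and can be checked numerically. The sketch below is a minimal illustration, assuming hypothetical sample values for the magnitudes and internal phase angles; the function names are not from the paper, and Python's `1j` stands in for the internal imaginary unit $j$. It assumes $b > a$ and $|\theta_b - \theta_a| < \pi$, so that the principal branch of the logarithm applies.

```python
import cmath
import math

def coax_capacitance_complex(eps, h, a, b, th_eps, th_h, th_a, th_b):
    """Complex capacitance of equation (403) built from complex inputs."""
    eps_bar = eps * cmath.exp(1j * th_eps)
    h_bar = h * cmath.exp(1j * th_h)
    a_bar = a * cmath.exp(1j * th_a)
    b_bar = b * cmath.exp(1j * th_b)
    return 2 * math.pi * eps_bar * h_bar / cmath.log(b_bar / a_bar)

def coax_capacitance_polar(eps, h, a, b, th_eps, th_h, th_a, th_b):
    """Magnitude and phase according to equation (405), valid for b > a."""
    log_ratio = math.log(b / a)
    dth = th_b - th_a
    magnitude = 2 * math.pi * eps * h / math.hypot(log_ratio, dth)
    phase = th_eps + th_h - math.atan(dth / log_ratio)
    return magnitude, phase

if __name__ == "__main__":
    # Hypothetical sample values (SI units, angles in radians).
    args = (8.854e-12, 1.0, 1.0e-3, 5.0e-3, 0.02, 0.01, 0.03, 0.05)
    C_bar = coax_capacitance_complex(*args)
    C, theta_C = coax_capacitance_polar(*args)
    print(abs(C_bar), C)
    print(cmath.phase(C_bar), theta_C)
```

The measured capacitance in the sense used throughout the paper would then be `C * math.cos(theta_C)`.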

5. ASYMMETRIC MAGNETOSTATICS. This section considers the basic calculations of magnetostatics in a spacetime that has broken internal symmetries. The broken symmetries are due to the local pressure and energy density of the ambient matter. It is gravity that produces the pressure in the earth's atmosphere and therefore the local broken symmetry of spacetime at the earth's surface is due to gravity. The elementary calculations of magnetostatics are affected by gravity.

A.  Ampere's  Law  for  Asymmetric  Space  and  Time. 

The broken symmetry form of Ampere's law must be written as

$$\oint\bar H\cdot d\bar l = \bar I L_c \tag{406}$$

where $L_c$ is given by equation (330). This must be the broken symmetry form of Ampere's law because the factor $L_c$ also arises from the line integral on the left hand side of equation (406). For circular symmetry

$$2\pi\bar r\bar H = \bar I \tag{407}$$

$$H = \frac{I}{2\pi r} \qquad \theta_H = \theta_I - \theta_r = -\theta_t - \theta_r - \beta_{tt} \tag{408}$$

This  result  must  also  follow  from  Maxwell's  equations. 

B.  Magnetic  Field  Due  to  Currents  in  a  Wire,  Solenoid  and  a  Toroid  with 
Broken  Spacetime  Symmetry. 

The stationary magnetic field for a current in a wire can be obtained from equations (113) and (220) which can be written as

$$\frac{1}{\bar r}\frac{\partial}{\partial\bar r}(\bar r\bar H_\phi) = \bar j_z = \frac{d\bar I}{2\pi\bar r\,d\bar r} \tag{409}$$

and therefore the azimuthal magnetic field associated with a current in the z direction is given by

$$\bar H_\phi = \frac{\bar I}{2\pi\bar r} \tag{410}$$

$$H_\phi = \frac{I}{2\pi r} \qquad \theta_{H\phi} = \theta_I - \theta_r = -\theta_t - \beta_{tt} - \theta_r \tag{411}$$

The measured and conventionally calculated magnetic fields are respectively

$$H_{\phi m} = H_\phi\cos\theta_{H\phi} = \frac{I}{2\pi r}\cos(\theta_t + \theta_r) \tag{412}$$

$$H_{\phi c} = \frac{I_m}{2\pi r_m} = \frac{I}{2\pi r}\,\frac{\cos\theta_t}{\cos\theta_r} \tag{413}$$

Combining equations (411) and (412) gives

$$H_{\phi m} = H_{\phi c}(1 - \tan\theta_r\tan\theta_t)\cos^2\theta_r \tag{414}$$

$$\frac{H_{\phi c} - H_{\phi m}}{H_{\phi c}} = \sin\theta_r\cos\theta_r(\tan\theta_r + \tan\theta_t) \approx \theta_r(\theta_r + \theta_t) \tag{415}$$
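
The algebra linking the measured field of equation (412), the conventional field of equation (413), and the closed forms of equations (414) and (415) can be confirmed with a short numerical check. The following Python sketch is illustrative only: the function names and the sample angles are assumptions, $\beta_{tt}$ is taken to be zero, and the common factor $I/(2\pi r)$ is divided out.

```python
import math

def measured_over_conventional(theta_t, theta_r):
    """Ratio H_phi_m / H_phi_c formed directly from equations (412) and (413)."""
    measured = math.cos(theta_t + theta_r)               # eq. (412) without I/(2 pi r)
    conventional = math.cos(theta_t) / math.cos(theta_r)  # eq. (413) without I/(2 pi r)
    return measured / conventional

def ratio_closed_form(theta_t, theta_r):
    """Right hand side of equation (414) divided by H_phi_c."""
    return (1 - math.tan(theta_r) * math.tan(theta_t)) * math.cos(theta_r) ** 2

def fractional_difference(theta_t, theta_r):
    """Right hand side of equation (415)."""
    return math.sin(theta_r) * math.cos(theta_r) * (math.tan(theta_r) + math.tan(theta_t))

if __name__ == "__main__":
    th_t, th_r = 0.2, 0.3   # hypothetical internal phase angles in radians
    print(measured_over_conventional(th_t, th_r), ratio_closed_form(th_t, th_r))
    print(1 - measured_over_conventional(th_t, th_r), fractional_difference(th_t, th_r))
```

For small angles the last quantity approaches $\theta_r(\theta_r + \theta_t)$, which is the approximation quoted with equation (415).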


From equation (411) and (413) it follows that $H_{\phi m} = 0$ when $\theta_t + \theta_r = \pi/2$, and this combined with the free electron condition $\theta_z = 2\theta_t$ in the wire gives $\theta_z/2 + \theta_r = \pi/2$.


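From equation (307) it follows that the magnetic vector potential for a static current in a wire situated in broken symmetry spacetime is given by

$$\bar B_\phi = \bar\mu\bar H_\phi = -\frac{\partial\bar A_z}{\partial\bar r} \tag{416}$$

Combining equations (410) and (416) gives

$$\bar A_z = -\frac{\bar\mu\bar I}{2\pi}\ln\bar r = \frac{\bar\mu\bar I}{2\pi}\ln(1/\bar r) \tag{417}$$

or

$$\bar A_z = A_z e^{j\theta_{Az}} \tag{418}$$

where

$$A_z = -\frac{\mu I}{2\pi}[\ln^2 r + \theta_r^2]^{1/2} \tag{419}$$

$$\theta_{Az} = \phi_{\theta r} + \theta_I + \theta_\mu = \phi_{\theta r} - \theta_t - \beta_{tt} + \theta_\mu \tag{420}$$

$$\tan\phi_{\theta r} = \theta_r/\ln r \tag{421}$$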

The analysis of the coaxial cable in broken symmetry spacetime is a simple extension of the standard analysis.$^{18-28}$

For an infinite solenoid the generalization to broken symmetry spacetime is$^{18-20}$

$$\bar H_z = n'\bar I_\phi \tag{422}$$

$$H_z = n' I_\phi \qquad \theta_{Hz} = -\theta_t - \theta_z - \beta_{tt} \tag{423}$$

where $n'$ = number of turns per unit length = $N/z$. For a toroid

$$\bar H_\phi = \frac{N\bar I}{2\pi\bar r} \tag{424}$$

$$H_\phi = \frac{NI}{2\pi r} \qquad \theta_{H\phi} = -\theta_t - \beta_{tt} - \theta_r \tag{425}$$


C.  Biot-Savart  Law  for  Broken  Symmetry  Spacetime. 


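The Biot-Savart law applied to the calculation of the magnetic field of a current in a thin wire is given by the following complex number extension of a well known result$^{18-28}$

$$\bar H_\phi = \frac{\bar I}{4\pi}\int_{-\infty}^{\infty}\frac{\bar r}{(\bar r^2 + \bar z^2)^{3/2}}\,d\bar z \tag{426}$$

Introducing $\bar z = \bar r\tan\bar\psi$ where $\bar\psi = \psi\exp(j\theta_\psi)$ gives

$$\bar H_\phi = \frac{\bar I}{4\pi\bar r}\int_{\bar\psi_1}^{\bar\psi_2}\cos\bar\psi\,d\bar\psi \tag{427}$$

$$\bar H_\phi = \frac{\bar I}{4\pi\bar r}(\sin\bar\psi_2 - \sin\bar\psi_1) = \frac{\bar I}{2\pi\bar r}\sin\bar\psi_2 \tag{428}$$

where

$$\bar\psi_2 = \frac{\pi}{2}\,e^{j\theta_\psi(\pi/2)} \tag{429}$$

$$\bar\psi_1 = -\frac{\pi}{2}\,e^{j\theta_\psi(-\pi/2)} \tag{430}$$

But it has been shown in Reference 12 that $\theta_\psi(\pm\pi/2) = 0$, and therefore

$$\bar H_\phi = \frac{\bar I}{2\pi\bar r} \tag{431}$$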


which agrees with equation (408) which comes immediately from the broken symmetry form of Ampere's law given in equation (406). Because $\theta_\psi(\pm\pi/2) = 0$ it follows that the Biot-Savart law does not suggest that internal phase angles of angles should appear in the expression for the magnetic field, and this substantiates the inclusion of the angular factor $L_c$ in the right hand side of equation (406), which excludes angular internal phase angles from appearing in the expression for $\bar H_\phi$.

It is easy to show that the broken symmetry generalization for the axis value of the magnetic field of a circular current loop of radius $\bar a$ at a point $\bar z$ on the axis above the loop is given by$^{18-28}$

$$\bar H_z = \frac{\bar I\bar a^2}{2\bar R^3} \tag{432}$$

where

$$\bar R^2 = \bar a^2 + \bar z^2 \tag{433}$$

Therefore

$$H_z = \frac{Ia^2}{2R^3} \tag{434}$$

$$\theta_{Hz} = \theta_I + 2\theta_a - 3\theta_R = -\theta_t - \beta_{tt} + 2\theta_a - 3\theta_R \tag{435}$$

where $R$ and $\theta_R$ are obtained from the following two component equations associated with equation (433)

$$R^2\cos(2\theta_R) = a^2\cos(2\theta_a) + z^2\cos(2\theta_z) \tag{436}$$

$$R^2\sin(2\theta_R) = a^2\sin(2\theta_a) + z^2\sin(2\theta_z) \tag{437}$$
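
Equations (436) and (437) are the real and imaginary parts of equation (433), so $R$ and $\theta_R$ can be obtained from one complex addition. The Python sketch below illustrates this under stated assumptions: the function name and sample values are hypothetical, Python's `1j` stands in for the internal unit $j$, and $\theta_R$ is taken on the principal branch.

```python
import cmath
import math

def loop_distance(a, th_a, z, th_z):
    """Solve equations (436) and (437) for the magnitude R and phase theta_R."""
    a_bar = a * cmath.exp(1j * th_a)
    z_bar = z * cmath.exp(1j * th_z)
    R2_bar = a_bar ** 2 + z_bar ** 2          # equation (433)
    R = math.sqrt(abs(R2_bar))                # |R^2| = R**2
    th_R = cmath.phase(R2_bar) / 2            # arg(R^2) = 2 * theta_R
    return R, th_R

if __name__ == "__main__":
    # Hypothetical loop radius, axial distance, and internal phase angles.
    a, th_a, z, th_z = 0.5, 0.1, 1.2, 0.3
    R, th_R = loop_distance(a, th_a, z, th_z)
    print(R ** 2 * math.cos(2 * th_R), a ** 2 * math.cos(2 * th_a) + z ** 2 * math.cos(2 * th_z))
    print(R ** 2 * math.sin(2 * th_R), a ** 2 * math.sin(2 * th_a) + z ** 2 * math.sin(2 * th_z))
```

With $R$ and $\theta_R$ in hand, the magnitude and phase of the axial field follow from equations (434) and (435).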


D.  Asymmetric  Hall  Effect 

A description of the Hall effect is given in detail in the literature.$^{29}$ In broken symmetry spacetime the application of transverse electric and magnetic fields to a solid conductor or semiconductor induces a complex number electric field due to charge drift which is given by

$$\bar E_x = \bar\delta\bar E_y\bar B_z \tag{438}$$

where $\bar\delta$ = complex number ion mobility (Hall mobility) given by

$$\bar\delta = \bar\sigma/(q\bar n) \tag{439}$$

$$\delta = \sigma/(qn) \qquad \theta_\delta = \theta_\sigma - \theta_n = \theta_t \tag{440}$$

From equations (288) through (290) and (438) through (440) it follows that

$$E_x = \delta E_y B_z \tag{441}$$

$$\theta_{Ex} - \theta_{Ey} = \theta_\delta + \theta_{Bz} = \theta_\sigma - \theta_n + \theta_{Bz} = \theta_t + \theta_{Bz} = \theta_x - \theta_y \tag{442}$$

In fact

$$\theta_{Ex} = \theta_y - \theta_t + \theta_{Bz} = \theta_x - 2\theta_t \tag{443}$$

$$\theta_{Ey} = \theta_y - 2\theta_t \tag{444}$$

The Hall effect, both classical and quantum, may possibly be used to determine $\theta_x$, $\theta_y$ and $\theta_t$.

6. ELECTROMAGNETIC WAVES AND BROKEN SYMMETRY. This section considers the effects of broken symmetry spacetime on the propagation of electromagnetic waves in matter and the vacuum. In bulk matter it is gravity which is primarily responsible for the broken symmetry of space and time.$^{12}$ Therefore gravity is expected to influence the propagation of electromagnetic waves in the vicinity of the planets and stars. The effects of the broken symmetry of spacetime on the propagation of electromagnetic waves in the vicinity of the earth may be expected to be larger than the effects of a gravitational redshift (general relativity) which is significant only in massive bodies like stars.


A.  Asymmetric  Electromagnetic  Wave  Equations. 


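Consider broken symmetry electromagnetic waves propagating in the z direction. Then equation (113) becomes

$$\frac{\partial\bar H_y}{\partial\bar z} = -\frac{\partial\bar D_x}{\partial\bar t} \tag{445}$$

$$\frac{\partial\bar H_x}{\partial\bar z} = \frac{\partial\bar D_y}{\partial\bar t} \tag{446}$$

Equation (445) is equivalent to

$$\cos\beta_{zz}\sec\beta_{Hyz}\frac{\partial H_y}{\partial z} = -\cos\beta_{tt}\sec\beta_{Dxt}\frac{\partial D_x}{\partial t} \tag{447}$$

$$\theta_{Hy} + \beta_{Hyz} - \theta_z - \beta_{zz} = \theta_{Dx} + \beta_{Dxt} - \theta_t - \beta_{tt} \tag{448}$$

where

$$\tan\beta_{Hyz} = \frac{H_y\,\partial\theta_{Hy}/\partial z}{\partial H_y/\partial z} \tag{449}$$

$$\tan\beta_{Dxt} = \frac{D_x\,\partial\theta_{Dx}/\partial t}{\partial D_x/\partial t} \tag{450}$$

Similarly, equation (446) is written as

$$\cos\beta_{zz}\sec\beta_{Hxz}\frac{\partial H_x}{\partial z} = \cos\beta_{tt}\sec\beta_{Dyt}\frac{\partial D_y}{\partial t} \tag{451}$$

$$\theta_{Hx} + \beta_{Hxz} - \theta_z - \beta_{zz} = \theta_{Dy} + \beta_{Dyt} - \theta_t - \beta_{tt} \tag{452}$$

where

$$\tan\beta_{Hxz} = \frac{H_x\,\partial\theta_{Hx}/\partial z}{\partial H_x/\partial z} \tag{453}$$

$$\tan\beta_{Dyt} = \frac{D_y\,\partial\theta_{Dy}/\partial t}{\partial D_y/\partial t} \tag{454}$$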
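
The Maxwell equation (112) for broken symmetry waves becomes

$$\frac{\partial\bar E_y}{\partial\bar z} = \frac{\partial\bar B_x}{\partial\bar t} \tag{455}$$

$$\frac{\partial\bar E_x}{\partial\bar z} = -\frac{\partial\bar B_y}{\partial\bar t} \tag{456}$$

Equation (455) can be rewritten as the following two equations

$$\cos\beta_{zz}\sec\beta_{Eyz}\frac{\partial E_y}{\partial z} = \cos\beta_{tt}\sec\beta_{Bxt}\frac{\partial B_x}{\partial t} \tag{457}$$

$$\theta_{Ey} + \beta_{Eyz} - \theta_z - \beta_{zz} = \theta_{Bx} + \beta_{Bxt} - \theta_t - \beta_{tt} \tag{458}$$

where

$$\tan\beta_{Eyz} = \frac{E_y\,\partial\theta_{Ey}/\partial z}{\partial E_y/\partial z} \tag{459}$$

$$\tan\beta_{Bxt} = \frac{B_x\,\partial\theta_{Bx}/\partial t}{\partial B_x/\partial t} \tag{460}$$

while equation (456) yields

$$\cos\beta_{zz}\sec\beta_{Exz}\frac{\partial E_x}{\partial z} = -\cos\beta_{tt}\sec\beta_{Byt}\frac{\partial B_y}{\partial t} \tag{461}$$

$$\theta_{Ex} + \beta_{Exz} - \theta_z - \beta_{zz} = \theta_{By} + \beta_{Byt} - \theta_t - \beta_{tt} \tag{462}$$

where

$$\tan\beta_{Exz} = \frac{E_x\,\partial\theta_{Ex}/\partial z}{\partial E_x/\partial z} \tag{463}$$

$$\tan\beta_{Byt} = \frac{B_y\,\partial\theta_{By}/\partial t}{\partial B_y/\partial t} \tag{464}$$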


Combining equations (448) and (462) or equations (452) and (458) gives for electromagnetic waves in matter

$$\theta_\mu + \theta_\epsilon = 2(\theta_t + \beta_{tt} - \theta_z - \beta_{zz}) \qquad \theta_\epsilon \approx 2\theta_t - 3\theta_z \qquad \theta_\mu \approx \theta_z \tag{465}$$

where equation (465) corresponds to the wave velocity equation $\bar v^2 = 1/(\bar\epsilon\bar\mu)$ and where uniform material has been assumed so that $\beta_{Dxt} = \beta_{Ext}$, $\beta_{Byt} = \beta_{Hyt}$ and so on. For the vacuum, equation (465) becomes

$$\theta_\mu + \theta_\epsilon = 0 \tag{466}$$

because the light speed in vacuum has a zero internal phase angle and $\theta_z = \theta_t$ for photons in vacuum. For electromagnetic waves in matter $\theta_\mu + \theta_\epsilon$ is determined through equation (465) by the values of $\theta_z$ and $\theta_t$ which depend on the local energy density and pressure and ultimately on gravity.

B. Electromagnetic Waves in Broken Symmetry Matter.


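Consider plane electromagnetic waves propagating in the z direction. Assuming $\bar\epsilon$ and $\bar\mu$ are constants allows equations (445), (446), (455) and (456) to be written as the following wave equation

$$\frac{\partial^2\bar E_x}{\partial\bar z^2} = \frac{1}{\bar v^2}\frac{\partial^2\bar E_x}{\partial\bar t^2} \tag{467}$$

with similar equations for $\bar E_y$, $\bar H_x$ and $\bar H_y$, where $\bar v^2 = 1/(\bar\epsilon\bar\mu)$. If a product solution of the form $\bar E_x = \bar Z(\bar z)\bar T(\bar t)$ is assumed then equation (467) becomes

$$\frac{d^2\bar T}{d\bar t^2} + \bar\omega^2\bar T = 0 \tag{468}$$

$$\frac{d^2\bar Z}{d\bar z^2} + \bar k^2\bar Z = 0 \tag{469}$$

where $\bar v = \bar\omega/\bar k$, and where $\bar k$ = broken symmetry wave number generalization of a standard result$^{27}$

$$\bar k^2 = \bar\mu\bar\epsilon\bar\omega^2 + i\bar\mu\bar\sigma\bar\omega \tag{470}$$

and $\bar k = \bar\alpha + i\bar\beta$ where $\bar\alpha$ and $\bar\beta$ are the following broken symmetry generalizations of the standard equations$^{27}$

$$\bar\alpha^2 = \frac{\bar\mu\bar\epsilon\bar\omega^2}{2}\left\{\left[1 + \left(\frac{\bar\sigma}{\bar\epsilon\bar\omega}\right)^2\right]^{1/2} + 1\right\} \tag{471}$$

$$\bar\beta^2 = \frac{\bar\mu\bar\epsilon\bar\omega^2}{2}\left\{\left[1 + \left(\frac{\bar\sigma}{\bar\epsilon\bar\omega}\right)^2\right]^{1/2} - 1\right\} \tag{472}$$

from which $\alpha$, $\theta_\alpha$, $\beta$ and $\theta_\beta$ can be obtained. Note that $\bar\alpha$ and $\bar\beta$ are written as

$$\bar\alpha = \alpha e^{j\theta_\alpha} \qquad \bar\beta = \beta e^{j\theta_\beta} \tag{473}$$

The solutions of equations (468) and (469) are respectively

$$\bar T = \bar T_0 e^{-i\bar\omega\bar t} \tag{474}$$

$$\bar Z = \bar Z_0 e^{i\bar k\bar z} \tag{475}$$

Then the broken symmetry electric field is

$$\bar E_x = \bar E_x^0\,e^{-\bar\beta\bar z}\,e^{i(\bar\alpha\bar z - \bar\omega\bar t)} \tag{476}$$

Writing the broken symmetry electric field as

$$\bar E_x = E_x e^{j\theta_{Ex}} \tag{477}$$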


gives  the  following  equations 
(478)


which corresponds to undamped waves periodic in spacetime with broken internal symmetries that satisfy equations (485) and (486). Because the local internal phase structure of spacetime in bulk matter is determined primarily by gravitation it follows that equations (478) through (488) describe the effects of gravity on electromagnetic wave propagation.
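
For orientation, the standard real-valued limit of the wave number relation in equation (470), with all internal phase angles set to zero, can be checked numerically. The Python sketch below is an illustration under assumptions: the function names and material values are hypothetical, and Python's `1j` here plays the role of the ordinary imaginary unit $i$ of equation (470), not the internal unit $j$. It splits $k = \alpha + i\beta$ by taking a complex square root and compares the result with the textbook closed forms that equations (471) and (472) generalize.

```python
import cmath
import math

def wave_number_parts(mu, eps, sigma, omega):
    """Return (alpha, beta) with k = alpha + i*beta and k**2 from equation (470)."""
    k = cmath.sqrt(mu * eps * omega ** 2 + 1j * mu * sigma * omega)
    return k.real, k.imag

def wave_number_closed_form(mu, eps, sigma, omega):
    """Textbook closed forms for alpha and beta in a conducting medium."""
    root = math.sqrt(1 + (sigma / (eps * omega)) ** 2)
    prefactor = mu * eps * omega ** 2 / 2
    return math.sqrt(prefactor * (root + 1)), math.sqrt(prefactor * (root - 1))

if __name__ == "__main__":
    # Hypothetical medium: relative permittivity 80, conductivity 4e-3 S/m, 1 MHz.
    mu = 4e-7 * math.pi
    eps = 80 * 8.854e-12
    sigma = 4e-3
    omega = 2 * math.pi * 1e6
    print(wave_number_parts(mu, eps, sigma, omega))
    print(wave_number_closed_form(mu, eps, sigma, omega))
```

The two results satisfy $\alpha^2 - \beta^2 = \mu\epsilon\omega^2$ and $2\alpha\beta = \mu\sigma\omega$, which is all that the splitting of equation (470) requires.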

C.  Doppler  Effect  for  Waves  with  Broken  Symmetry. 

The generalization of the conventional Doppler formula for the frequency of waves from a source moving with velocity $\bar v$ at an angle $\bar\psi$ relative to the direction of a stationary observer is given by

$$\bar f' = \bar f\left(1 + \frac{\bar v}{c}\cos\bar\psi\right) \tag{489}$$

where $\bar f'$ and $\bar f$ = complex number frequencies at the observer and source respectively, $\bar v = v e^{j\theta_v}$ = complex number speed of the source and $\bar\psi$ = complex number angle between the direction of the source and the direction of the observer. The complex number frequencies are written as

$$\bar f = f e^{j\theta_f} = 1/\bar T = (1/T)\,e^{-j\theta_t} \qquad \bar f' = f' e^{j\theta'_f} = 1/\bar T' = (1/T')\,e^{-j\theta'_t} \tag{490}$$

where $T$ and $T'$ = magnitudes of complex number periods of the wave motion at the source and observer respectively, and $\theta_t = -\theta_f$ and $\theta'_t = -\theta'_f$. The conventional Doppler formula can be written as$^{27}$

$$f'_c = f_c\left(1 + \frac{v_m}{c}\cos\psi_m\right) \tag{491}$$

where $f'_c$ = conventionally calculated frequency expected to be seen at the observer, $f_c$ = conventionally calculated frequency at source, $v_m$ = measured source speed, and $\psi_m$ = measured angle between the source direction of motion and the observer's direction. The conventionally calculated frequencies are given by

$$f_c = 1/T_m \qquad f'_c = 1/T'_m \tag{492}$$

where $T_m$ and $T'_m$ = measured wave periods at the source and observer respectively. Frequencies are derived from periods so that for broken symmetry spacetime

$$f = 1/T \qquad f' = 1/T' \tag{493}$$

$$T_m = T\cos\theta_t \qquad T'_m = T'\cos\theta'_t \tag{494}$$

Equation (493) holds only for the magnitudes of the frequency and period, not for their measured values.

From equation (489) it follows that

$$f'\cos\theta'_f = f\left[\cos\theta_f + \frac{vC_\psi}{c}\cos(\theta_f + \theta_v - \theta_{c\psi})\right] \tag{495}$$

$$f'\sin\theta'_f = f\left[\sin\theta_f + \frac{vC_\psi}{c}\sin(\theta_f + \theta_v - \theta_{c\psi})\right] \tag{496}$$

where $C_\psi$ and $\theta_{c\psi}$ are given by equations (7) and (9) respectively. It is easy to see that equation (495) can be rewritten as

$$f'_m = f_m\left(1 + \frac{v_m}{c}F\right) \tag{497}$$

where $F$ is given by

$$F = \frac{C_\psi\cos(\theta_f + \theta_v - \theta_{c\psi})}{\cos\theta_f\cos\theta_v} \tag{498}$$


and where $f'_m$ and $f_m$ = frequency measured at the observer and at the source respectively and are given by$^{12}$

$$f_m = f\cos\theta_f = \frac{1}{T_m}\cos^2\theta_t = f_c\cos^2\theta_t \qquad f'_m = f'\cos\theta'_f = \frac{1}{T'_m}\cos^2\theta'_t = f'_c\cos^2\theta'_t \tag{499}$$

and

$$\frac{f_c - f_m}{f_c} = \sin^2\theta_t \tag{500}$$

$$\frac{f'_c - f'_m}{f'_c} = \sin^2\theta'_t \tag{501}$$

Combining equations (491) and (497) gives

$$f'_c - f'_m = f_c - f_m + \frac{v_m}{c}(f_c\cos\psi_m - f_m F) \tag{502}$$

for the difference between the measured and conventionally predicted frequencies at the observer's position. Equation (502) can also be written as

$$f'_c\sin^2\theta'_t - f_c\sin^2\theta_t = \frac{v_m}{c}(f_c\cos\psi_m - f_m F) \tag{503A}$$

$$f'_c\cos^2\theta'_t = f_c\cos^2\theta_t\left(1 + \frac{v_m}{c}F\right) \tag{503B}$$


When $\psi = \pi/2$, then

$$\theta_\psi(\pi/2) = 0 \qquad \psi_m = \psi = \pi/2 \tag{504}$$

$$C_\psi(\pi/2) = 0 \qquad \theta_{c\psi}(\pi/2) = 0 \tag{505}$$

and the conventional equation (491) gives $f'_c = f_c$ while the broken symmetry equation (497) gives $f'_m = f_m$ so that $f'_c - f'_m = f_c - f_m$ for this case as can be seen from equation (502). For this case also $\theta'_t = \theta_t$.

When $\psi = 0$ it follows that

$$\theta_\psi(0) = 0 \tag{506}$$

$$C_\psi(0) = 1 \qquad \theta_{c\psi}(0) = 0$$

$$F = \frac{\cos(\theta_f + \theta_v)}{\cos\theta_f\cos\theta_v} = 1 - \tan\theta_f\tan\theta_v = 1 + \tan\theta_t\tan\theta_v \approx 1 + \theta_t\theta_v \approx 1 + \theta_t(\theta_x - \theta_t) \tag{507}$$

From equation (502) with $\psi = \psi_m = 0$ it follows that

$$f'_c - f'_m = (f_c - f_m)\left(1 + \frac{v_m}{c}\right) - \frac{v_m f_m}{c}\tan\theta_t\tan\theta_v \approx (f_c - f_m)\left(1 + \frac{v_m}{c}\right) - \frac{v_m f_m}{c}\,\theta_t\theta_v \tag{508}$$

Measurements of $f'_m - f'_c$ and $f_m - f_c$ may possibly be used to determine $\theta_t$, $\theta_x$, $\theta_y$, $\theta_z$ and $\theta'_t$, $\theta'_x$, $\theta'_y$, $\theta'_z$. For motion of the source away from the observer $\psi = \pi$, and the value of $\psi_m$ to be used in equations (502) or (503) is $\psi_m = \pi\cos\theta_\psi(\pi)$.
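
The head-on case $\psi = 0$ lends itself to a quick consistency check between the general difference formula, equation (502), and its specialized form, equation (508). The Python sketch below is a minimal illustration; the function names and all numerical values are assumptions chosen for the example, and it uses $\theta_f = -\theta_t$ together with $f_m = f_c\cos^2\theta_t$ from equation (499).

```python
import math

def doppler_F_head_on(theta_t, theta_v):
    """Factor F of equation (507) for psi = 0, using theta_f = -theta_t."""
    theta_f = -theta_t
    return math.cos(theta_f + theta_v) / (math.cos(theta_f) * math.cos(theta_v))

def gap_from_502(f_c, f_m, v_m_over_c, theta_t, theta_v):
    """f'_c - f'_m from equation (502) with psi_m = 0, so cos(psi_m) = 1."""
    F = doppler_F_head_on(theta_t, theta_v)
    return f_c - f_m + v_m_over_c * (f_c - f_m * F)

def gap_from_508(f_c, f_m, v_m_over_c, theta_t, theta_v):
    """f'_c - f'_m from the exact (tangent) form of equation (508)."""
    return (f_c - f_m) * (1 + v_m_over_c) - v_m_over_c * f_m * math.tan(theta_t) * math.tan(theta_v)

if __name__ == "__main__":
    # Hypothetical values: 1 GHz source, small internal phase angles, v_m/c = 1e-4.
    f_c, theta_t, theta_v, beta = 1.0e9, 0.01, 0.02, 1.0e-4
    f_m = f_c * math.cos(theta_t) ** 2      # equation (499)
    print(gap_from_502(f_c, f_m, beta, theta_t, theta_v))
    print(gap_from_508(f_c, f_m, beta, theta_t, theta_v))
```

The two routes agree to rounding error, and for small angles the tangent product may be replaced by $\theta_t\theta_v$ as in the approximate form of equation (508).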

7. CONCLUSIONS. The broken symmetry of space and time affects the elementary calculations of the measured physical quantities of electromagnetism such as charge and current densities and the electric and magnetic field vectors. The effects are manifested through the internal phase angles of the charge and current densities and the field vectors, which in turn are related to the internal phase angles of space and time coordinates. In the vicinity of the earth the internal phase angles of the spacetime coordinates and of the electromagnetic field vectors are determined primarily by gravity. Therefore gravitation must be considered to affect the basic calculations of electromagnetic theory. The gravity induced broken symmetry of spacetime should have measurable consequences in simple electromagnetic phenomena such as the electric resistance of wires, the propagation of electromagnetic waves in matter, and the Hall effect. In some cases atomic and molecular structure effects may induce a broken symmetry in the space and time coordinates which is greater over macroscopic distances than the corresponding effect due to gravity. This is true for coherent states such as occur with superconductivity. In this case the induced coherent time state may be responsible for the high $T_c$ superconductivity phenomenon in the planar copper oxides. The coherent time state elevates the normalized superconductivity energy gap by a factor $6/\pi \approx 1.91$ and may explain the experimentally observed enhanced normalized energy gaps associated with high temperature superconductors.

QUANTUM MECHANICS AND THE BROKEN
SYMMETRY OF SPACE AND TIME


Richard  A.  Weiss 

U.  S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi  39180 


ABSTRACT. The quantum mechanics of particles located in spacetime with broken internal symmetries is investigated. The forms of the momentum and angular momentum operators in asymmetric space and time are developed. Schrödinger's equation is written for both external and internal motion in asymmetric spacetime, and the eigenvalues and eigenfunctions for a free particle and a rotating object are determined. The problem of a particle in a box is considered for both an internal and an external space box. The energy eigenvalues and eigenfunctions for the one- and three-dimensional harmonic oscillator with broken internal symmetries are determined, and the concept of an internal space harmonic oscillator is introduced. The addition of angular momenta in a spacetime with broken azimuthal symmetry is examined and the existence of a corresponding gauge boson is investigated. In particular, the addition of spin and orbital angular momentum is considered for broken symmetry space and time, and the eigenvalues of the total angular momentum operator are calculated. The general problem of measurement in quantum mechanics is considered and the dichotomy of spacetime points in a continuum and the Heisenberg uncertainty principle is examined. Applications to high temperature superconductivity are suggested.


1. INTRODUCTION. The concepts of broken global and local symmetries have a long history in quantum mechanics.$^{1-3}$ Even in classical mechanics the concept of symmetry plays an important role; consider only the fact that the translational and rotational symmetry of space gives rise to the laws of conservation of momentum and angular momentum respectively, and translational symmetry in time yields the law of conservation of energy.$^{3-5}$ For instance the application of an external torque breaks the rotational symmetry of a system and the angular momentum becomes time dependent. In quantum mechanics the concept of broken symmetry becomes even more important when it is applied to global and gauge (local) symmetries.$^{1-3}$ For example, a broken global symmetry is associated with a massless scalar particle called the scalar Goldstone boson, while a broken gauge symmetry is associated with massive vector bosons and massive scalar bosons.$^{1}$ Examples of the particles associated with a broken gauge symmetry are the massive $W^+$, $W^-$, $Z^0$ vector bosons and the neutral massive scalar Higgs boson $H^0$ that are associated with the standard electroweak theory.$^{1-3}$ At the macroscopic scale, a gauge theory of bulk matter has been developed on the basis of a relativistic trace equation whose form suggests the broken symmetries of the thermodynamic functions.$^{6,7}$

Recently a new form of broken symmetry associated with spacetime coordinates has been proposed.$^{7}$ The broken symmetry of the spacetime coordinates is associated with the broken symmetry of the pressure field in matter and the vacuum.$^{7}$ The manifestation of the broken symmetry of spacetime appears in the internal phase angles of the space and time coordinates. Euler's equation of motion relates the internal angle of the pressure to the internal phase angles of the coordinates. The skewed nature of spacetime affects classical mechanics, hydrodynamics, the equilibrium of stars and planets, electromagnetism, atomic processes and the structure of atoms.$^{7}$


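The complex number cartesian space and time coordinates are written

$$\bar\alpha = \alpha e^{j\theta_\alpha} \tag{1}$$

$$\bar t = t e^{j\theta_t} \tag{2}$$

where $\alpha = x, y, z$. Quantities which often occur in the calculations with broken symmetry spacetime are the angles $\beta_{\alpha\alpha}$ and $\beta_{tt}$ which are defined by

$$\tan\beta_{\alpha\alpha} = \alpha\,\partial\theta_\alpha/\partial\alpha \tag{3}$$

$$\tan\beta_{tt} = t\,\partial\theta_t/\partial t \tag{4}$$

The measured values of the space and time coordinates are given by$^{7}$

$$\alpha_m = \alpha\cos\theta_\alpha \tag{5}$$

$$t_m = t\cos\theta_t \tag{6}$$

and therefore

$$d\alpha_m/d\alpha = \cos\theta_\alpha - \alpha_m\tan\theta_\alpha\,d\theta_\alpha/d\alpha \tag{7}$$

$$dt_m/dt = \cos\theta_t - t_m\tan\theta_t\,d\theta_t/dt \tag{8}$$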
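
Equation (7) is the product rule applied to equation (5), and a finite-difference check makes this concrete. In the Python sketch below the linear dependence of the internal phase angle on the coordinate is purely an assumed example, as are the function names and numbers; nothing about the actual form of $\theta_\alpha(\alpha)$ is implied.

```python
import math

def theta_alpha(alpha):
    """Assumed (hypothetical) internal phase angle as a function of the coordinate."""
    return 0.1 + 0.05 * alpha

def dtheta_alpha(alpha):
    """Derivative of the assumed phase angle."""
    return 0.05

def measured_coordinate(alpha):
    """Equation (5): alpha_m = alpha * cos(theta_alpha)."""
    return alpha * math.cos(theta_alpha(alpha))

def measured_derivative(alpha):
    """Right hand side of equation (7)."""
    th = theta_alpha(alpha)
    return math.cos(th) - measured_coordinate(alpha) * math.tan(th) * dtheta_alpha(alpha)

if __name__ == "__main__":
    alpha, step = 2.0, 1.0e-6
    numeric = (measured_coordinate(alpha + step) - measured_coordinate(alpha - step)) / (2 * step)
    print(numeric, measured_derivative(alpha))
```

The same check applies to equation (8) with $t$ in place of $\alpha$.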


The kinematics of particles in broken symmetry spacetime is treated in Reference 7. The definition of measured coordinates in equations (5) and (6) alleviates the dichotomy in quantum electrodynamics of having zero dimensional points in a spacetime continuum simultaneously with the validity of the Heisenberg uncertainty principle which implies infinite energy and momentum for these points. From equations (5) and (6) it is clear that many values of $\alpha$ and $t$ can correspond to the measured values $\alpha_m$ and $t_m$ respectively. A measured point in spacetime actually corresponds to an infinite set of possible values of $\alpha$ and $t$ corresponding to the possible ambient conditions which determine the size of the internal phase angles of the coordinates. The apparent infinite momentum and energy values predicted by the uncertainty principle for points in spacetime do not occur for measured spacetime coordinates.


One of the peculiarities of space and time with broken internal symmetries is the possibility of internal motion of a particle which is externally at rest in spacetime. In particular, if the magnitude of $\bar\alpha$ is fixed and $t$ is taken to be a scalar parameter then

$$d\bar\alpha/dt = j\bar\alpha\,d\theta_\alpha/dt \tag{9}$$

If the time is also treated as a complex number and if the magnitudes $\alpha$ and $t$ are both fixed, then space and time rotate internally and

$$\frac{d\bar\alpha}{d\bar t} = \frac{\bar\alpha}{\bar t}\frac{d\theta_\alpha}{d\theta_t} = \frac{\alpha}{t}\frac{d\theta_\alpha}{d\theta_t}\,e^{j(\theta_\alpha - \theta_t)} \tag{10}$$

where $\alpha = x, y, z$. With internal spin it is possible to have momentum and kinetic energy for an externally stationary particle (with $\alpha$ = constant and $t$ = constant). For external motion $\theta_\alpha$ and $\theta_t$ = constants while $\alpha$ and $t$ are variables. However, the measured values of space and time coordinates given by equations (5) and (6) vary for both internal and external motion.

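The angles of a spherical polar coordinate system are written as

$$\bar\phi = \phi e^{j\theta_\phi} \qquad \bar\psi = \psi e^{j\theta_\psi} \tag{11}$$

where $\bar\phi$ and $\bar\psi$ = complex number azimuthal angle and zenith angle respectively. The relevant trigonometric functions associated with $\bar\phi$ and $\bar\psi$ are

$$\sin\bar\phi = S_\phi e^{j\theta_{s\phi}} \tag{12}$$

$$\cos\bar\phi = C_\phi e^{-j\theta_{c\phi}} \tag{13}$$

$$\tan\bar\phi = (S_\phi/C_\phi)\,e^{j(\theta_{s\phi} + \theta_{c\phi})} \tag{14}$$

where

$$S_\phi = [\sin^2(\phi\cos\theta_\phi) + \sinh^2(\phi\sin\theta_\phi)]^{1/2} \tag{15}$$

$$C_\phi = [\cos^2(\phi\cos\theta_\phi) + \sinh^2(\phi\sin\theta_\phi)]^{1/2} \tag{16}$$

$$\tan\theta_{s\phi} = \cot(\phi\cos\theta_\phi)\tanh(\phi\sin\theta_\phi) \tag{17}$$

$$\tan\theta_{c\phi} = \tan(\phi\cos\theta_\phi)\tanh(\phi\sin\theta_\phi) \tag{18}$$

with similar expressions for $\bar\psi$. The measured angles are given by

$$\phi_m = \phi\cos\theta_\phi \qquad \psi_m = \psi\cos\theta_\psi \tag{19}$$

The measured sin, cos and tan functions are

$$(\sin\bar\phi)_m = S_\phi\cos\theta_{s\phi} \tag{20}$$

$$(\cos\bar\phi)_m = C_\phi\cos\theta_{c\phi} \tag{21}$$

$$(\tan\bar\phi)_m = (S_\phi/C_\phi)\cos(\theta_{s\phi} + \theta_{c\phi}) \tag{22}$$

From equation (19) it follows that

$$\partial\phi_m/\partial\phi = \cos\theta_\phi - \phi_m\tan\theta_\phi\,\partial\theta_\phi/\partial\phi \tag{23}$$

$$\partial\psi_m/\partial\psi = \cos\theta_\psi - \psi_m\tan\theta_\psi\,\partial\theta_\psi/\partial\psi \tag{24}$$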

In  this  way  the  measured  values  of  angles  and  their  trigonometric  functions  are 
calculated. 
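
The magnitude and phase expressions in equations (15) through (18) are the polar forms of the sine and cosine of a complex argument, which can be confirmed against a standard complex math library. The Python sketch below is illustrative: the function names and sample angle are assumptions, Python's `1j` stands in for the internal unit $j$, and the arctangent is taken on its principal branch, which is adequate when $0 < \phi\cos\theta_\phi < \pi/2$.

```python
import cmath
import math

def sin_polar(phi, theta_phi):
    """Magnitude S_phi and phase theta_s_phi from equations (15) and (17)."""
    x = phi * math.cos(theta_phi)
    y = phi * math.sin(theta_phi)
    S = math.sqrt(math.sin(x) ** 2 + math.sinh(y) ** 2)
    theta_s = math.atan(math.tanh(y) / math.tan(x))
    return S, theta_s

def cos_polar(phi, theta_phi):
    """Magnitude C_phi and phase theta_c_phi from equations (16) and (18)."""
    x = phi * math.cos(theta_phi)
    y = phi * math.sin(theta_phi)
    C = math.sqrt(math.cos(x) ** 2 + math.sinh(y) ** 2)
    theta_c = math.atan(math.tan(x) * math.tanh(y))
    return C, theta_c

if __name__ == "__main__":
    phi, theta_phi = 1.0, 0.2            # hypothetical magnitude and internal phase
    phi_bar = phi * cmath.exp(1j * theta_phi)
    S, th_s = sin_polar(phi, theta_phi)
    C, th_c = cos_polar(phi, theta_phi)
    print(S * cmath.exp(1j * th_s), cmath.sin(phi_bar))    # equation (12)
    print(C * cmath.exp(-1j * th_c), cmath.cos(phi_bar))   # equation (13)
    print(S * math.cos(th_s), C * math.cos(th_c))          # measured values, eqs. (20), (21)
```

The measured sine and cosine of equations (20) and (21) are simply the real parts of the complex functions.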

This paper investigates the effects of the skewed nature of space and time on some basic quantum systems. In particular Section 2 considers skewed single particle momentum, angular momentum and energy operators. Section 3 studies the Schrödinger equation for asymmetric spacetime. Section 4 develops the theory of a particle confined to a box in external and internal space. Section 5 calculates the eigenvalues and eigenfunctions for the harmonic oscillator with broken internal symmetry, and finally Section 6 investigates the addition of angular momentum in broken symmetry spacetime.

2. MOMENTUM, ANGULAR MOMENTUM AND ENERGY OPERATORS IN ASYMMETRIC SPACETIME. This section develops the expressions for the measured values of the momentum, angular momentum and energy operators in space and time with broken internal symmetries.


A.  Momentum  and  Energy  Operators. 


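The complex number momentum operators for cartesian coordinates with broken internal symmetries are$^{7}$

$$\bar p_\alpha = p_\alpha e^{j\theta_{p\alpha}} \qquad \bar p_\alpha = -i\hbar\,\partial/\partial\bar\alpha \tag{25}$$

where $\alpha = x, y, z$ and

$$p_\alpha = -i\hbar\cos\beta_{\alpha\alpha}\,\partial/\partial\alpha \tag{26}$$

$$\theta_{p\alpha} = -\theta_\alpha - \beta_{\alpha\alpha} \tag{27}$$

where $\beta_{\alpha\alpha}$ is given by equation (3). The energy operator is given by$^{7}$

$$\bar E = i\hbar\,\partial/\partial\bar t \tag{28}$$

$$E = i\hbar\cos\beta_{tt}\,\partial/\partial t \tag{29}$$

$$\theta_E = -\theta_t - \beta_{tt} \tag{30}$$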

where $\beta_{tt}$ is given by equation (4). The imaginary numbers $i$ and $j$ satisfy

$$i^2 = -1 \qquad j^2 = -1 \qquad ij = ji \tag{31}$$

Therefore the Dirac, Klein-Gordon and Schrödinger equations must be complex number equations in internal space, and the energy eigenvalues and eigenfunctions must also be represented as complex numbers in internal space.$^{7}$ If the wave function to which the operators $\bar p_\alpha$ and $\bar E$ are applied is real then the measured momenta and energy operators are given by

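$$ p_{am} = \cos\theta_{pa}\,p_a = -i\hbar\cos\beta_{aa}\cos\theta_{pa}\,\partial/\partial a \tag{32} $$

$$ E_m = \cos\theta_E\,E = i\hbar\cos\beta_{tt}\cos\theta_E\,\partial/\partial t \tag{33} $$

For zero internal phase angles the following results are obtained

$$ p_a = -i\hbar\,\partial/\partial a \tag{34} $$

$$ E = i\hbar\,\partial/\partial t \tag{35} $$

which are the standard operators of quantum mechanics.⁸,⁹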

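Generally the momentum and energy operators given by equations (25) and (28) respectively are applied to a complex number wave function exhibiting an internal phase angle as follows

$$ \bar{\Psi} = \Psi e^{j\theta_\Psi} \tag{36} $$

whose measured value is given by

$$ \Psi_m = \Psi\cos\theta_\Psi \tag{37} $$

The operator equations are then written as

$$ \bar{p}_a\bar{\Psi} = -i\hbar\cos\beta_{aa}\sec\beta_{\Psi a}\,\partial\Psi/\partial a\; e^{j\Phi_{\Psi a}} \tag{38} $$

$$ \bar{E}\bar{\Psi} = i\hbar\cos\beta_{tt}\sec\beta_{\Psi t}\,\partial\Psi/\partial t\; e^{j\Phi_{\Psi t}} \tag{39} $$

where

$$ \tan\beta_{\Psi a} = (\Psi\,\partial\theta_\Psi/\partial a)/(\partial\Psi/\partial a) \tag{40} $$

$$ \tan\beta_{\Psi t} = (\Psi\,\partial\theta_\Psi/\partial t)/(\partial\Psi/\partial t) \tag{41} $$

$$ \Phi_{\Psi a} = \theta_\Psi + \beta_{\Psi a} - \theta_a - \beta_{aa} \tag{42} $$

$$ \Phi_{\Psi t} = \theta_\Psi + \beta_{\Psi t} - \theta_t - \beta_{tt} \tag{43} $$

The real and imaginary parts of equations (38) and (39) are respectively

$$ (\bar{p}_a\bar{\Psi})_R = -i\hbar\cos\Phi_{\Psi a}\cos\beta_{aa}\sec\beta_{\Psi a}\,\partial\Psi/\partial a \tag{44} $$

$$ (\bar{p}_a\bar{\Psi})_I = -i\hbar\sin\Phi_{\Psi a}\cos\beta_{aa}\sec\beta_{\Psi a}\,\partial\Psi/\partial a \tag{45} $$

$$ (\bar{E}\bar{\Psi})_R = i\hbar\cos\Phi_{\Psi t}\cos\beta_{tt}\sec\beta_{\Psi t}\,\partial\Psi/\partial t \tag{46} $$

$$ (\bar{E}\bar{\Psi})_I = i\hbar\sin\Phi_{\Psi t}\cos\beta_{tt}\sec\beta_{\Psi t}\,\partial\Psi/\partial t \tag{47} $$

These expressions will be used to calculate the measured values of momentum and energy.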

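The measured values of the linear momenta and energy are defined by

$$ p'_a\Psi = (\bar{p}_a\bar{\Psi})_R = p_{am}\Psi_m \tag{48} $$

$$ E'\Psi = (\bar{E}\bar{\Psi})_R = E_m\Psi_m \tag{49} $$

where

$$ p'_a = -i\hbar\cos\Phi_{\Psi a}\cos\beta_{aa}\sec\beta_{\Psi a}\,\partial/\partial a \tag{50} $$

$$ E' = i\hbar\cos\Phi_{\Psi t}\cos\beta_{tt}\sec\beta_{\Psi t}\,\partial/\partial t \tag{51} $$

and where $p_{am}$ and $E_m$ are the measured momentum and energy operators whose form will now be determined. This is done by using equation (37), which gives

$$ \partial\Psi/\partial a = A_a\,\partial\Psi_m/\partial a_m + B_a\Psi_m \tag{52} $$

$$ \partial\Psi/\partial t = A_t\,\partial\Psi_m/\partial t_m + B_t\Psi_m \tag{53} $$

where

$$ A_a = \sec\theta_\Psi\,\partial a_m/\partial a \tag{54} $$

$$ B_a = \sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial a \tag{55} $$

$$ A_t = \sec\theta_\Psi\,\partial t_m/\partial t \tag{56} $$

$$ B_t = \sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial t \tag{57} $$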


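Combining equations (48) through (57) gives the measured momentum and energy operators respectively as

$$ p_{am} = -i\hbar\,(C_a\,\partial/\partial a_m + D_a) \tag{58} $$

$$ E_m = i\hbar\,(C_t\,\partial/\partial t_m + D_t) \tag{59} $$

where

$$ C_a = \cos\Phi_{\Psi a}\cos\beta_{aa}\sec\beta_{\Psi a}\sec\theta_\Psi\,\partial a_m/\partial a \tag{60} $$

$$ D_a = \cos\Phi_{\Psi a}\cos\beta_{aa}\sec\beta_{\Psi a}\sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial a \tag{61} $$

$$ C_t = \cos\Phi_{\Psi t}\cos\beta_{tt}\sec\beta_{\Psi t}\sec\theta_\Psi\,\partial t_m/\partial t \tag{62} $$

$$ D_t = \cos\Phi_{\Psi t}\cos\beta_{tt}\sec\beta_{\Psi t}\sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial t \tag{63} $$

where $\partial a_m/\partial a$ and $\partial t_m/\partial t$ are given by equations (7) and (8) respectively. Therefore the measured momentum and energy operators generally include the effects of the internal phase angle of the wave function. If the internal angle of the wave function is zero, $D_a = 0$ and $D_t = 0$, and

$$ p_{am} = -i\hbar\cos\theta_{pa}\cos\beta_{aa}\,(\partial a_m/\partial a)\,\partial/\partial a_m \tag{64} $$

$$ E_m = i\hbar\cos\theta_E\cos\beta_{tt}\,(\partial t_m/\partial t)\,\partial/\partial t_m \tag{65} $$

which agrees with equations (32) and (33).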

The conventional quantum mechanical momentum and energy operators are given by

$$ p_{ac}\Psi_m = -i\hbar\,\partial\Psi_m/\partial a_m \tag{66} $$

$$ E_c\Psi_m = i\hbar\,\partial\Psi_m/\partial t_m \tag{67} $$

where $p_{ac}$ and $E_c$ = conventional momentum and energy operators. Combining equations (58), (59), (66) and (67) gives

$$ p_{ac} - p_{am} = -i\hbar\,[(1 - C_a)\,\partial/\partial a_m - D_a] \tag{68} $$

$$ E_c - E_m = i\hbar\,[(1 - C_t)\,\partial/\partial t_m - D_t] \tag{69} $$

Differences between measured and conventionally calculated values of momentum and energy should be detectable (especially for particles in bulk matter) due to the broken symmetry of space and time. Finally, consider the momentum and energy operators for the case where space and time rotate internally for constant values of $x, y, z, t$. For this case equations (25) and (28) give respectively, for $d\bar{a} = j\bar{a}\,d\theta_a$ and $d\bar{t} = j\bar{t}\,d\theta_t$,

$$ \bar{p}_a = (ij\hbar/\bar{a})\,\partial/\partial\theta_a = (ij\hbar/a)\,e^{-j\theta_a}\,\partial/\partial\theta_a \tag{70} $$

$$ \bar{E} = -(ij\hbar/\bar{t})\,\partial/\partial\theta_t = -(ij\hbar/t)\,e^{-j\theta_t}\,\partial/\partial\theta_t \tag{71} $$

These expressions will be used in Sections 4 and 5, where bounded motion in internal space is considered.

B.  Angular  Momentum  Operators  in  Broken  Symmetry  Spacetime. 


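The generalization of the scalar forms of the angular momentum operators is given by⁸,⁹

$$ \bar{L}_x = -i\hbar\,(\bar{y}\,\partial/\partial\bar{z} - \bar{z}\,\partial/\partial\bar{y}) \tag{72} $$

$$ \bar{L}_y = -i\hbar\,(\bar{z}\,\partial/\partial\bar{x} - \bar{x}\,\partial/\partial\bar{z}) \tag{73} $$

$$ \bar{L}_z = -i\hbar\,(\bar{x}\,\partial/\partial\bar{y} - \bar{y}\,\partial/\partial\bar{x}) \tag{74} $$

These broken symmetry angular momentum operators obey the standard commutation relations. Similarly, the complex number angular momentum operators can be written as the following generalization⁸,⁹

$$ \bar{L}_x = i\hbar\,(\sin\bar{\phi}\,\partial/\partial\bar{\psi} + \cot\bar{\psi}\cos\bar{\phi}\,\partial/\partial\bar{\phi}) \tag{75} $$

$$ \bar{L}_y = i\hbar\,(-\cos\bar{\phi}\,\partial/\partial\bar{\psi} + \cot\bar{\psi}\sin\bar{\phi}\,\partial/\partial\bar{\phi}) \tag{76} $$

$$ \bar{L}_z = -i\hbar\,\partial/\partial\bar{\phi} \tag{77} $$

The eigenfunctions of $\bar{L}_z$ have been shown to be given by⁷

$$ \bar{\Phi} = A e^{i\bar{M}\bar{\phi}}, \qquad \bar{M} = M e^{j\theta_M} \tag{78} $$

$$ \bar{L}_z\bar{\Phi} = \hbar\bar{M}\bar{\Phi} \tag{79} $$

where for periodic external rotations with $\theta_\phi$ = constant⁷

$$ \bar{M} = \bar{M}_e = M_e e^{j\theta_{Me}} \tag{80A} $$

$$ M_e = m\cos\theta_\phi, \qquad \theta_{Me} = -\theta_\phi \tag{80B} $$

and for periodicity $\bar{M}_e\bar{\phi} = M_e\phi$ = real number, so that

$$ \bar{\Phi} = \bar{\Phi}_e = A e^{iM_e\phi} \tag{81} $$

where for external motion $\theta_x, \theta_y, \theta_z, \theta_r, \theta_\phi$ and $\theta_\psi$ = constants.

The generalization of the $L^2$ operator eigenvalue equation is given by

$$ \bar{L}^2\bar{W} = \hbar^2\bar{\ell}(\bar{\ell} + 1)\bar{W} \tag{82} $$

where

$$ \bar{L}^2 = -\hbar^2\,[\,1/\sin^2\bar{\psi}\;\partial^2/\partial\bar{\phi}^2 + 1/\sin\bar{\psi}\;\partial/\partial\bar{\psi}(\sin\bar{\psi}\,\partial/\partial\bar{\psi})\,] \tag{83} $$

where for external rotations in broken symmetry space⁷

$$ \bar{\ell} = \bar{\ell}_e = \bar{M}'_e + \nu, \qquad \bar{M}'_e = |m|\cos\theta_\phi\,e^{-j\theta_\phi} \tag{84} $$

where the integer $\nu$ is given by $\nu = \ell - |m|$. The complete angular wave function is given by

$$ \bar{Y} = \bar{\Phi}(\bar{\phi})\,\bar{W}(\bar{\psi}) \tag{85} $$

Combining equations (82) and (84) gives for external rotations in broken symmetry space

$$ \bar{L}^2 = \hbar^2\ell(\ell + 1)\,\bar{g} \tag{86} $$

$$ \bar{g} = [\,1 + (|m|/\ell)(\cos\theta_\phi\,e^{-j\theta_\phi} - 1)\,]\,[\,1 + (|m|/(\ell + 1))(\cos\theta_\phi\,e^{-j\theta_\phi} - 1)\,] \tag{87} $$

The technique for the generalization of the angular momentum rules to broken symmetry spacetime is now obvious. For instance, the raising and lowering operators $\bar{L}_+$ and $\bar{L}_-$ are written for broken symmetry space as8- 1

$$ \bar{L}_\pm = \hbar\,[\bar{\ell}(\bar{\ell} + 1) - \bar{M}(\bar{M} \pm 1)] \tag{88} $$

which for external rotations in broken symmetry space becomes

$$ \bar{L}^e_\pm = \hbar\,[\bar{\ell}_e(\bar{\ell}_e + 1) - \bar{M}_e(\bar{M}_e \pm 1)] = \hbar\,[\ell(\ell + 1)\bar{g} - m(m \pm 1)\bar{s}] \tag{89} $$

where $\bar{g}$ is given in equation (87) and $\bar{s}$ is given by

$$ \bar{s} = 1/(m \pm 1)\;\cos\theta_\phi\,e^{-j\theta_\phi}\,(m\cos\theta_\phi\,e^{-j\theta_\phi} \pm 1) \tag{90} $$

These equations reduce to the standard results when $\theta_\phi = 0$ because for this case $\bar{g} = 1$ and $\bar{s} = 1$.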
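The factors in equations (84), (87) and (90) can be checked numerically. The following is a minimal sketch, not part of the original derivation: it assumes that Python's built-in imaginary unit `1j` stands in for the internal-space unit $j$, and the function names are hypothetical. It confirms that $\bar{\ell}_e(\bar{\ell}_e + 1) = \ell(\ell + 1)\bar{g}$ and that $\bar{M}_e(\bar{M}_e \pm 1) = m(m \pm 1)\bar{s}$.

```python
import cmath
import math


def internal_factor(theta_phi):
    """cos(theta_phi) * exp(-j*theta_phi), the factor multiplying |m| in equation (84)."""
    return math.cos(theta_phi) * cmath.exp(-1j * theta_phi)


def l_external(l, m, theta_phi):
    """Complex quantum number of equation (84): M'_e + nu with nu = l - |m|."""
    return abs(m) * internal_factor(theta_phi) + (l - abs(m))


def g_factor(l, m, theta_phi):
    """Equation (87). Assumes l >= 1 so that the division by l is defined."""
    c = internal_factor(theta_phi)
    return (1 + abs(m) / l * (c - 1)) * (1 + abs(m) / (l + 1) * (c - 1))


def s_factor(m, theta_phi, sign=+1):
    """Equation (90); sign=+1 for the raising and -1 for the lowering operator.
    Assumes m + sign != 0."""
    c = internal_factor(theta_phi)
    return c * (m * c + sign) / (m + sign)


if __name__ == "__main__":
    l, m, theta = 3, 2, 0.3
    le = l_external(l, m, theta)
    print(le * (le + 1), l * (l + 1) * g_factor(l, m, theta))
```

Setting `theta_phi = 0` returns `g_factor = 1` and `s_factor = 1`, which is the reduction to the standard results stated above.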

The development of the measured angular momentum operators proceeds from equations (72) through (74). The definitions of the measured angular momentum operators are as follows

$$ (\bar{L}_x\bar{\Psi})_R = L'_x\Psi = L_{xm}\Psi_m \tag{91} $$

$$ (\bar{L}_y\bar{\Psi})_R = L'_y\Psi = L_{ym}\Psi_m \tag{92} $$

$$ (\bar{L}_z\bar{\Psi})_R = L'_z\Psi = L_{zm}\Psi_m \tag{93} $$

where the measured angular momentum operators $L_{xm}$, $L_{ym}$ and $L_{zm}$ are determined by the evaluation of the left hand sides of equations (91) through (93). Combining equations (1), (3), (72) through (74), and (91) through (93) gives, in a fashion similar to that done for linear momentum,

$$ L'_x = -i\hbar\,(E'_{yz}\,y\,\partial/\partial z - E'_{zy}\,z\,\partial/\partial y) \tag{94} $$

$$ L'_y = -i\hbar\,(E'_{zx}\,z\,\partial/\partial x - E'_{xz}\,x\,\partial/\partial z) \tag{95} $$

$$ L'_z = -i\hbar\,(E'_{xy}\,x\,\partial/\partial y - E'_{yx}\,y\,\partial/\partial x) \tag{96} $$

where

$$ E'_{yz} = \cos\Xi_{\Psi yz}\cos\beta_{zz}\sec\beta_{\Psi z} \tag{97} $$

$$ E'_{zy} = \cos\Xi_{\Psi zy}\cos\beta_{yy}\sec\beta_{\Psi y} \tag{98} $$

$$ E'_{zx} = \cos\Xi_{\Psi zx}\cos\beta_{xx}\sec\beta_{\Psi x} \tag{99} $$

$$ E'_{xz} = \cos\Xi_{\Psi xz}\cos\beta_{zz}\sec\beta_{\Psi z} \tag{100} $$

$$ E'_{xy} = \cos\Xi_{\Psi xy}\cos\beta_{yy}\sec\beta_{\Psi y} \tag{101} $$

$$ E'_{yx} = \cos\Xi_{\Psi yx}\cos\beta_{xx}\sec\beta_{\Psi x} \tag{102} $$

and where

$$ \Xi_{\Psi yz} = \theta_\Psi + \beta_{\Psi z} + \theta_y - \theta_z - \beta_{zz} \tag{103} $$

$$ \Xi_{\Psi zy} = \theta_\Psi + \beta_{\Psi y} + \theta_z - \theta_y - \beta_{yy} \tag{104} $$

$$ \Xi_{\Psi zx} = \theta_\Psi + \beta_{\Psi x} + \theta_z - \theta_x - \beta_{xx} \tag{105} $$

$$ \Xi_{\Psi xz} = \theta_\Psi + \beta_{\Psi z} + \theta_x - \theta_z - \beta_{zz} \tag{106} $$

$$ \Xi_{\Psi xy} = \theta_\Psi + \beta_{\Psi y} + \theta_x - \theta_y - \beta_{yy} \tag{107} $$

$$ \Xi_{\Psi yx} = \theta_\Psi + \beta_{\Psi x} + \theta_y - \theta_x - \beta_{xx} \tag{108} $$

Combining equations (91) through (96) with equations (5) and (37) gives

$$ L_{xm} = -i\hbar\,[\,y_m(E_{yz}\,\partial/\partial z_m + F_{yz}) - z_m(E_{zy}\,\partial/\partial y_m + F_{zy})\,] \tag{109} $$

$$ L_{ym} = -i\hbar\,[\,z_m(E_{zx}\,\partial/\partial x_m + F_{zx}) - x_m(E_{xz}\,\partial/\partial z_m + F_{xz})\,] \tag{110} $$

$$ L_{zm} = -i\hbar\,[\,x_m(E_{xy}\,\partial/\partial y_m + F_{xy}) - y_m(E_{yx}\,\partial/\partial x_m + F_{yx})\,] \tag{111} $$

where

$$ E_{yz} = A_z E'_{yz}\sec\theta_y, \qquad E_{zy} = A_y E'_{zy}\sec\theta_z \tag{112} $$

$$ E_{zx} = A_x E'_{zx}\sec\theta_z, \qquad E_{xz} = A_z E'_{xz}\sec\theta_x \tag{113} $$

$$ E_{xy} = A_y E'_{xy}\sec\theta_x, \qquad E_{yx} = A_x E'_{yx}\sec\theta_y \tag{114} $$

$$ F_{yz} = B_z E'_{yz}\sec\theta_y, \qquad F_{zy} = B_y E'_{zy}\sec\theta_z \tag{115} $$

$$ F_{zx} = B_x E'_{zx}\sec\theta_z, \qquad F_{xz} = B_z E'_{xz}\sec\theta_x \tag{116} $$

$$ F_{xy} = B_y E'_{xy}\sec\theta_x, \qquad F_{yx} = B_x E'_{yx}\sec\theta_y \tag{117} $$

where $A_a$ and $B_a$ are given by equations (54) and (55) respectively. If $\theta_\Psi = 0$, equations (109) through (117) reduce to the real parts of the operator equations (72) through (74) without the internal phase angle of the wave function. The differences between the measured and the conventionally calculated angular momenta are given by

$$ L_{xc} - L_{xm} = -i\hbar\,\{\,y_m[(1 - E_{yz})\,\partial/\partial z_m - F_{yz}] - z_m[(1 - E_{zy})\,\partial/\partial y_m - F_{zy}]\,\} \tag{118} $$

$$ L_{yc} - L_{ym} = -i\hbar\,\{\,z_m[(1 - E_{zx})\,\partial/\partial x_m - F_{zx}] - x_m[(1 - E_{xz})\,\partial/\partial z_m - F_{xz}]\,\} \tag{119} $$

$$ L_{zc} - L_{zm} = -i\hbar\,\{\,x_m[(1 - E_{xy})\,\partial/\partial y_m - F_{xy}] - y_m[(1 - E_{yx})\,\partial/\partial x_m - F_{yx}]\,\} \tag{120} $$

which may be experimentally detectable.

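The same procedure applied to the spherical polar coordinate representation of the angular momenta given by equations (75) through (77) and equations (91) through (93) gives

$$ L'_x = i\hbar\,(G'_x\,\partial/\partial\psi + H'_x\,\partial/\partial\phi) \tag{121} $$

$$ L'_y = i\hbar\,(G'_y\,\partial/\partial\psi + H'_y\,\partial/\partial\phi) \tag{122} $$

$$ L'_z = -i\hbar\,H'_z\,\partial/\partial\phi \tag{123} $$

where

$$ G'_x = S_\phi\cos\Xi_{\Psi x\psi}\cos\beta_{\psi\psi}\sec\beta_{\Psi\psi} \tag{124} $$

$$ G'_y = -C_\phi\cos\Xi_{\Psi y\psi}\cos\beta_{\psi\psi}\sec\beta_{\Psi\psi} \tag{125} $$

$$ H'_x = (C_\psi C_\phi/S_\psi)\cos\Xi_{\Psi x\phi}\cos\beta_{\phi\phi}\sec\beta_{\Psi\phi} \tag{126} $$

$$ H'_y = (C_\psi S_\phi/S_\psi)\cos\Xi_{\Psi y\phi}\cos\beta_{\phi\phi}\sec\beta_{\Psi\phi} \tag{127} $$

$$ H'_z = \cos\Xi_{\Psi z\phi}\cos\beta_{\phi\phi}\sec\beta_{\Psi\phi} \tag{128} $$

where

$$ \Xi_{\Psi x\psi} = \theta_\Psi + \beta_{\Psi\psi} + \theta_{s\phi} - \theta_\psi - \beta_{\psi\psi} \tag{129} $$

$$ \Xi_{\Psi y\psi} = \theta_\Psi + \beta_{\Psi\psi} - \theta_{c\phi} - \theta_\psi - \beta_{\psi\psi} \tag{130} $$

$$ \Xi_{\Psi x\phi} = \theta_\Psi + \beta_{\Psi\phi} - \theta_{c\psi} - \theta_{s\psi} - \theta_{c\phi} - \theta_\phi - \beta_{\phi\phi} \tag{131} $$

$$ \Xi_{\Psi y\phi} = \theta_\Psi + \beta_{\Psi\phi} - \theta_{c\psi} - \theta_{s\psi} + \theta_{s\phi} - \theta_\phi - \beta_{\phi\phi} \tag{132} $$

$$ \Xi_{\Psi z\phi} = \theta_\Psi + \beta_{\Psi\phi} - \theta_\phi - \beta_{\phi\phi} \tag{133} $$

and

$$ \tan\beta_{\Psi\psi} = (\Psi\,\partial\theta_\Psi/\partial\psi)/(\partial\Psi/\partial\psi) \tag{134} $$

$$ \tan\beta_{\Psi\phi} = (\Psi\,\partial\theta_\Psi/\partial\phi)/(\partial\Psi/\partial\phi) \tag{135} $$

The derivatives that appear in equations (121) through (123) are obtained from equation (37) to be

$$ \partial\Psi/\partial\psi = A_\psi\,\partial\Psi_m/\partial\psi_m + B_\psi\Psi_m \tag{136} $$

$$ \partial\Psi/\partial\phi = A_\phi\,\partial\Psi_m/\partial\phi_m + B_\phi\Psi_m \tag{137} $$

where

$$ A_\psi = \sec\theta_\Psi\,\partial\psi_m/\partial\psi \tag{138} $$

$$ B_\psi = \sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial\psi \tag{139} $$

$$ A_\phi = \sec\theta_\Psi\,\partial\phi_m/\partial\phi \tag{140} $$

$$ B_\phi = \sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial\phi \tag{141} $$

where $\partial\psi_m/\partial\psi$ and $\partial\phi_m/\partial\phi$ are given by equations (23) and (24). Combining equations (91) through (93) with equations (121) through (123) and equations (136) and (137) gives

$$ L_{xm} = i\hbar\,(G_x\,\partial/\partial\psi_m + H_x\,\partial/\partial\phi_m + I_x) \tag{142} $$

$$ L_{ym} = i\hbar\,(G_y\,\partial/\partial\psi_m + H_y\,\partial/\partial\phi_m + I_y) \tag{143} $$

$$ L_{zm} = -i\hbar\,(H_z\,\partial/\partial\phi_m + I_z) \tag{144} $$

where

$$ G_x = A_\psi G'_x, \qquad G_y = A_\psi G'_y \tag{145} $$

$$ H_x = A_\phi H'_x, \qquad H_y = A_\phi H'_y, \qquad H_z = A_\phi H'_z \tag{146} $$

$$ I_x = B_\psi G'_x + B_\phi H'_x, \qquad I_y = B_\psi G'_y + B_\phi H'_y, \qquad I_z = B_\phi H'_z \tag{147} $$

The conventionally calculated angular momenta are given by

$$ L_{xc} = i\hbar\,(\sin\phi_m\,\partial/\partial\psi_m + \cot\psi_m\cos\phi_m\,\partial/\partial\phi_m) \tag{148} $$

$$ L_{yc} = i\hbar\,(-\cos\phi_m\,\partial/\partial\psi_m + \cot\psi_m\sin\phi_m\,\partial/\partial\phi_m) \tag{149} $$

$$ L_{zc} = -i\hbar\,\partial/\partial\phi_m \tag{150} $$

Therefore

$$ L_{xc} - L_{xm} = i\hbar\,[\,(\sin\phi_m - G_x)\,\partial/\partial\psi_m + (\cot\psi_m\cos\phi_m - H_x)\,\partial/\partial\phi_m - I_x\,] \tag{151} $$

$$ L_{yc} - L_{ym} = i\hbar\,[\,(-\cos\phi_m - G_y)\,\partial/\partial\psi_m + (\cot\psi_m\sin\phi_m - H_y)\,\partial/\partial\phi_m - I_y\,] \tag{152} $$

$$ L_{zc} - L_{zm} = -i\hbar\,[\,(1 - H_z)\,\partial/\partial\phi_m - I_z\,] \tag{153} $$

For pure internal phase rotations of the coordinates, equations (72) through (74) become

$$ \bar{L}_x = ij\hbar\,(\bar{y}/\bar{z}\;\partial/\partial\theta_z - \bar{z}/\bar{y}\;\partial/\partial\theta_y) \tag{154} $$

$$ \bar{L}_y = ij\hbar\,(\bar{z}/\bar{x}\;\partial/\partial\theta_x - \bar{x}/\bar{z}\;\partial/\partial\theta_z) \tag{155} $$

$$ \bar{L}_z = ij\hbar\,(\bar{x}/\bar{y}\;\partial/\partial\theta_y - \bar{y}/\bar{x}\;\partial/\partial\theta_x) \tag{156} $$

while equations (75) through (77) become

$$ \bar{L}_x = -ij\hbar\,[\,(\sin\bar{\phi})/\bar{\psi}\;\partial/\partial\theta_\psi + (\cot\bar{\psi}\cos\bar{\phi})/\bar{\phi}\;\partial/\partial\theta_\phi\,] \tag{157} $$

$$ \bar{L}_y = -ij\hbar\,[\,-(\cos\bar{\phi})/\bar{\psi}\;\partial/\partial\theta_\psi + (\cot\bar{\psi}\sin\bar{\phi})/\bar{\phi}\;\partial/\partial\theta_\phi\,] \tag{158} $$

$$ \bar{L}_z = (ij\hbar/\bar{\phi})\,\partial/\partial\theta_\phi \tag{159} $$

The calculation of $L'_a$ and $L_{am}$ for these equations is elementary. From equations (91) through (93) and equations (154) through (156) it follows that

$$ L'_x = -i\hbar\,(E'_{iyz}\,y/z\;\partial/\partial\theta_z - E'_{izy}\,z/y\;\partial/\partial\theta_y) $$
$$ L'_y = -i\hbar\,(E'_{izx}\,z/x\;\partial/\partial\theta_x - E'_{ixz}\,x/z\;\partial/\partial\theta_z) $$
$$ L'_z = -i\hbar\,(E'_{ixy}\,x/y\;\partial/\partial\theta_y - E'_{iyx}\,y/x\;\partial/\partial\theta_x) \tag{159A} $$

where

$$ E'_{iyz} = \sin\Xi^i_{\Psi yz}\sec\beta_{\Psi\theta z}, \qquad E'_{izy} = \sin\Xi^i_{\Psi zy}\sec\beta_{\Psi\theta y} $$
$$ E'_{izx} = \sin\Xi^i_{\Psi zx}\sec\beta_{\Psi\theta x}, \qquad E'_{ixz} = \sin\Xi^i_{\Psi xz}\sec\beta_{\Psi\theta z} $$
$$ E'_{ixy} = \sin\Xi^i_{\Psi xy}\sec\beta_{\Psi\theta y}, \qquad E'_{iyx} = \sin\Xi^i_{\Psi yx}\sec\beta_{\Psi\theta x} \tag{159B} $$

$$ \tan\beta_{\Psi\theta z} = (\Psi\,\partial\theta_\Psi/\partial\theta_z)/(\partial\Psi/\partial\theta_z) $$
$$ \tan\beta_{\Psi\theta y} = (\Psi\,\partial\theta_\Psi/\partial\theta_y)/(\partial\Psi/\partial\theta_y) $$
$$ \tan\beta_{\Psi\theta x} = (\Psi\,\partial\theta_\Psi/\partial\theta_x)/(\partial\Psi/\partial\theta_x) \tag{159C} $$

$$ \Xi^i_{\Psi yz} = \theta_\Psi + \beta_{\Psi\theta z} + \theta_y - \theta_z, \qquad \Xi^i_{\Psi zy} = \theta_\Psi + \beta_{\Psi\theta y} + \theta_z - \theta_y $$
$$ \Xi^i_{\Psi zx} = \theta_\Psi + \beta_{\Psi\theta x} + \theta_z - \theta_x, \qquad \Xi^i_{\Psi xz} = \theta_\Psi + \beta_{\Psi\theta z} + \theta_x - \theta_z $$
$$ \Xi^i_{\Psi xy} = \theta_\Psi + \beta_{\Psi\theta y} + \theta_x - \theta_y, \qquad \Xi^i_{\Psi yx} = \theta_\Psi + \beta_{\Psi\theta x} + \theta_y - \theta_x \tag{159D} $$

The measured angular momenta are obtained from equations (91) through (93) to be

$$ L_{xm} = -i\hbar\,[\,y/z\,(E_{iyz}\,\partial/\partial\theta_z + F_{iyz}) - z/y\,(E_{izy}\,\partial/\partial\theta_y + F_{izy})\,] $$
$$ L_{ym} = -i\hbar\,[\,z/x\,(E_{izx}\,\partial/\partial\theta_x + F_{izx}) - x/z\,(E_{ixz}\,\partial/\partial\theta_z + F_{ixz})\,] $$
$$ L_{zm} = -i\hbar\,[\,x/y\,(E_{ixy}\,\partial/\partial\theta_y + F_{ixy}) - y/x\,(E_{iyx}\,\partial/\partial\theta_x + F_{iyx})\,] \tag{159E} $$

where

$$ E_{iyz} = E'_{iyz}\sec\theta_\Psi, \qquad E_{izy} = E'_{izy}\sec\theta_\Psi $$
$$ E_{izx} = E'_{izx}\sec\theta_\Psi, \qquad E_{ixz} = E'_{ixz}\sec\theta_\Psi $$
$$ E_{ixy} = E'_{ixy}\sec\theta_\Psi, \qquad E_{iyx} = E'_{iyx}\sec\theta_\Psi \tag{159F} $$

$$ F_{iyz} = B_{\theta z}E'_{iyz}, \qquad F_{izy} = B_{\theta y}E'_{izy} $$
$$ F_{izx} = B_{\theta x}E'_{izx}, \qquad F_{ixz} = B_{\theta z}E'_{ixz} $$
$$ F_{ixy} = B_{\theta y}E'_{ixy}, \qquad F_{iyx} = B_{\theta x}E'_{iyx} \tag{159G} $$

where

$$ B_{\theta z} = \sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial\theta_z $$
$$ B_{\theta y} = \sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial\theta_y $$
$$ B_{\theta x} = \sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial\theta_x \tag{159H} $$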

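For the case when the wave function only rotates and $d\bar{\Psi} = j\bar{\Psi}\,d\theta_\Psi$, it follows from equations (91) through (93) and (154) through (156) that

$$ L'_x = -i\hbar\,[\,y/z\,\cos\Xi^r_{\Psi yz}\,\partial\theta_\Psi/\partial\theta_z - z/y\,\cos\Xi^r_{\Psi zy}\,\partial\theta_\Psi/\partial\theta_y\,] \tag{160} $$

$$ L'_y = -i\hbar\,[\,z/x\,\cos\Xi^r_{\Psi zx}\,\partial\theta_\Psi/\partial\theta_x - x/z\,\cos\Xi^r_{\Psi xz}\,\partial\theta_\Psi/\partial\theta_z\,] \tag{161} $$

$$ L'_z = -i\hbar\,[\,x/y\,\cos\Xi^r_{\Psi xy}\,\partial\theta_\Psi/\partial\theta_y - y/x\,\cos\Xi^r_{\Psi yx}\,\partial\theta_\Psi/\partial\theta_x\,] \tag{162} $$

$$ L_{xm} = L'_x\sec\theta_\Psi \tag{163} $$

$$ L_{ym} = L'_y\sec\theta_\Psi \tag{164} $$

$$ L_{zm} = L'_z\sec\theta_\Psi \tag{165} $$

where

$$ \Xi^r_{\Psi yz} = \theta_\Psi + \theta_y - \theta_z, \qquad \Xi^r_{\Psi zy} = \theta_\Psi + \theta_z - \theta_y \tag{166} $$

$$ \Xi^r_{\Psi zx} = \theta_\Psi + \theta_z - \theta_x, \qquad \Xi^r_{\Psi xz} = \theta_\Psi + \theta_x - \theta_z \tag{167} $$

$$ \Xi^r_{\Psi xy} = \theta_\Psi + \theta_x - \theta_y, \qquad \Xi^r_{\Psi yx} = \theta_\Psi + \theta_y - \theta_x \tag{168} $$

The measured angular momenta for this case are given by equations (160) through (162).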

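From equations (91) through (93) and equations (157) through (159) the calculation of $L'_a$ and $L_{am}$ for the internal phase rotations of spherical polar coordinates and for a wave function whose magnitude and internal phase angle are variable is done as follows

$$ L'_x = i\hbar\,(G'_{ix}\,\partial/\partial\theta_\psi + H'_{ix}\,\partial/\partial\theta_\phi) $$
$$ L'_y = i\hbar\,(G'_{iy}\,\partial/\partial\theta_\psi + H'_{iy}\,\partial/\partial\theta_\phi) $$
$$ L'_z = -i\hbar\,H'_{iz}\,\partial/\partial\theta_\phi \tag{168A} $$

where

$$ G'_{ix} = (S_\phi/\psi)\sec\beta_{\Psi\theta\psi}\sin\Xi^i_{\Psi x\psi}, \qquad G'_{iy} = -(C_\phi/\psi)\sec\beta_{\Psi\theta\psi}\sin\Xi^i_{\Psi y\psi} \tag{168B} $$

$$ H'_{ix} = C_\psi C_\phi/(S_\psi\phi)\,\sec\beta_{\Psi\theta\phi}\sin\Xi^i_{\Psi x\phi}, \qquad H'_{iy} = C_\psi S_\phi/(S_\psi\phi)\,\sec\beta_{\Psi\theta\phi}\sin\Xi^i_{\Psi y\phi} $$
$$ H'_{iz} = (1/\phi)\sec\beta_{\Psi\theta\phi}\sin\Xi^i_{\Psi z\phi} \tag{168C} $$

where

$$ \tan\beta_{\Psi\theta\psi} = (\Psi\,\partial\theta_\Psi/\partial\theta_\psi)/(\partial\Psi/\partial\theta_\psi) \tag{168D} $$

$$ \tan\beta_{\Psi\theta\phi} = (\Psi\,\partial\theta_\Psi/\partial\theta_\phi)/(\partial\Psi/\partial\theta_\phi) \tag{168E} $$

$$ \Xi^i_{\Psi x\psi} = \theta_\Psi + \beta_{\Psi\theta\psi} + \theta_{s\phi} - \theta_\psi $$
$$ \Xi^i_{\Psi y\psi} = \theta_\Psi + \beta_{\Psi\theta\psi} - \theta_{c\phi} - \theta_\psi $$
$$ \Xi^i_{\Psi x\phi} = \theta_\Psi + \beta_{\Psi\theta\phi} - \theta_{c\psi} - \theta_{s\psi} - \theta_{c\phi} - \theta_\phi $$
$$ \Xi^i_{\Psi y\phi} = \theta_\Psi + \beta_{\Psi\theta\phi} - \theta_{c\psi} - \theta_{s\psi} + \theta_{s\phi} - \theta_\phi $$
$$ \Xi^i_{\Psi z\phi} = \theta_\Psi + \beta_{\Psi\theta\phi} - \theta_\phi \tag{168F} $$

The measured angular momenta can be obtained from equations (91) through (93) so that for pure internal rotations of spherical polar coordinates

$$ L_{xm} = i\hbar\,(E_{ix\psi}\,\partial/\partial\theta_\psi + E_{ix\phi}\,\partial/\partial\theta_\phi + F_{ix\psi} + F_{ix\phi}) $$
$$ L_{ym} = i\hbar\,(E_{iy\psi}\,\partial/\partial\theta_\psi + E_{iy\phi}\,\partial/\partial\theta_\phi + F_{iy\psi} + F_{iy\phi}) $$
$$ L_{zm} = -i\hbar\,(E_{iz\phi}\,\partial/\partial\theta_\phi + F_{iz\phi}) \tag{168G} $$

where

$$ E_{ix\psi} = G'_{ix}\sec\theta_\Psi, \qquad E_{ix\phi} = H'_{ix}\sec\theta_\Psi $$
$$ E_{iy\psi} = G'_{iy}\sec\theta_\Psi, \qquad E_{iy\phi} = H'_{iy}\sec\theta_\Psi $$
$$ E_{iz\psi} = 0, \qquad E_{iz\phi} = H'_{iz}\sec\theta_\Psi \tag{168H} $$

$$ F_{ix\psi} = B_{\theta\psi}G'_{ix}, \qquad F_{ix\phi} = B_{\theta\phi}H'_{ix} $$
$$ F_{iy\psi} = B_{\theta\psi}G'_{iy}, \qquad F_{iy\phi} = B_{\theta\phi}H'_{iy} $$
$$ F_{iz\psi} = 0, \qquad F_{iz\phi} = B_{\theta\phi}H'_{iz} \tag{168I} $$

where

$$ B_{\theta\psi} = \sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial\theta_\psi, \qquad B_{\theta\phi} = \sec\theta_\Psi\tan\theta_\Psi\,\partial\theta_\Psi/\partial\theta_\phi \tag{168J} $$

The case where the wave function only rotates in internal space will now be considered.

From equations (91) through (93) and equations (157) through (159) it follows that for spherical polar coordinates and $d\bar{\Psi} = j\bar{\Psi}\,d\theta_\Psi$,

$$ L'_x = i\hbar\,(G'_{xr}\,\partial\theta_\Psi/\partial\theta_\psi + H'_{xr}\,\partial\theta_\Psi/\partial\theta_\phi) \tag{169} $$

$$ L'_y = i\hbar\,(G'_{yr}\,\partial\theta_\Psi/\partial\theta_\psi + H'_{yr}\,\partial\theta_\Psi/\partial\theta_\phi) \tag{170} $$

$$ L'_z = -i\hbar\,H'_{zr}\,\partial\theta_\Psi/\partial\theta_\phi \tag{171} $$

where

$$ G'_{xr} = (S_\phi/\psi)\cos\Xi^r_{\Psi x\psi}, \qquad H'_{xr} = C_\psi C_\phi/(S_\psi\phi)\,\cos\Xi^r_{\Psi x\phi} \tag{172} $$

$$ G'_{yr} = -(C_\phi/\psi)\cos\Xi^r_{\Psi y\psi}, \qquad H'_{yr} = C_\psi S_\phi/(S_\psi\phi)\,\cos\Xi^r_{\Psi y\phi} \tag{173} $$

$$ G'_{zr} = 0, \qquad H'_{zr} = (1/\phi)\cos\Xi^r_{\Psi z\phi} \tag{174} $$

$$ \Xi^r_{\Psi x\psi} = \theta_\Psi + \theta_{s\phi} - \theta_\psi \tag{175} $$

$$ \Xi^r_{\Psi y\psi} = \theta_\Psi - \theta_{c\phi} - \theta_\psi \tag{176} $$

$$ \Xi^r_{\Psi x\phi} = \theta_\Psi - \theta_{c\psi} - \theta_{s\psi} - \theta_{c\phi} - \theta_\phi \tag{177} $$

$$ \Xi^r_{\Psi y\phi} = \theta_\Psi - \theta_{c\psi} - \theta_{s\psi} + \theta_{s\phi} - \theta_\phi \tag{178} $$

$$ \Xi^r_{\Psi z\phi} = \theta_\Psi - \theta_\phi \tag{179} $$

The values of $L_{am}$ are then obtained from equations (163) through (165).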


3. SCHRÖDINGER EQUATION FOR ASYMMETRIC SPACETIME. This section considers Schrödinger's equation for two extremes of the variation of coordinates: a) external space and time variation of coordinates with constant internal phase angle, and b) internal variation of spacetime coordinates with constant coordinate magnitudes and varying internal phase angles. The single particle energy is calculated for both these cases. The general case where both the magnitude and the internal phase angle vary has already been treated for Schrödinger's equation.⁷ In cartesian coordinates Schrödinger's time independent equation for coordinates with broken internal symmetry is written as

$$ -\hbar^2/(2\mu)\,(\partial^2/\partial\bar{x}^2 + \partial^2/\partial\bar{y}^2 + \partial^2/\partial\bar{z}^2)\bar{\Psi} + \bar{V}\bar{\Psi} = \bar{E}\bar{\Psi} \tag{180} $$

where $\bar{V} = \bar{V}(\bar{x}, \bar{y}, \bar{z})$ = complex number potential, and $\bar{E}$ = complex number energy. The corresponding time dependent Schrödinger equation is given by⁷

$$ -\hbar^2/(2\mu)\sum_a \partial^2\bar{\Psi}/\partial\bar{a}^2 + \bar{V}\bar{\Psi} = i\hbar\,\partial\bar{\Psi}/\partial\bar{t} \tag{181} $$

where $a = x, y, z$. The generalization of the standard scalar time independent Schrödinger equation in spherical polar coordinates to the case of broken symmetry space and time is given by⁷

$$ \frac{\partial^2\bar{\Psi}}{\partial\bar{r}^2} + \frac{2}{\bar{r}}\frac{\partial\bar{\Psi}}{\partial\bar{r}} + \frac{1}{\bar{r}^2\sin\bar{\psi}}\frac{\partial}{\partial\bar{\psi}}\Big(\sin\bar{\psi}\,\frac{\partial\bar{\Psi}}{\partial\bar{\psi}}\Big) + \frac{1}{\bar{r}^2\sin^2\bar{\psi}}\frac{\partial^2\bar{\Psi}}{\partial\bar{\phi}^2} + \frac{2\mu}{\hbar^2}(\bar{E} - \bar{V})\bar{\Psi} = 0 \tag{182} $$

The wave function in equation (182) can be separated as follows

$$ \bar{\Psi} = \bar{R}(\bar{r})\,\bar{W}(\bar{\psi})\,\bar{\Phi}(\bar{\phi}) \tag{183} $$

which gives the following complex number generalization of the standard scalar equations⁷

$$ d^2\bar{\Phi}/d\bar{\phi}^2 + \bar{M}^2\bar{\Phi} = 0 \tag{184} $$

$$ 1/\sin\bar{\psi}\;d/d\bar{\psi}(\sin\bar{\psi}\,d\bar{W}/d\bar{\psi}) + (\bar{\beta} - \bar{M}^2/\sin^2\bar{\psi})\bar{W} = 0 \tag{185} $$

$$ \bar{r}^2\,d^2\bar{R}/d\bar{r}^2 + 2\bar{r}\,d\bar{R}/d\bar{r} + (\bar{k}^2\bar{r}^2 - \bar{\beta})\bar{R} = 0 \tag{186} $$

where

$$ \bar{k}^2 = 2\mu/\hbar^2\,(\bar{E} - \bar{V}) \tag{187A} $$

$$ \bar{M} = M e^{j\theta_M} \tag{187B} $$

$$ \bar{\beta} = \beta e^{j\theta_\beta} \tag{187C} $$

where $\bar{M}$ and $\bar{\beta}$ are complex number separation constants. The Schrödinger equations (180) through (186) will now be considered for the special cases of external and internal coordinate variation. The combined case of external and internal motion has already appeared in the literature.⁷

A.  External  Coordinate  Variation 

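For external coordinate variation in cartesian coordinates with broken internal symmetry the internal phase angles $\theta_x$, $\theta_y$ and $\theta_z$ are constants that depend only on the ambient pressure and energy density. Then

$$ d\bar{x} = dx\,e^{j\theta_x}, \qquad d\bar{y} = dy\,e^{j\theta_y}, \qquad d\bar{z} = dz\,e^{j\theta_z} \tag{188} $$

where the constant internal phase angles are written as

$$ \theta_x = \theta_x(P, E), \qquad \theta_y = \theta_y(P, E), \qquad \theta_z = \theta_z(P, E) \tag{189} $$

Schrödinger's equation (180) becomes

$$ -\hbar^2/(2\mu)\,[\,e^{-2j\theta_x}\,\partial^2\bar{\Psi}/\partial x^2 + e^{-2j\theta_y}\,\partial^2\bar{\Psi}/\partial y^2 + e^{-2j\theta_z}\,\partial^2\bar{\Psi}/\partial z^2\,] + \bar{V}\bar{\Psi} = \bar{E}\bar{\Psi} \tag{190} $$

For the free particle with external motion, the wave function is given by

$$ \bar{\Psi}_e = e^{i(\bar{k}_{ex}\bar{x} + \bar{k}_{ey}\bar{y} + \bar{k}_{ez}\bar{z})} \tag{191} $$

where the complex number wave numbers are given by

$$ \bar{k}_{ex} = k_{ex}e^{j\theta_{kex}}, \qquad \bar{k}_{ey} = k_{ey}e^{j\theta_{key}}, \qquad \bar{k}_{ez} = k_{ez}e^{j\theta_{kez}} \tag{192} $$

and where because of the periodicity condition in measured coordinates⁷

$$ \bar{k}_{ex}\bar{x} = k_{ex}x, \qquad \bar{k}_{ey}\bar{y} = k_{ey}y, \qquad \bar{k}_{ez}\bar{z} = k_{ez}z $$
$$ \theta_{kex} = -\theta_x, \qquad \theta_{key} = -\theta_y, \qquad \theta_{kez} = -\theta_z \tag{193} $$

and therefore $\bar{\Psi}_e = \Psi_e$ for periodicity. Equation (193) determines the internal phase angles of the wave numbers for periodic waves in broken symmetry space. The spatial wavelengths of the waves are given by

$$ \bar{\lambda}_{ex} = 2\pi/\bar{k}_{ex}, \qquad \bar{\lambda}_{ey} = 2\pi/\bar{k}_{ey}, \qquad \bar{\lambda}_{ez} = 2\pi/\bar{k}_{ez} \tag{194} $$

where

$$ \bar{\lambda}_{ex} = \lambda_{ex}e^{j\theta_{\lambda ex}}, \qquad \bar{\lambda}_{ey} = \lambda_{ey}e^{j\theta_{\lambda ey}}, \qquad \bar{\lambda}_{ez} = \lambda_{ez}e^{j\theta_{\lambda ez}} \tag{195} $$

Then it follows that

$$ k_{ex} = 2\pi/\lambda_{ex}, \qquad k_{ey} = 2\pi/\lambda_{ey}, \qquad k_{ez} = 2\pi/\lambda_{ez} \tag{196} $$

$$ \theta_{kex} = -\theta_{\lambda ex} = -\theta_x \tag{197} $$

$$ \theta_{key} = -\theta_{\lambda ey} = -\theta_y \tag{198} $$

$$ \theta_{kez} = -\theta_{\lambda ez} = -\theta_z \tag{199} $$

For periodic waves the wave numbers and wavelengths must adjust themselves to the local spacetime conditions according to equations (197) through (199).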

The measured wavelengths are given by

$$ \lambda_{exm} = \lambda_{ex}\cos\theta_{\lambda ex} = \lambda_{ex}\cos\theta_x \tag{200} $$

$$ \lambda_{eym} = \lambda_{ey}\cos\theta_{\lambda ey} = \lambda_{ey}\cos\theta_y \tag{201} $$

$$ \lambda_{ezm} = \lambda_{ez}\cos\theta_{\lambda ez} = \lambda_{ez}\cos\theta_z \tag{202} $$

while the measured wave numbers are

$$ k_{exm} = k_{ex}\cos\theta_{kex} = k_{ex}\cos\theta_x \tag{203} $$

$$ k_{eym} = k_{ey}\cos\theta_{key} = k_{ey}\cos\theta_y \tag{204} $$

$$ k_{ezm} = k_{ez}\cos\theta_{kez} = k_{ez}\cos\theta_z \tag{205} $$

Combining equations (196) through (205) gives

$$ k_{exm} = (2\pi/\lambda_{exm})\cos^2\theta_x \tag{206} $$

$$ k_{eym} = (2\pi/\lambda_{eym})\cos^2\theta_y \tag{207} $$

$$ k_{ezm} = (2\pi/\lambda_{ezm})\cos^2\theta_z \tag{208} $$

Note that $k_{exm} \neq 2\pi/\lambda_{exm}$ and so on. Finally, for insertion into the wave function in equation (191) one has

$$ \bar{k}_{ex}\bar{x} = k_{ex}x = k_{exm}x_m/\cos^2\theta_x = 2\pi x_m/\lambda_{exm} \tag{209} $$

$$ \bar{k}_{ey}\bar{y} = k_{ey}y = k_{eym}y_m/\cos^2\theta_y = 2\pi y_m/\lambda_{eym} \tag{210} $$

$$ \bar{k}_{ez}\bar{z} = k_{ez}z = k_{ezm}z_m/\cos^2\theta_z = 2\pi z_m/\lambda_{ezm} \tag{211} $$
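The relations among equations (196), (200), (203), (206) and (209) can be illustrated with a short numerical sketch. The function name and the sample values are assumptions made for illustration only; the measured coordinate is taken as $x_m = x\cos\theta_x$, which is what equation (209) implies.

```python
import math


def measured_wave_quantities(k_ex, theta_x):
    """Return (k_exm, lambda_exm) for a wave-number magnitude k_ex and a
    constant internal phase angle theta_x, following (196), (200) and (203)."""
    lambda_ex = 2 * math.pi / k_ex               # equation (196)
    k_exm = k_ex * math.cos(theta_x)             # equation (203)
    lambda_exm = lambda_ex * math.cos(theta_x)   # equation (200)
    return k_exm, lambda_exm


if __name__ == "__main__":
    k_ex, theta_x, x = 3.0, 0.2, 1.7             # illustrative values
    k_exm, lambda_exm = measured_wave_quantities(k_ex, theta_x)
    x_m = x * math.cos(theta_x)                  # measured coordinate
    # Equation (206): k_exm differs from 2*pi/lambda_exm by cos^2(theta_x).
    print(k_exm, 2 * math.pi / lambda_exm * math.cos(theta_x) ** 2)
    # Equation (209): the phase k_ex*x equals 2*pi*x_m/lambda_exm.
    print(k_ex * x, 2 * math.pi * x_m / lambda_exm)
```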


Consider the case of free particles moving in external space that exhibits broken internal symmetry. The single particle energy can be obtained from equations (190) through (199) to be

$$ \bar{E}_e = \hbar^2/(2\mu)\,(k_{ex}^2 e^{-2j\theta_x} + k_{ey}^2 e^{-2j\theta_y} + k_{ez}^2 e^{-2j\theta_z}) = \hbar^2\bar{k}_e^2/(2\mu) \tag{212} $$

where

$$ \bar{k}_e^2 = \bar{k}_{ex}^2 + \bar{k}_{ey}^2 + \bar{k}_{ez}^2 \tag{213} $$

The measured free particle energy is given by the real part of equation (212) as

$$ E_{em} = \hbar^2 k_e^2/(2\mu)\,\cos(2\theta_{ke}) \tag{214} $$

where $k_e$ and $\theta_{ke}$ are obtained from equation (213) and equations (197) through (199) as

$$ k_e^2\cos(2\theta_{ke}) = k_{ex}^2\cos(2\theta_x) + k_{ey}^2\cos(2\theta_y) + k_{ez}^2\cos(2\theta_z) \tag{215} $$

$$ k_e^2\sin(2\theta_{ke}) = k_{ex}^2\sin(2\theta_x) + k_{ey}^2\sin(2\theta_y) + k_{ez}^2\sin(2\theta_z) \tag{216} $$

Therefore the measured energy is

$$ E_{em} = \hbar^2/(2\mu)\,[\,k_{ex}^2\cos(2\theta_x) + k_{ey}^2\cos(2\theta_y) + k_{ez}^2\cos(2\theta_z)\,] \tag{217} $$

This equation will be used in Section 4 to calculate the energy of a particle localized in an external space box.
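As an illustration of how equation (217) follows from equation (212), the sketch below evaluates both numerically. It assumes that Python's `1j` plays the role of the internal-space unit $j$ and uses units with $\hbar = \mu = 1$ by default; the function names and sample values are hypothetical.

```python
import cmath
import math


def complex_energy(k, theta, mu=1.0, hbar=1.0):
    """Complex free-particle energy of equation (212).
    k and theta are sequences (k_ex, k_ey, k_ez) and (theta_x, theta_y, theta_z)."""
    return hbar ** 2 / (2 * mu) * sum(
        kk ** 2 * cmath.exp(-2j * t) for kk, t in zip(k, theta)
    )


def measured_energy(k, theta, mu=1.0, hbar=1.0):
    """Measured free-particle energy of equation (217)."""
    return hbar ** 2 / (2 * mu) * sum(
        kk ** 2 * math.cos(2 * t) for kk, t in zip(k, theta)
    )


if __name__ == "__main__":
    k = (1.0, 2.0, 0.5)
    theta = (0.1, 0.2, 0.3)
    # The real part of (212) reproduces (217).
    print(complex_energy(k, theta).real, measured_energy(k, theta))
```

With all three internal phase angles set to zero, `measured_energy` reduces to the conventional value $\hbar^2(k_{ex}^2 + k_{ey}^2 + k_{ez}^2)/(2\mu)$.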

For external motion in spherical coordinates with broken internal symmetry and $\theta_\phi$, $\theta_\psi$ and $\theta_r$ constant, equations (184) through (186) become⁷

$$ d^2\bar{\Phi}_e/d\bar{\phi}^2 + \bar{M}_e^2\bar{\Phi}_e = 0 \tag{218} $$

$$ 1/\sin\bar{\psi}\;d/d\bar{\psi}(\sin\bar{\psi}\,d\bar{W}_e/d\bar{\psi}) + (\bar{\beta}_e - \bar{M}_e^2/\sin^2\bar{\psi})\bar{W}_e = 0 \tag{219} $$

$$ \bar{r}^2\,d^2\bar{R}_e/d\bar{r}^2 + 2\bar{r}\,d\bar{R}_e/d\bar{r} + (\bar{k}^2\bar{r}^2 - \bar{\beta}_e)\bar{R}_e = 0 \tag{220} $$

where⁷

$$ \bar{M}_e = m\cos\theta_\phi\,e^{-j\theta_\phi}, \qquad M_e = m\cos\theta_\phi \tag{221} $$

$$ \bar{M}'_e = |m|\cos\theta_\phi\,e^{-j\theta_\phi}, \qquad M'_e = |m|\cos\theta_\phi \tag{222} $$

$$ \bar{\beta}_e = \bar{\ell}_e(\bar{\ell}_e + 1) \tag{223} $$

$$ \bar{\ell}_e = \bar{M}'_e + \ell - |m| \tag{224} $$

where $m$ = ordinary magnetic quantum number, $m = 0, \pm 1, \pm 2, \cdots$. For periodic rotations $\bar{M}_e\bar{\phi} = M_e\phi$ = real number, so that⁷

$$ d^2\Phi_e/d\phi^2 + M_e^2\Phi_e = 0 \tag{225} $$

$$ \Phi_e = e^{\pm iM_e\phi} = e^{\pm im\phi\cos\theta_\phi} = e^{\pm im\phi_m} \tag{226} $$

where $\phi_m = \phi\cos\theta_\phi$. Thus $\bar{\Phi}_e = \Phi_e$ = real number in internal space (but obviously a complex number in external space).

Consider now the calculation of the quantized energy values for a rigid rotator in broken symmetry spacetime. For rotations about the z axis Schrödinger's equation is obtained from equation (71) to be

$$ -\hbar^2/(2\bar{I})\;\partial^2\bar{\Psi}/\partial\bar{\phi}^2 = \bar{E}_z\bar{\Psi} \tag{227} $$

whose solution is

$$ \bar{\Psi} = A e^{i\bar{M}\bar{\phi}} \tag{228} $$

which is also the eigenfunction for $\bar{L}_z$ as shown in equations (78) and (79). Combining equations (227) and (228) gives the kinetic energy of the rotator as

$$ \bar{E}_z = \hbar^2\bar{M}^2/(2\bar{I}) \tag{229} $$

where $\bar{I}$ = complex number moment of inertia about the z axis. The measured energy is given by

$$ E_{zm} = \hbar^2 M^2/(2I)\,\cos(2\theta_M - \theta_I) \tag{230} $$

For periodic external rotations with $\theta_\phi$ = constant it follows from equations (80) and (230) that

$$ E^e_{zm} = \hbar^2 M_e^2/(2I)\,\cos(2\theta_\phi + \theta_I) = \hbar^2 m^2/(2I)\,\cos^2\theta_\phi\cos(2\theta_\phi + \theta_I) \tag{231} $$

Equation (231) can be written as

$$ E^e_{zm} = \hbar^2 m^2/(2I_{eff}) \tag{232} $$

where the effective moment of inertia is

$$ I_{eff} = I\sec(2\theta_\phi + \theta_I)\sec^2\theta_\phi \tag{233} $$

The complex moment of inertia is written as

$$ \bar{I} = \mu\bar{r}^2, \qquad I = \mu r^2, \qquad \theta_I = 2\theta_r \tag{234} $$

$$ I_m = I\cos\theta_I = I\cos(2\theta_r) = I_c(1 - \tan^2\theta_r) \tag{235} $$

where $I_c$ = conventionally calculated moment of inertia given by

$$ I_c = \mu r_m^2 = I\cos^2\theta_r \tag{236} $$

Combining equations (233) through (236) gives

$$ I_{eff} = I_c\sec[2(\theta_\phi + \theta_r)]\sec^2\theta_\phi\sec^2\theta_r \tag{237} $$
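The step from equation (233) to equation (237) can be checked numerically. The sketch below is illustrative only: the function names and sample angles are assumptions, and units with $\hbar = 1$ are used for the energy comparison against equation (231).

```python
import math


def effective_inertia_233(I, theta_phi, theta_I):
    """Equation (233): I_eff = I sec(2 theta_phi + theta_I) sec^2(theta_phi)."""
    return I / (math.cos(2 * theta_phi + theta_I) * math.cos(theta_phi) ** 2)


def effective_inertia_237(I_c, theta_phi, theta_r):
    """Equation (237): I_eff expressed through the conventional moment I_c."""
    return I_c / (
        math.cos(2 * (theta_phi + theta_r))
        * math.cos(theta_phi) ** 2
        * math.cos(theta_r) ** 2
    )


if __name__ == "__main__":
    I, theta_phi, theta_r, m = 2.0, 0.1, 0.15, 3
    I_c = I * math.cos(theta_r) ** 2      # equation (236)
    theta_I = 2 * theta_r                 # equation (234)
    print(effective_inertia_233(I, theta_phi, theta_I),
          effective_inertia_237(I_c, theta_phi, theta_r))
    # Energy of equation (232) against equation (231), with hbar = 1.
    e_232 = m ** 2 / (2 * effective_inertia_233(I, theta_phi, theta_I))
    e_231 = m ** 2 / (2 * I) * math.cos(theta_phi) ** 2 * math.cos(2 * theta_phi + theta_I)
    print(e_232, e_231)
```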

Now consider rigid rotations in three dimensional space with broken symmetries. In this case the rotational energy of a body is given by⁷

$$ \bar{E} = \hbar^2/(2\bar{I})\,\bar{\ell}(\bar{\ell} + 1) \tag{238} $$

where for external motion $\bar{\ell} = \bar{\ell}_e$, where $\bar{\ell}_e$ is given by equation (224). Therefore for external motion

$$ \bar{E}^e = \hbar^2/(2\bar{I})\,\bar{\ell}_e(\bar{\ell}_e + 1) = \hbar^2/(2\bar{I})\,(\bar{M}'_e + \nu)(\bar{M}'_e + \nu + 1) \tag{239} $$

where $\nu$ is an integer given by $\nu = \ell - |m|$. Combining equations (222), (224) and (239) gives the measured energy eigenvalues as

$$ E^e_m = \hbar^2/(2I)\,[\,|m|^2 L_2 + |m|(2\nu + 1)L_1 + \nu(\nu + 1)L_0\,] \tag{240} $$

where

$$ L_2 = \cos^2\theta_\phi\cos(2\theta_\phi + \theta_I) \tag{241} $$

$$ L_1 = \cos\theta_\phi\cos(\theta_\phi + \theta_I) \tag{242} $$

$$ L_0 = \cos\theta_I \tag{243} $$

Equivalently, it follows from equation (239) that

$$ E^e_m = \hbar^2/(2I)\,[\,\ell_e^2\cos(2\theta_{\ell e} - \theta_I) + \ell_e\cos(\theta_{\ell e} - \theta_I)\,] \tag{244} $$

where⁷

$$ \ell_e = \ell\,[\,1 - (|m|/\ell)(2 - |m|/\ell)\sin^2\theta_\phi\,]^{1/2} \tag{245} $$

$$ \tan\theta_{\ell e} = -(|m|\sin\theta_\phi\cos\theta_\phi)/(\ell - |m|\sin^2\theta_\phi) \tag{246} $$
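The equivalence of equations (240) and (244) with the real part of equation (239) can be confirmed with a small numerical sketch. It assumes that Python's `1j` stands for the internal-space unit $j$, that $\ell \geq 1$ and $\ell \geq |m|$, and units with $\hbar = 1$ by default; all function names are hypothetical.

```python
import cmath
import math


def energy_from_239(l, m, theta_phi, theta_I, I=1.0, hbar=1.0):
    """Real part of equation (239), evaluated with complex arithmetic."""
    c = math.cos(theta_phi) * cmath.exp(-1j * theta_phi)
    l_e = abs(m) * c + (l - abs(m))               # equation (224)
    I_bar = I * cmath.exp(1j * theta_I)           # complex moment of inertia
    return (hbar ** 2 / (2 * I_bar) * l_e * (l_e + 1)).real


def energy_240(l, m, theta_phi, theta_I, I=1.0, hbar=1.0):
    """Equation (240) with the coefficients of equations (241) through (243)."""
    nu = l - abs(m)
    L2 = math.cos(theta_phi) ** 2 * math.cos(2 * theta_phi + theta_I)
    L1 = math.cos(theta_phi) * math.cos(theta_phi + theta_I)
    L0 = math.cos(theta_I)
    return hbar ** 2 / (2 * I) * (
        abs(m) ** 2 * L2 + abs(m) * (2 * nu + 1) * L1 + nu * (nu + 1) * L0
    )


def energy_244(l, m, theta_phi, theta_I, I=1.0, hbar=1.0):
    """Equation (244) with the magnitude and phase of equations (245) and (246)."""
    s, c = math.sin(theta_phi), math.cos(theta_phi)
    r = abs(m) / l
    l_mag = l * math.sqrt(1 - r * (2 - r) * s ** 2)             # equation (245)
    th = math.atan(-abs(m) * s * c / (l - abs(m) * s ** 2))      # equation (246)
    return hbar ** 2 / (2 * I) * (
        l_mag ** 2 * math.cos(2 * th - theta_I) + l_mag * math.cos(th - theta_I)
    )


if __name__ == "__main__":
    args = (3, 2, 0.3, 0.1)
    print(energy_from_239(*args), energy_240(*args), energy_244(*args))
```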


B.  Internal  Coordinate  Motion 

The  case  where  the  cartesian  and  spherical  coordinate  magnitudes  (x  ,  y  ,  z 
and  r  ,  respectively)  are  held  fixed  and  the  system  is  moving  in  internal 

coordinate  space  with  0X  ,  0y  ,  0Z  and  0r  ,  9<j,  ,  9^  as  variables  is  now  con¬ 
sidered.  Also  t  =  constant  and  0t  is  a  variable. 

First  the  case  of  cartesian  coordinates  is  treated.  From  equation  (70) 
it  follows  that  for  a  =  constant 
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p2  =  -  h2 32/3a2 
a 


(247) 


=  h2/a2(d2/%e 2  -  j 3/30  )  . 

a  J  a 


(248) 


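where $\alpha = x, y, z$. Combining equations (180), (247) and (248) yields Schrödinger's equation for internal cartesian coordinate motion

$$-\frac{\hbar^2}{2\mu}\sum_\alpha \frac{\partial^2\Psi_i}{\partial\bar{\alpha}^2} + V_i\Psi_i = \bar{E}_i\Psi_i \tag{249}$$

$$\frac{\hbar^2}{2\mu}\sum_\alpha \frac{1}{\bar{\alpha}^2}\left(\frac{\partial^2\Psi_i}{\partial\theta_\alpha^2} - j\,\frac{\partial\Psi_i}{\partial\theta_\alpha}\right) + V_i\Psi_i = \bar{E}_i\Psi_i \tag{250}$$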


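where the subscript "i" refers to internal motion. The time dependent Schrödinger equation is obtained from equations (28), (71) and (250) by writing

$$\bar{E}_i\Psi_i = i\hbar\,\frac{\partial\Psi_i}{\partial\bar{t}} \tag{251}$$

$$= -\frac{ij\hbar}{\bar{t}}\,\frac{\partial\Psi_i}{\partial\theta_t} \tag{252}$$

$$= -\frac{ij\hbar}{t}\,e^{-j\theta_t}\,\frac{\partial\Psi_i}{\partial\theta_t} \tag{253}$$

remembering that $d\bar{t} = j\bar{t}\,d\theta_t$.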


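Consider a free particle located at a fixed position in external space $x$, $y$, $z$ and moving in internal space with $\theta_x$, $\theta_y$, $\theta_z$ as variables. The wave function is written as

$$\Psi_i(\theta_x,\theta_y,\theta_z) = e^{i(\bar{k}_{ix}\bar{x} + \bar{k}_{iy}\bar{y} + \bar{k}_{iz}\bar{z})} \tag{254}$$

$$= \exp i\left[\bar{k}_{ix}\,x\exp(j\theta_x) + \bar{k}_{iy}\,y\exp(j\theta_y) + \bar{k}_{iz}\,z\exp(j\theta_z)\right]$$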


The requirement that equation (254) be a solution of Schrödinger's equation for a free particle gives the single particle energy for internal motion to be

$$\bar{E}_i = \frac{\hbar^2}{2\mu}\left(\bar{k}_{ix}^2 + \bar{k}_{iy}^2 + \bar{k}_{iz}^2\right) \tag{255}$$

where

$$\bar{k}_{ix} = k_{ix}\,e^{j\theta_{kix}} \qquad \bar{k}_{iy} = k_{iy}\,e^{j\theta_{kiy}} \qquad \bar{k}_{iz} = k_{iz}\,e^{j\theta_{kiz}} \tag{256}$$

which are complex number constants. The measured single particle energy is given by

$$E_{im} = \frac{\hbar^2}{2\mu}\left[k_{ix}^2\cos(2\theta_{kix}) + k_{iy}^2\cos(2\theta_{kiy}) + k_{iz}^2\cos(2\theta_{kiz})\right] \tag{257}$$

In Section 4 this equation will be used to calculate the energy of a particle trapped in an internal space box. The phase in equation (254) is not a real number because there is no periodicity in measured real space as $x$, $y$ and $z$ are fixed. Therefore

$$\bar{k}_{ix}\bar{x} = k_{ix}\,x\,e^{j(\theta_{kix} + \theta_x)} \tag{258}$$

$$\bar{k}_{iy}\bar{y} = k_{iy}\,y\,e^{j(\theta_{kiy} + \theta_y)} \tag{259}$$

$$\bar{k}_{iz}\bar{z} = k_{iz}\,z\,e^{j(\theta_{kiz} + \theta_z)} \tag{260}$$


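Equation (254) also follows from equation (25) which becomes

$$-i\,\partial\Psi_i/\partial\bar{x} = \bar{k}_{ix}\Psi_i \tag{261}$$

$$-i\,\partial\Psi_i/\partial\bar{y} = \bar{k}_{iy}\Psi_i \tag{262}$$

$$-i\,\partial\Psi_i/\partial\bar{z} = \bar{k}_{iz}\Psi_i \tag{263}$$

For constant $\bar{E}_i$, equation (251) gives

$$\Psi_i(\theta_t) = \exp(-i\bar{E}_i\bar{t}/\hbar) = \exp\left[-i\bar{E}_i\,t/\hbar\,\exp(j\theta_t)\right] \tag{264}$$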


with $t$ = constant and $\bar{t}$ given by equation (2). Therefore for $x$, $y$, $z$ and $t$ = constants it is easier (for the free particle case) to solve the Schrödinger equations (249) and (251) in terms of $\bar{x}$, $\bar{y}$, $\bar{z}$ and $\bar{t}$ and then use equations (1) and (2) to express the results in terms of $\theta_\alpha$ and $\theta_t$, rather than to directly solve equations (250) and (253). On the other hand, if the potential is explicitly a function of the internal phase angles, $V_i = V_i(\theta_x,\theta_y,\theta_z)$ with $x$, $y$ and $z$ = constants, then it may be more convenient to solve equations (250) and (253) directly. Yet even in this case it is possible to use the conventional form of Schrödinger's equation (249) by using equation (1) to rewrite the potential as

$$V_i(\theta_x,\theta_y,\theta_z) = V_i\left[-j\ln(\bar{x}/x),\,-j\ln(\bar{y}/y),\,-j\ln(\bar{z}/z)\right] = W_i(\bar{x},\bar{y},\bar{z}) \tag{265}$$

and equation (249) becomes

$$-\frac{\hbar^2}{2\mu}\sum_\alpha \frac{\partial^2\Psi_i}{\partial\bar{\alpha}^2} + W_i\Psi_i = \bar{E}_i\Psi_i \tag{266}$$

Equations  (250)  and  (266)  are  equivalent. 
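The change of variable used in equation (265), $\theta_x = -j\ln(\bar{x}/x)$, can be illustrated with a short numerical check. This is a sketch under stated assumptions: the quadratic sample potential is chosen only for illustration (it is the form used later for the internal oscillator), the principal branch of the logarithm is assumed so that $|\theta_x| < \pi$, and the function names are hypothetical.

```python
import cmath

def phase_from_complex(x_bar, x):
    """theta_x = -j ln(x_bar / x), using the principal branch of the logarithm."""
    return -1j * cmath.log(x_bar / x)

def v_internal(theta, k=1.0):
    """Sample potential written in the internal phase angle (illustrative choice)."""
    return 0.5 * k * theta ** 2

def w_external(x_bar, x, k=1.0):
    """The same potential rewritten as a function of the complex coordinate."""
    return -0.5 * k * cmath.log(x_bar / x) ** 2

if __name__ == "__main__":
    x, theta = 2.0, 0.7
    x_bar = x * cmath.exp(1j * theta)
    print(phase_from_complex(x_bar, x), v_internal(theta), w_external(x_bar, x))
```

The two potential functions return the same value, which is the sense in which equations (250) and (266) describe the same problem.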

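The situation of internal phase rotations in spherical polar coordinates with $r$, $\psi$ and $\phi$ fixed is now considered. For this case Schrödinger's equation (182) becomes

$$-\frac{1}{\bar{r}^2}\left(\frac{\partial^2\Psi_i}{\partial\theta_r^2} + j\,\frac{\partial\Psi_i}{\partial\theta_r}\right) - \frac{1}{\bar{r}^2\bar{\psi}\sin\bar{\psi}}\,\frac{\partial}{\partial\theta_\psi}\left(\frac{1}{\bar{\psi}}\sin\bar{\psi}\,\frac{\partial\Psi_i}{\partial\theta_\psi}\right) - \frac{1}{\bar{r}^2\bar{\phi}^2\sin^2\bar{\psi}}\left(\frac{\partial^2\Psi_i}{\partial\theta_\phi^2} - j\,\frac{\partial\Psi_i}{\partial\theta_\phi}\right) + \frac{8\pi^2\mu}{h^2}(\bar{E}_i - V_i)\Psi_i = 0 \tag{267}$$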

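Equation (267) can be separated into three independent equations by writing

$$\Psi_i = R_i(\theta_r)\,W_i(\theta_\psi)\,\Phi_i(\theta_\phi) \tag{268}$$

which gives the following equations

$$-\frac{1}{\bar{\phi}^2}\left(\frac{d^2\Phi_i}{d\theta_\phi^2} - j\,\frac{d\Phi_i}{d\theta_\phi}\right) + \bar{M}_i^2\Phi_i = 0 \tag{269}$$

$$-\frac{1}{\bar{\psi}\sin\bar{\psi}}\,\frac{d}{d\theta_\psi}\left(\frac{1}{\bar{\psi}}\sin\bar{\psi}\,\frac{dW_i}{d\theta_\psi}\right) + \left(\bar{\beta}_i - \frac{\bar{M}_i^2}{\sin^2\bar{\psi}}\right)W_i = 0 \tag{270}$$

$$-\frac{d^2R_i}{d\theta_r^2} - j\,\frac{dR_i}{d\theta_r} + (\bar{k}_i^2\bar{r}^2 - \bar{\beta}_i)R_i = 0 \tag{271}$$

where

$$\bar{k}_i^2 = \frac{8\pi^2\mu}{h^2}\left[\bar{E}_i - V_i(\theta_r,\theta_\psi,\theta_\phi,r,\psi,\phi)\right] \tag{272}$$

and where

$$\bar{M}_i = M_i\,e^{j\theta_{Mi}} \tag{273}$$

is the complex magnetic quantum number for internal azimuthal angle rotations, and possibly $\bar{M}_i = \bar{M}_i(\phi)$ because $\phi$ = constant. The solutions of equations (184) through (186) are relatively easy for external motion when $\theta_\phi$ = constant and $\bar{M}_e$ and $\bar{\ell}_e$ are constants given in equations (218) through (224).⁷ If $\theta_\phi$ is not constant but instead, at the other extreme, $\theta_\phi$ is variable and $\phi$ = constant, then the separation constant for internal motion must be independent of $\theta_\phi$ so that certainly $\bar{M}_i \neq \bar{M}_e$. Therefore equations (218) through (220) are valid for $\theta_r$, $\theta_\psi$ and $\theta_\phi$ = constants while equations (269) through (271) are valid for $r$, $\psi$, $\phi$ = constants.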

The solution to equation (269) can be obtained from a solution of equation (184) with $\phi$ = constant. This solution is

$$\Phi_i(\theta_\phi) = \exp(\pm i\bar{M}_i\bar{\phi}) \tag{274}$$

$$= \exp\left[\pm iM_i\phi\,\exp j(\theta_{Mi} + \theta_\phi)\right]$$

Note that $\theta_{Mi} + \theta_\phi \neq 0$ for internal coordinate motion whereas $\theta_{Me} + \theta_\phi = 0$ for external periodic motion. Similarly, the solution of equation (270) can be obtained from the solution of equation (185) with $\psi$ = constant as follows⁷

$$W_i(\theta_\psi) = P^{\bar{M}_i}_{\bar{\ell}_i}\left[\cos(\psi\,e^{j\theta_\psi})\right] \tag{275}$$
with

$$\bar{\beta}_i = \bar{\ell}_i(\bar{\ell}_i + 1) \qquad \bar{\ell}_i = \bar{M}_i + \delta \tag{276}$$

where $\delta = 0,1,2,3,\ldots$ . Because $\phi$ = constant, $\bar{M}_i$ cannot be related to an integer as was the case for the external complex magnetic quantum number $\bar{M}_e$ given in equation (80). The solution of the radial equation (271) depends on the form of the potential $V_i$.


The Schrödinger equation for the internal space rigid body rotator is now examined. For rotations about the z axis, but with $\phi$ = constant and $d\bar{\phi} = j\bar{\phi}\,d\theta_\phi$, it follows from equation (227) that

$$\frac{\hbar^2}{2\bar{I}\bar{\phi}^2}\left(\frac{d^2\Phi_i}{d\theta_\phi^2} - j\,\frac{d\Phi_i}{d\theta_\phi}\right) + V_i(\theta_\phi)\Phi_i = \bar{E}_{zi}\Phi_i \tag{277}$$

Equation (277) can also be obtained directly from equation (159). The solution to equation (277) for $V_i = 0$ is given by equation (274), and for this case a comparison of equations (269) and (277) gives the energy as

$$\bar{E}_{zi} = \hbar^2\bar{M}_i^2/(2\bar{I}) \tag{278}$$

for free internal rotations. Also from equation (159)

$$\bar{L}_z\Phi_i = \hbar\bar{M}_i\Phi_i \tag{279}$$

$$\bar{E}_{zi}\Phi_i = \bar{L}_z^2/(2\bar{I})\,\Phi_i \tag{280}$$

The energy in equation (278) can be obtained directly by operating with the left hand side of equation (277) with $V_i = 0$ on the solution given in equation (274). The measured energy is given by

$$E_{zim} = \frac{\hbar^2M_i^2}{2I}\cos(2\theta_{Mi} - \theta_I) \tag{281}$$


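For internal rotation in three dimensions it follows from equations (83), (269) and (270) that for the rigid rotator

$$\bar{L}^2 = \hbar^2\left[\frac{1}{\bar{\phi}^2\sin^2\bar{\psi}}\left(\frac{\partial^2}{\partial\theta_\phi^2} - j\,\frac{\partial}{\partial\theta_\phi}\right) + \frac{1}{\bar{\psi}\sin\bar{\psi}}\,\frac{\partial}{\partial\theta_\psi}\left(\frac{1}{\bar{\psi}}\sin\bar{\psi}\,\frac{\partial}{\partial\theta_\psi}\right)\right] \tag{282}$$

$$= \hbar^2\left[\frac{\bar{M}_i^2}{\sin^2\bar{\psi}} + \frac{1}{\bar{\psi}\sin\bar{\psi}}\,\frac{d}{d\theta_\psi}\left(\frac{1}{\bar{\psi}}\sin\bar{\psi}\,\frac{d}{d\theta_\psi}\right)\right]$$

$$= \hbar^2\bar{\beta}_i = \hbar^2\bar{\ell}_i(\bar{\ell}_i + 1)$$

so that

$$\bar{E}_i = \frac{\hbar^2}{2\bar{I}}\,\bar{\ell}_i(\bar{\ell}_i + 1) \tag{283}$$

where $\bar{\ell}_i$ is given by equation (276). The measured energy is obtained from equations (276) and (283) as

$$E_{im} = \frac{\hbar^2}{2I}\left[K_2M_i^2 + (2\delta + 1)K_1M_i + \delta(\delta + 1)K_0\right] \tag{284}$$

where

$$K_2 = \cos(2\theta_{Mi} - \theta_I) \tag{285}$$

$$K_1 = \cos(\theta_{Mi} - \theta_I) \tag{286}$$

$$K_0 = \cos\theta_I \tag{287}$$

Apparently $M_i$ is a continuous real number that can assume any value. Thus the spectrum of the internal space rotator is continuous.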

4. PARTICLE IN BOX FOR ASYMMETRIC SPACETIME. From the previous section it is clear that particles can have internal as well as external motions. This section considers the motion of a particle in a box, first for the case of motion in external space with broken internal symmetries, and secondly for motion in a box located in internal space.

A.  Particle  in  External  Space  Box  with  Broken  Symmetry. 

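Consider a particle in a box of complex number space dimensions $\bar{a}$, $\bar{b}$ and $\bar{c}$. The measured dimensions of the box are

$$a_m = a\cos\theta_a \qquad b_m = b\cos\theta_b \qquad c_m = c\cos\theta_c \tag{288}$$

and it is assumed that the internal phase angles of the coordinates are constants and $\theta_x = \theta_a$, $\theta_y = \theta_b$, $\theta_z = \theta_c$. The wave function obtained from equation (191) will be written as

$$\Psi_e = A\sin(\bar{k}_{ex}\bar{x})\sin(\bar{k}_{ey}\bar{y})\sin(\bar{k}_{ez}\bar{z}) \tag{289}$$

If $\Psi_e = 0$ at the boundaries $\bar{x} = \bar{a}$, $\bar{y} = \bar{b}$ and $\bar{z} = \bar{c}$, it follows that

$$\bar{k}_{ex}\bar{a} = n\pi \qquad \bar{k}_{ey}\bar{b} = m\pi \qquad \bar{k}_{ez}\bar{c} = \ell\pi \tag{290}$$

and therefore equations (196), (209) through (211) and (290) give

$$\bar{\lambda}_x = 2\bar{a}/n \qquad \lambda_{xm} = 2a_m/n$$

$$\bar{\lambda}_y = 2\bar{b}/m \qquad \lambda_{ym} = 2b_m/m$$

$$\bar{\lambda}_z = 2\bar{c}/\ell \qquad \lambda_{zm} = 2c_m/\ell \tag{291}$$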
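Combining equations (217), (288) and (290) gives the measured energy for a particle moving externally in a broken symmetry space box as

$$E_{em} = \frac{\hbar^2\pi^2}{2\mu}\left[\frac{n^2}{a^2}\cos(2\theta_a) + \frac{m^2}{b^2}\cos(2\theta_b) + \frac{\ell^2}{c^2}\cos(2\theta_c)\right] \tag{292}$$

$$= \frac{\hbar^2\pi^2}{2\mu}\left(I_a\frac{n^2}{a_m^2} + I_b\frac{m^2}{b_m^2} + I_c\frac{\ell^2}{c_m^2}\right)$$

where, using $a = a_m/\cos\theta_a$ and the corresponding relations for $b$ and $c$ from equation (288),

$$I_a = \cos^2\theta_a\cos(2\theta_a) \qquad I_b = \cos^2\theta_b\cos(2\theta_b) \qquad I_c = \cos^2\theta_c\cos(2\theta_c) \tag{293}$$

The conventionally calculated energy for a particle in a box is given by

$$E_{ec} = \frac{\hbar^2\pi^2}{2\mu}\left(\frac{n^2}{a_m^2} + \frac{m^2}{b_m^2} + \frac{\ell^2}{c_m^2}\right) \tag{294}$$

and therefore

$$E_{ec} - E_{em} = \frac{\hbar^2\pi^2}{2\mu}\left[(1 - I_a)\frac{n^2}{a_m^2} + (1 - I_b)\frac{m^2}{b_m^2} + (1 - I_c)\frac{\ell^2}{c_m^2}\right] \tag{295A}$$

$$\approx \frac{3\hbar^2\pi^2}{2\mu}\left(\frac{n^2\theta_a^2}{a_m^2} + \frac{m^2\theta_b^2}{b_m^2} + \frac{\ell^2\theta_c^2}{c_m^2}\right)$$

For $\theta_a \approx \theta_b \approx \theta_c \approx \theta_\alpha$ it follows from equations (294) and (295A) that

$$(E_{ec} - E_{em})/E_{ec} \approx 3\theta_\alpha^2 \tag{295B}$$

The difference between the predicted and measured energies may be detectable and may lead to an estimate of the value of $\theta_\alpha$.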
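The small-angle estimate in equation (295B) can be checked numerically. The sketch below assumes the symmetry factor $I = \cos^2\theta\,\cos 2\theta$, which is what results when the box dimension in equation (292) is expressed through its measured value $a_m = a\cos\theta_a$; the function names are hypothetical.

```python
import math

def symmetry_factor(theta):
    """I = cos^2(theta) cos(2 theta), the factor multiplying n^2 / a_m^2."""
    return math.cos(theta) ** 2 * math.cos(2 * theta)

def relative_energy_shift(theta):
    """(E_ec - E_em) / E_ec when all three internal phase angles equal theta."""
    return 1.0 - symmetry_factor(theta)

if __name__ == "__main__":
    for theta in (0.001, 0.01, 0.1):
        print(theta, relative_energy_shift(theta), 3 * theta ** 2)
```

The exact shift and the estimate $3\theta^2$ agree to leading order; the next correction is of fourth order in the phase angle.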


Consider a rigid rotator oscillating between two angular positions $\phi = 0$ and $\phi = \phi_a$. Then the wave function is given by a form similar to equation (78) for broken symmetry space

$$\Phi = A\,e^{i\bar{M}_o\bar{\phi}} \tag{296}$$

which must be a solution of Schrödinger's equation for a body rotating about the z axis

$$-\frac{\hbar^2}{2\bar{I}}\,\frac{\partial^2\Phi}{\partial\bar{\phi}^2} = \bar{E}^e_z\Phi \tag{297}$$

which gives the energy as

$$\bar{E}^e_z = \hbar^2\bar{M}_o^2/(2\bar{I}) \tag{298}$$
whose real part gives the measured energy as

$$E^e_{zm} = \frac{\hbar^2M_o^2}{2I}\cos(2\theta_{Mo} - \theta_I) \tag{299}$$

where the constant $\bar{M}_o$ for the external rotational oscillator is written as

$$\bar{M}_o = M_o\,e^{j\theta_{Mo}} \tag{300}$$

and the values of $M_o$ and $\theta_{Mo}$ are to be determined from boundary and periodicity conditions. The periodicity conditions give⁷

$$\theta_{Mo} = -\theta_\phi \qquad \bar{M}_o\bar{\phi} = M_o\phi \tag{301}$$


In order to satisfy the boundary conditions the wave function derived from equation (296) for the rotational oscillator must be of the form

$$\Phi = A\sin(M_o\phi) \tag{302}$$

where $M_o$ is to be determined by the boundary condition at $\phi = \phi_a$. The condition that $\Phi = 0$ at $\phi = \phi_a$ gives

$$M_o\phi_a = n\pi \tag{303}$$

or

$$M_o = n\pi/\phi_a \tag{304}$$

Therefore the measured energy is obtained from equations (299), (301) and (304) to be

$$E^e_{zm} = \frac{\hbar^2\pi^2}{2I}\left(\frac{n}{\phi_a}\right)^2\cos(2\theta_\phi + \theta_I) \tag{305}$$


B. Particle in Internal Space Box

Consider a box in internal coordinate space with variables $\theta_x$, $\theta_y$ and $\theta_z$ which are bounded by

$$\theta_a < \theta_x < \theta_b \qquad \theta_c < \theta_y < \theta_d \qquad \theta_e < \theta_z < \theta_f \tag{306}$$

corresponding to the external coordinates $x$, $y$, $z$ which are constants. For the wave function take the real values of equation (254) as follows
$$\Psi_i = \Psi_{ix}\Psi_{iy}\Psi_{iz} \tag{307}$$

where

$$\Psi_{ix} = \sin[k_{ix}x\sin(\theta_{kix} + \theta_x)]$$

$$\Psi_{iy} = \sin[k_{iy}y\sin(\theta_{kiy} + \theta_y)] \tag{308}$$

$$\Psi_{iz} = \sin[k_{iz}z\sin(\theta_{kiz} + \theta_z)]$$

At the boundaries of the internal space box the wave function must be zero. The conditions for the vanishing of the wave function at the boundaries are

$$k_{ix}x\sin(\theta_{kix} + \theta_b) = n\pi \tag{309}$$

$$k_{ix}x\sin(\theta_{kix} + \theta_a) = 0 \tag{310}$$

$$k_{iy}y\sin(\theta_{kiy} + \theta_d) = m\pi \tag{311}$$

$$k_{iy}y\sin(\theta_{kiy} + \theta_c) = 0 \tag{312}$$

$$k_{iz}z\sin(\theta_{kiz} + \theta_f) = \ell\pi \tag{313}$$

$$k_{iz}z\sin(\theta_{kiz} + \theta_e) = 0 \tag{314}$$

where $n$, $m$ and $\ell$ are integers. From equations (310), (312) and (314) it follows that

$$\theta_{kix} = -\theta_a \qquad \theta_{kiy} = -\theta_c \qquad \theta_{kiz} = -\theta_e \tag{315}$$


Combining equations (309), (311), (313) and (315) gives

$$k_{ix} = n\pi/[x\sin(\theta_b - \theta_a)]$$

$$k_{iy} = m\pi/[y\sin(\theta_d - \theta_c)] \tag{316}$$

$$k_{iz} = \ell\pi/[z\sin(\theta_f - \theta_e)]$$

where $n$, $m$ and $\ell = 0,1,2,3,4,\ldots$ . Placing the results of equations (315) and (316) into the expression for the measured energy eigenvalues given by equation (257) yields for a particle in an internal space box

$$E_{im} = \frac{\hbar^2\pi^2}{2\mu}\left(J_x\frac{n^2}{x^2} + J_y\frac{m^2}{y^2} + J_z\frac{\ell^2}{z^2}\right) \tag{317}$$

where

$$J_x = \cos(2\theta_a)/\sin^2(\theta_b - \theta_a)$$

$$J_y = \cos(2\theta_c)/\sin^2(\theta_d - \theta_c) \tag{318}$$

$$J_z = \cos(2\theta_e)/\sin^2(\theta_f - \theta_e)$$
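The energy levels of the internal space box follow mechanically from the boundary angles. The sketch below evaluates equation (317) and cross-checks it against equation (257) with the wave numbers and phases of equations (315) and (316). It is an illustration only: the prefactor $\hbar^2\pi^2/(2\mu)$ is passed in as a plain number (default 1), and the function names and argument layout are hypothetical.

```python
import math

def j_factor(theta_lo, theta_hi):
    """J = cos(2 theta_lo) / sin^2(theta_hi - theta_lo), as in equation (318)."""
    return math.cos(2 * theta_lo) / math.sin(theta_hi - theta_lo) ** 2

def internal_box_energy(n, m, l, x, y, z, bounds, prefactor=1.0):
    """Measured energy of equation (317); bounds = ((ta, tb), (tc, td), (te, tf))."""
    (ta, tb), (tc, td), (te, tf) = bounds
    return prefactor * (j_factor(ta, tb) * n ** 2 / x ** 2
                        + j_factor(tc, td) * m ** 2 / y ** 2
                        + j_factor(te, tf) * l ** 2 / z ** 2)

def energy_from_wave_numbers(n, m, l, x, y, z, bounds, prefactor=1.0):
    """Same energy via equation (257): sum of k^2 cos(2 theta_k), in units where
    hbar^2/(2 mu) = prefactor / pi^2."""
    total = 0.0
    for q, size, (lo, hi) in zip((n, m, l), (x, y, z), bounds):
        k = q * math.pi / (size * math.sin(hi - lo))   # equation (316)
        theta_k = -lo                                   # equation (315)
        total += k ** 2 * math.cos(2 * theta_k)
    return prefactor * total / math.pi ** 2

if __name__ == "__main__":
    b = ((0.1, 0.9), (0.2, 1.0), (0.0, 0.5))
    print(internal_box_energy(1, 2, 3, 1.0, 2.0, 1.5, b))
    print(energy_from_wave_numbers(1, 2, 3, 1.0, 2.0, 1.5, b))
```

Note that the energy depends on the external position through $x$, $y$, $z$ and diverges as the angular width of the box goes to zero.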

During the internal motion of the particle the measured position of the particle also changes because $\alpha_m = \alpha\cos\theta_\alpha$ with $\alpha$ = constant and $\theta_\alpha$ being variables.

The values of $x$, $y$ and $z$ must be measured relative to some fixed center of force, such as the nucleus of an atom, when the internal oscillations of an electron are being considered. This determines $E_{im}(x,y,z)$. The energy of physical interest is then

$$U_m = \int n\,E_{im}\,dV \tag{319}$$

where $n$ = particle number density. The quantity $U_m$ gives the measured energy associated with the internal oscillations of the coordinates. The integral in equation (319) may be logarithmically divergent, and renormalization techniques will have to be used for the calculation of the internal energy.

Finally, consider the angular internal rotational oscillations corresponding to a fixed angular magnitude $\phi$ of a body. The wave function for this internal space rigid rotator is given by equation (274). Therefore considering only a real portion of the wave function gives

$$\Phi_i(\theta_\phi) = \sin[M_i\phi\sin(\theta_{Mi} + \theta_\phi)] \tag{320}$$

If the rotator is restricted to an angular sector in internal space given by $\theta_a < \theta_\phi < \theta_b$ then

$$M_i\phi\sin(\theta_{Mi} + \theta_a) = 0 \tag{321}$$

$$M_i\phi\sin(\theta_{Mi} + \theta_b) = n\pi \tag{322}$$

where $n = 0,1,2,3,\ldots$ . Equation (321) gives

$$\theta_{Mi} = -\theta_a \tag{323}$$

while equation (322) gives

$$M_i = n\pi/[\phi\sin(\theta_b - \theta_a)] \tag{324}$$

Substituting equations (323) and (324) into equation (281) gives the measured angular oscillation energy eigenvalues as

$$E_{zim} = \frac{\hbar^2\pi^2}{2I}\left(\frac{n}{\phi}\right)^2\frac{\cos(2\theta_a + \theta_I)}{\sin^2(\theta_b - \theta_a)} \tag{325}$$

During the internal angular oscillations, the measured angle will oscillate according to $\phi_m = \phi\cos\theta_\phi$ where $\phi$ = constant.

5. LINEAR HARMONIC OSCILLATOR. This section studies the simple harmonic oscillator in broken symmetry spacetime. The study is separated into four parts: a treatment of the one- and three-dimensional harmonic oscillators in external spacetime with broken internal symmetries, and a discussion of the one- and three-dimensional harmonic oscillators in internal space.

A.  One-Dimensional  Harmonic  Oscillator  in  External  Space  with  Broken 
Internal  Symmetries. 


The standard Schrödinger equation for the one-dimensional simple harmonic oscillator is given by

$$d^2\Psi/dx^2 + (\lambda - \alpha^2x^2)\Psi = 0 \tag{326}$$

where

$$\lambda = 8\pi^2\mu E/h^2 \tag{327}$$

$$\alpha = 4\pi^2\mu\nu/h \tag{328}$$

After a number of transformations, the solution comes down to a power series whose coefficients are related by⁸

$$a_{\sigma+2}/a_\sigma = -(\lambda/\alpha - 2\sigma - 1)/[(\sigma + 1)(\sigma + 2)] \tag{329}$$

where $\sigma$ = integer. Only if the series breaks off at a value of $\sigma = k$ can a finite polynomial solution be found. Therefore the energy eigenvalues are

$$E_k = (k + 1/2)h\nu \tag{330}$$

where $k = 0,1,2,3,\ldots$ . The eigenfunctions are⁸

$$\Psi_k = N_k\,e^{-\xi^2/2}H_k(\xi) \tag{331}$$

$$\xi = \sqrt{\alpha}\,x \tag{332}$$

where $H_k(\xi)$ = Hermite polynomial of degree $k$.
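The break-off condition can be made concrete by iterating the recursion (329) directly. The following sketch is illustrative only; the function name and the choice of starting coefficients are assumptions. The series terminates exactly when $\lambda/\alpha = 2k + 1$, which with equations (327) and (328) is $E = (k + 1/2)h\nu$.

```python
def series_coefficients(lam_over_alpha, a0, a1, terms):
    """Coefficients a_0 ... a_{terms-1} generated by the recursion (329)."""
    coeffs = [a0, a1]
    for sigma in range(terms - 2):
        ratio = -(lam_over_alpha - 2 * sigma - 1) / ((sigma + 1) * (sigma + 2))
        coeffs.append(ratio * coeffs[sigma])
    return coeffs

if __name__ == "__main__":
    # k = 2: lambda/alpha = 5, even series; gives 1 - 2 xi^2, proportional to H_2
    print(series_coefficients(5, 1.0, 0.0, 8))
    # lambda/alpha = 4 is not of the form 2k + 1; the series never terminates
    print(series_coefficients(4, 1.0, 0.0, 8))
```

For $k = 2$ the surviving polynomial $1 - 2\xi^2$ is proportional to the Hermite polynomial $H_2(\xi) = 4\xi^2 - 2$.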

The generalization of equation (326) to broken symmetry space and time with $\theta_x$ and $\theta_t$ as constants is given by

(333)

The measured energy eigenvalues are given by

$$E^e_{km} = E^e_k\cos\theta_{Ek} = (k + 1/2)h\nu_e\cos\theta_{\nu e} = (k + 1/2)h\nu_{em} \tag{343}$$


Thus the standard result is valid if measured energies and frequencies are used as in equation (343). The internal phase angle of the frequency is related to the fundamental asymmetry of spacetime because $\theta_{\nu e} = -\theta_t$. Therefore for the one-dimensional linear harmonic oscillator, the effects of broken spacetime symmetry occur only through a complex number frequency and not through a complex quantum number. It will be shown subsequently that for the three-dimensional linear harmonic oscillator a complex quantum number must be introduced in addition to a complex number frequency.

B.  Three-Dimensional  Linear  Harmonic  Oscillator  in  Broken  Symmetry 
External  Space. 

The radial component of the standard Schrödinger equation for the isotropic three-dimensional simple harmonic oscillator is given by¹²,¹³

$$\frac{d^2u}{dr^2} - \left[\frac{\ell(\ell + 1)}{r^2} - \frac{2\mu}{\hbar^2}\left(E - \tfrac{1}{2}\mu\omega^2r^2\right)\right]u = 0 \tag{344}$$

where the total wave function is written as

$$\Psi = \frac{1}{r}\,u(r)\,W(\psi)\,\Phi(\phi) \tag{345}$$

After a sequence of standard transformations the solution can be written as an infinite series whose coefficients are related by¹²,¹³

$$c_{k+1}/c_k = -[E/(h\nu) - \ell - 2k - 3/2]/[(k + 1)(2\ell + 2k + 3)] \tag{346}$$

The condition for a break off solution is¹²,¹³

$$E = (2k + \ell + 3/2)h\nu \tag{347}$$

$$= (n + 1/2)h\nu$$

$$= (2n' + \ell - 1/2)h\nu$$

where some standard notations in common use are

$$n = 2k + \ell + 1 \tag{348}$$

$$n' = k + 1 \tag{349}$$

and $k = 0,1,2,3,\ldots$ , $n = 1,2,3,\ldots$ , $n' = 1,2,3,\ldots$ , and $\ell = 0,1,2,3,\ldots$ . Note that $n$ = principal quantum number.

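The generalization to the case of the three-dimensional linear harmonic oscillator in broken symmetry spacetime with $\theta_r$, $\theta_\psi$ and $\theta_\phi$ as constants is given by

$$\frac{d^2u_e}{d\bar{r}^2} - \left[\frac{\bar{\ell}_e(\bar{\ell}_e + 1)}{\bar{r}^2} - \frac{2\mu}{\hbar^2}\left(\bar{E}_e - \tfrac{1}{2}\mu\bar{\omega}_e^2\bar{r}^2\right)\right]u_e = 0 \tag{350}$$

where

$$d\bar{r} = dr\,e^{j\theta_r} \tag{351}$$

and where $\bar{\ell}_e$ = complex angular momentum quantum number. A break off solution to the broken symmetry zenith angle equation does not require $\bar{\ell}_e$ to be an integer but only that⁷

$$\bar{\ell}_e - \bar{M}'_e = \ell - |m| = \text{integer} \tag{352}$$

where⁷

$$\bar{M}'_e = |m|\cos\theta_\phi\,e^{-j\theta_\phi} \tag{353}$$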

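For this case it is easy to see that the break off solution for equation (350) is given by the following generalization to equation (346)

$$\bar{c}_{k+1}/\bar{c}_k = -[\bar{E}_e/(h\bar{\nu}_e) - \bar{\ell}_e - 2k - 3/2]/[(k + 1)(2\bar{\ell}_e + 2k + 3)] \tag{354}$$

so that the complex number energy can be written in the following equivalent forms

$$\bar{E}_e = (\bar{n}_e + 1/2)h\bar{\nu}_e \tag{355}$$

$$= (\bar{\ell}_e + 2k + 3/2)h\bar{\nu}_e$$

$$= (\bar{\ell}_e + 2n' - 1/2)h\bar{\nu}_e$$

$$= (\bar{M}'_e + n - |m| + 1/2)h\bar{\nu}_e$$

$$= (\bar{M}'_e + \ell - |m| + 2k + 3/2)h\bar{\nu}_e$$

where

$$\bar{n}_e = n_e\,e^{j\theta_{ne}} = 2k + \bar{\ell}_e + 1 \tag{356}$$

$$= 2k + \bar{M}'_e + \ell - |m| + 1$$

$$= \bar{M}'_e + n - |m|$$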


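For $\theta_\phi = 0$ equation (356) reduces to equation (348). From equations (352), (353) and (356) it follows that⁷

$$n_e\cos\theta_{ne} = n - |m|\sin^2\theta_\phi \tag{357}$$

$$n_e\sin\theta_{ne} = -|m|\cos\theta_\phi\sin\theta_\phi \tag{358}$$

$$\ell_e\cos\theta_{\ell e} = \ell - |m|\sin^2\theta_\phi \tag{359}$$

$$\ell_e\sin\theta_{\ell e} = -|m|\cos\theta_\phi\sin\theta_\phi \tag{360}$$

so that⁷

$$n_e = n\left[1 - \frac{|m|}{n}\left(2 - \frac{|m|}{n}\right)\sin^2\theta_\phi\right]^{1/2} \tag{361}$$

$$\tan\theta_{ne} = -(|m|\cos\theta_\phi\sin\theta_\phi)/(n - |m|\sin^2\theta_\phi) \tag{362}$$

$$\ell_e = \ell\left[1 - \frac{|m|}{\ell}\left(2 - \frac{|m|}{\ell}\right)\sin^2\theta_\phi\right]^{1/2} \tag{363}$$

$$\tan\theta_{\ell e} = -(|m|\cos\theta_\phi\sin\theta_\phi)/(\ell - |m|\sin^2\theta_\phi) \tag{364}$$

These quantities are required to calculate the measured values of the energy eigenvalues.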
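The magnitude and phase relations (357) through (362) can be reproduced by building the complex principal quantum number directly. This is a minimal sketch, assuming $\bar{n}_e = \bar{M}'_e + n - |m|$ with $\bar{M}'_e = |m|\cos\theta_\phi\,e^{-j\theta_\phi}$ as in equations (353) and (356); the function name is hypothetical.

```python
import cmath
import math

def complex_principal_number(n, m_abs, theta_phi):
    """n_e_bar = M'_e + n - |m|, with M'_e = |m| cos(theta_phi) exp(-j theta_phi)."""
    m_prime = m_abs * math.cos(theta_phi) * cmath.exp(-1j * theta_phi)
    return m_prime + n - m_abs

if __name__ == "__main__":
    n, m_abs, theta_phi = 3, 2, 0.2
    n_bar = complex_principal_number(n, m_abs, theta_phi)
    print("real part:", n_bar.real)        # compare with equation (357)
    print("imaginary part:", n_bar.imag)   # compare with equation (358)
    print("magnitude:", abs(n_bar))        # compare with equation (361)
    print("phase:", cmath.phase(n_bar))    # compare with equation (362)
```

The same function applies to $\bar{\ell}_e$ on replacing $n$ by $\ell$, which gives equations (359), (360), (363) and (364).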

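The measured energy eigenvalues are obtained from equation (355) to be

$$E_{em}/(h\nu_e) = n_e\cos(\theta_{ne} + \theta_{\nu e}) + \tfrac{1}{2}\cos\theta_{\nu e} \tag{365}$$

$$= \ell_e\cos(\theta_{\ell e} + \theta_{\nu e}) + (2k + 3/2)\cos\theta_{\nu e}$$

$$= \ell_e\cos(\theta_{\ell e} + \theta_{\nu e}) + (2n' - 1/2)\cos\theta_{\nu e}$$

$$= |m|\cos\theta_\phi\cos(\theta_{\nu e} - \theta_\phi) + (n - |m| + 1/2)\cos\theta_{\nu e}$$

$$= |m|\cos\theta_\phi\cos(\theta_{\nu e} - \theta_\phi) + (\ell - |m| + 2k + 3/2)\cos\theta_{\nu e}$$

The imaginary part $E_{eI}$ of the energy in equation (355) can be obtained as

$$E_{eI}/(h\nu_e) = n_e\sin(\theta_{ne} + \theta_{\nu e}) + \tfrac{1}{2}\sin\theta_{\nu e} \tag{366}$$

$$= \ell_e\sin(\theta_{\ell e} + \theta_{\nu e}) + (2k + 3/2)\sin\theta_{\nu e}$$

$$= \ell_e\sin(\theta_{\ell e} + \theta_{\nu e}) + (2n' - 1/2)\sin\theta_{\nu e}$$

$$= |m|\cos\theta_\phi\sin(\theta_{\nu e} - \theta_\phi) + (n - |m| + 1/2)\sin\theta_{\nu e}$$

$$= |m|\cos\theta_\phi\sin(\theta_{\nu e} - \theta_\phi) + (\ell - |m| + 2k + 3/2)\sin\theta_{\nu e}$$

so that

$$E_e = (E_{eR}^2 + E_{eI}^2)^{1/2} \qquad \tan\theta_{Ee} = E_{eI}/E_{eR} \tag{367}$$

If $\theta_{\nu e} = -\theta_t$ is negligible in equation (365) these expressions become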
$$E_{em}/(h\nu_e) \approx n_e\cos\theta_{ne} + 1/2 \tag{368}$$

$$= \ell_e\cos\theta_{\ell e} + 2k + 3/2$$

$$= \ell_e\cos\theta_{\ell e} + 2n' - 1/2$$

$$= n + 1/2 - |m|\sin^2\theta_\phi$$

$$= \ell + 2k + 3/2 - |m|\sin^2\theta_\phi$$

$$E_{eI}/(h\nu_e) \approx n_e\sin\theta_{ne} \tag{369}$$

$$= \ell_e\sin\theta_{\ell e}$$

$$= -|m|\cos\theta_\phi\sin\theta_\phi$$

$$\tan\theta_{Ee} \approx (n_e\sin\theta_{ne})/(n_e\cos\theta_{ne} + 1/2) \tag{370}$$

$$= -(|m|\cos\theta_\phi\sin\theta_\phi)/(n + 1/2 - |m|\sin^2\theta_\phi)$$

$$\theta_{Ee} \approx -|m|\theta_\phi/(n + 1/2)$$
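The measured energy of equation (365) and its limiting forms can be evaluated from the complex energy itself. The sketch below assumes $\bar{E}_e/(h\nu_e) = (\bar{n}_e + 1/2)\,e^{j\theta_{\nu e}}$ with $\bar{n}_e = |m|\cos\theta_\phi\,e^{-j\theta_\phi} + n - |m|$, following equations (353), (355) and (356); the function names are hypothetical.

```python
import cmath
import math

def complex_energy_ratio(n, m_abs, theta_phi, theta_nu):
    """E_e_bar / (h nu_e) = (n_e_bar + 1/2) exp(j theta_nu)."""
    m_prime = m_abs * math.cos(theta_phi) * cmath.exp(-1j * theta_phi)
    n_bar = m_prime + n - m_abs
    return (n_bar + 0.5) * cmath.exp(1j * theta_nu)

def measured_energy_365(n, m_abs, theta_phi, theta_nu):
    """Fourth form of equation (365), in units of h nu_e."""
    return (m_abs * math.cos(theta_phi) * math.cos(theta_nu - theta_phi)
            + (n - m_abs + 0.5) * math.cos(theta_nu))

if __name__ == "__main__":
    e = complex_energy_ratio(3, 2, 0.01, 0.0)
    print(e.real, 3 + 0.5 - 2 * math.sin(0.01) ** 2)   # equation (368)
    print(cmath.phase(e), -2 * 0.01 / (3 + 0.5))        # small-angle phase
```

For small $\theta_\phi$ and negligible $\theta_{\nu e}$ the phase of the complex energy approaches $-|m|\theta_\phi/(n + 1/2)$.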

C.  One-Dimensional  Linear  Harmonic  Oscillator  in  Internal  Space. 

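Consider the internal harmonic motion at a point along the x axis. For this case the magnitude $x$ = constant and the potential energy is written in terms of the variable internal phase angle $\theta_x$ as

$$V_i(\theta_x) = \tfrac{1}{2}K_{ix}\theta_x^2 \tag{371}$$

The Schrödinger equation (250) for this case is written as

$$\frac{\hbar^2}{2\mu x^2}\,e^{-2j\theta_x}\left(\frac{\partial^2\Psi_i}{\partial\theta_x^2} - j\,\frac{\partial\Psi_i}{\partial\theta_x}\right) + \tfrac{1}{2}K_{ix}\theta_x^2\Psi_i = \bar{E}_i\Psi_i \tag{372}$$

Equivalently Schrödinger's equation can be written in the form of equation (266) as

$$-\frac{\hbar^2}{2\mu}\,\frac{\partial^2\Psi_i}{\partial\bar{x}^2} + W_i\Psi_i = \bar{E}_i\Psi_i \tag{373}$$

$$W_i(\bar{x}) = -\tfrac{1}{2}K_{ix}\ln^2(\bar{x}/x) \tag{374}$$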


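where because $x$ = constant

$$d\bar{x} = j\bar{x}\,d\theta_x \tag{375}$$

Equation (373) can be written as

$$\frac{\partial^2\Psi_i}{\partial\bar{x}^2} + \left[\bar{\lambda}_i + \alpha_i^2\ln^2(\bar{x}/x)\right]\Psi_i = 0 \tag{376}$$

where

$$\bar{\lambda}_i = 2\mu\bar{E}_i/\hbar^2 \tag{377}$$

$$K_{ix} = 4\pi^2\mu\nu_i^2 = \mu\omega_i^2 \tag{378}$$

$$\alpha_i^2 = \mu K_{ix}/\hbar^2 = 16\pi^4\mu^2\nu_i^2/h^2 \tag{379}$$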

D.  Three-Dimensional  Linear  Harmonic  Oscillator  in  Internal  Space. 

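The potential energy of a three-dimensional internal space harmonic oscillator with $x$, $y$, $z$ = constants is written as

$$V_i = \tfrac{1}{2}\left(K_{ix}\theta_x^2 + K_{iy}\theta_y^2 + K_{iz}\theta_z^2\right) \tag{381}$$

For central symmetry in internal space this simplifies to

$$V_i(\theta_r) = \tfrac{1}{2}K_{ir}\theta_r^2 = -\tfrac{1}{2}K_{ir}\ln^2(\bar{r}/r) = W_i(\bar{r}) \tag{382}$$

where $r$ = constant. The radial Schrödinger equation becomes

$$\frac{d^2u_i}{d\bar{r}^2} - \left\{\frac{\bar{\ell}_i(\bar{\ell}_i + 1)}{\bar{r}^2} - \frac{2\mu}{\hbar^2}\left[\bar{E}_i + \tfrac{1}{2}\mu\omega_i^2\ln^2(\bar{r}/r)\right]\right\}u_i = 0 \tag{383}$$

where

$$d\bar{r} = j\bar{r}\,d\theta_r \tag{384}$$

$$\omega_i^2 = K_{ir}/\mu \tag{385}$$

and where

$$\bar{\ell}_i = \bar{M}_i + \delta \tag{386}$$

where $\delta = 0,1,2,3,\ldots$ . Equation (383) can be rewritten as

$$\frac{d^2u_i}{d\bar{r}^2} + \left[\bar{\lambda}_i + \alpha_i^2\ln^2(\bar{r}/r) - \frac{\bar{\ell}_i(\bar{\ell}_i + 1)}{\bar{r}^2}\right]u_i = 0 \tag{387}$$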
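where $\bar{\lambda}_i$ and $\alpha_i$ are given by equations (377), (380) and (385). Making the substitution $u_i = \bar{r}R_i$ in equation (387) gives

$$\frac{d^2R_i}{d\bar{r}^2} + \frac{2}{\bar{r}}\,\frac{dR_i}{d\bar{r}} + \left[\bar{\lambda}_i + \alpha_i^2\ln^2(\bar{r}/r) - \frac{\bar{\ell}_i(\bar{\ell}_i + 1)}{\bar{r}^2}\right]R_i = 0 \tag{388}$$

Equations equivalent to (387) and (388) can be written using equation (384) as follows

$$-\frac{d^2u_i}{d\theta_r^2} + j\,\frac{du_i}{d\theta_r} + \left[(\bar{\lambda}_i - \alpha_i^2\theta_r^2)\bar{r}^2 - \bar{\ell}_i(\bar{\ell}_i + 1)\right]u_i = 0 \tag{389}$$

$$-\frac{d^2R_i}{d\theta_r^2} - j\,\frac{dR_i}{d\theta_r} + \left[(\bar{\lambda}_i - \alpha_i^2\theta_r^2)\bar{r}^2 - \bar{\ell}_i(\bar{\ell}_i + 1)\right]R_i = 0 \tag{390}$$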


The  author  has  not  solved  equations  (387)  through  (390). 

6.  ADDITION  OF  ANGULAR  MOMENTUM.  This  section  studies  the  addition  of 
angular  momentum  in  space  and  time  with  broken  internal  symmetries.  It  will  be 
shown  that  the  standard  addition  law  for  angular  momenta  must  be  modified  for 
the  case  where  the  magnetic  quantum  number  exhibits  a  broken  internal  symmetry. 

A.  Complex  External  Magnetic  Quantum  Numbers. 

Consider two complex magnetic quantum numbers that describe the motion of two particles in external space with constant broken internal symmetries⁷

$$\bar{M}_1 = M_1e^{j\theta_{M1}} = m_1\cos\theta_{M1}\,e^{j\theta_{M1}} = m_1\cos\theta_{\phi 1}\,e^{-j\theta_{\phi 1}} \tag{391}$$

$$\bar{M}_2 = M_2e^{j\theta_{M2}} = m_2\cos\theta_{M2}\,e^{j\theta_{M2}} = m_2\cos\theta_{\phi 2}\,e^{-j\theta_{\phi 2}} \tag{392}$$

where equation (80) was used for external motion with $\theta_M = -\theta_\phi$ = constants and where $m_1$ and $m_2$ are integers $0, \pm 1, \pm 2, \pm 3, \ldots$ . The ordinary rule of addition of magnetic quantum numbers would give¹⁰,¹¹

$$\bar{M} = \bar{M}_1 + \bar{M}_2 \tag{393}$$

where $\bar{M}$, being a magnetic quantum number, is itself written as

$$\bar{M} = Me^{j\theta_M} = m\cos\theta_M\,e^{j\theta_M} = m\cos\theta_\phi\,e^{-j\theta_\phi} \tag{394}$$

where $m$ should be given by

$$m = m_1 + m_2 = \text{integer} \tag{395}$$

and $\theta_M$ = constant. The real and imaginary parts of equation (393) are

$$m\cos^2\theta_M = m_1\cos^2\theta_{M1} + m_2\cos^2\theta_{M2} \tag{396}$$

$$m\sin\theta_M\cos\theta_M = m_1\sin\theta_{M1}\cos\theta_{M1} + m_2\sin\theta_{M2}\cos\theta_{M2} \tag{397}$$
Equations (396) and (397) determine $m$ and $\theta_M$ in terms of $m_1$, $\theta_{M1}$ and $m_2$, $\theta_{M2}$. In general the calculated value of $m$ will not be the integer $m = m_1 + m_2$ except for the special case of $\theta_M = \theta_{M1} = \theta_{M2}$. Therefore equations (396) and (397) do not yield integer values for $m$, and therefore equation (393) in general does not agree with equation (395). This suggests that in broken symmetry spacetime there exists an internal space scalar particle $A$ such that the addition law for complex magnetic quantum numbers must be written as

$$\bar{M} + A = \bar{M}_1 + \bar{M}_2 \tag{398}$$

instead of equation (393). Equation (398) then represents two equations for the two unknowns $A$ and $\theta_M$ in the following manner

$$m\cos^2\theta_M + A = m_1\cos^2\theta_{M1} + m_2\cos^2\theta_{M2} \tag{399}$$

$$m\sin\theta_M\cos\theta_M = m_1\sin\theta_{M1}\cos\theta_{M1} + m_2\sin\theta_{M2}\cos\theta_{M2} \tag{400}$$

where the integer $m$ is given by equation (395).

The solution of equations (399) and (400) is easily obtained to be

$$A = -\frac{m}{2}\left[1 + (1 - 4f^2)^{1/2}\right] + m_1\cos^2\theta_{M1} + m_2\cos^2\theta_{M2} \tag{401}$$

$$\cos^2\theta_M = \tfrac{1}{2}\left[1 + (1 - 4f^2)^{1/2}\right] \tag{402}$$

where

$$f = \frac{1}{m}\left(m_1\sin\theta_{M1}\cos\theta_{M1} + m_2\sin\theta_{M2}\cos\theta_{M2}\right) \tag{403}$$

If $m = 0$, then $A = 0$. Equation (401) can be written as

$$A = -m\cos^2\theta_M + m_1\cos^2\theta_{M1} + m_2\cos^2\theta_{M2} \tag{404}$$

If $|m_1| = |m_2|$ then $m = 0$ or $2m_1$, and $A = 0$ and $\theta_M = \theta_{M1} = \theta_{M2}$ for this special case. Thus only when $|m_1| \neq |m_2|$ will the $A$ particle carry angular momentum. Therefore in general equation (393) is not valid except for the special case $|m_1| = |m_2|$. If $\theta_M = \theta_{M1} = \theta_{M2}$, then $A = 0$ for all values of $m_1$ and $m_2$. Finally, if $\theta_{M1} = 0$ and $\theta_{M2} = 0$, corresponding to an internally symmetrical system, then $\theta_M = 0$ and $A = 0$.
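Equations (401) through (404) amount to a small algorithm: form $f$ from the two input quantum numbers, solve for $\cos^2\theta_M$, then read off $A$. The following sketch implements that procedure under stated assumptions: $m = m_1 + m_2 \neq 0$, $|2f| \le 1$, and the branch $|\theta_M| < \pi/2$ with the sign of $\theta_M$ taken from the sign of $f$. The function name and return convention are hypothetical.

```python
import cmath
import math

def add_magnetic_numbers(m1, theta1, m2, theta2):
    """Return (A, theta_M) from equations (402)-(404) for integer m = m1 + m2."""
    m = m1 + m2
    if m == 0:
        raise ValueError("equation (403) divides by m; m = 0 is not handled here")
    f = (m1 * math.sin(theta1) * math.cos(theta1)
         + m2 * math.sin(theta2) * math.cos(theta2)) / m           # equation (403)
    if 4 * f * f > 1:
        raise ValueError("no real solution for theta_M")
    cos2 = 0.5 * (1 + math.sqrt(1 - 4 * f * f))                     # equation (402)
    theta_m = math.copysign(math.acos(math.sqrt(cos2)), f)
    a = (-m * cos2 + m1 * math.cos(theta1) ** 2
         + m2 * math.cos(theta2) ** 2)                              # equation (404)
    return a, theta_m

if __name__ == "__main__":
    a, theta_m = add_magnetic_numbers(1, 0.1, 2, 0.3)
    lhs = 3 * math.cos(theta_m) * cmath.exp(1j * theta_m) + a
    rhs = (1 * math.cos(0.1) * cmath.exp(0.1j)
           + 2 * math.cos(0.3) * cmath.exp(0.3j))
    print(a, theta_m, abs(lhs - rhs))
```

The printed residual confirms that the complex addition law (398) is satisfied by the computed pair. When the two input phase angles are equal the routine returns $A = 0$.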

The additional internal scalar particle that is required for the conservation of quantized angular momentum in systems with broken azimuthal symmetry is called the muthon.⁷ This particle may be important in the explanation of high temperature superconductivity in the planar copper oxides because it may mediate the attractive forces between Cooper pairs of electrons or holes. It may be responsible for the coherent time state that is associated with high temperature superconductivity.
B. Addition of External Angular Momenta

The standard result for the addition of angular momentum operators is that they add vectorially as follows⁹

$$\vec{J} = \vec{J}_1 + \vec{J}_2 \tag{405}$$

whose z-component equation is

$$J_z = J_{1z} + J_{2z} \tag{406}$$

These operators have the following eigenvalues⁹

$$J^2 = j(j + 1)\hbar^2 \tag{407}$$

$$J_z = m_J\hbar \tag{408}$$

where $j = 0,1,2,3,\ldots$ , and $m_J = j, j-1, j-2, \ldots, 0, -1, -2, \ldots, -j$.

The corresponding equations for a broken symmetry spacetime are

$$\vec{\bar{J}} + \vec{A} = \vec{\bar{J}}_1 + \vec{\bar{J}}_2 \tag{409}$$

$$\bar{J}_z + A_z = \bar{J}_{1z} + \bar{J}_{2z} \tag{410}$$

where $\vec{A}$ and $A_z$ do not have internal phases. The following eigenvalues are associated with external momenta in broken symmetry space and time

$$\bar{J}^2 = \bar{j}(\bar{j} + 1) \tag{411}$$

$$\bar{J}_z = \bar{M}_J = m_J\cos\theta_{MJ}\,e^{j\theta_{MJ}} \tag{412}$$

$$\bar{J}_{1z} = \bar{M}_{J1} = m_{J1}\cos\theta_{MJ1}\,e^{j\theta_{MJ1}} \tag{413}$$

$$\bar{J}_{2z} = \bar{M}_{J2} = m_{J2}\cos\theta_{MJ2}\,e^{j\theta_{MJ2}} \tag{414}$$

where for external motion $\theta_{MJ1}$, $\theta_{MJ2}$ and $\theta_{MJ}$ are constants, and⁷

$$\bar{j} = \bar{M}'_J + j - |m_J| \qquad \bar{M}'_J = |m_J|\cos\theta_{MJ}\,e^{j\theta_{MJ}} \tag{415}$$

and $\bar{M}_J$ is related to $\bar{M}_{J1}$ and $\bar{M}_{J2}$ by placing equations (412) through (414) into equation (410), which gives a result essentially equivalent to equation (398), and where as before $m_J = m_{J1} + m_{J2}$. Equation (410) determines $A_z$ and $\theta_{MJ}$.

For the case of the coupling of spin and angular momentum, the following are standard results⁹⁻¹¹

$$\vec{J} = \vec{L} + \vec{S} \tag{416}$$

$$J_z = L_z + S_z \tag{417}$$

$$L^2 = \ell(\ell + 1) \tag{418}$$

$$S^2 = s(s + 1) = 3/4 \tag{419}$$

$$L_z = m_L \tag{420}$$

$$S_z = m_S \tag{421}$$

$$J_z = m_J = m_L + m_S \tag{422}$$

$$J^2 = j(j + 1) \tag{423}$$

$$j = |m_J| + \nu \tag{424}$$

$$\ell = |m_L| + \nu' \tag{425}$$

$$s = |m_S| = 1/2 \tag{426}$$

where $\ell = 0,1,2,3,\ldots$; $m_L = 0, \pm 1, \pm 2, \ldots, \pm\ell$; $m_S = \pm 1/2$; $m_J = \pm 1/2, \pm 3/2, \pm 5/2, \ldots$; $j = 1/2, 3/2, 5/2, \ldots$; and $\nu$ and $\nu' = 0,1,2,3,\ldots$ .

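The generalization of these equations to the case of broken internal symmetry
for external motion is as follows

$\vec{\bar J} + \vec A = \vec{\bar L} + \vec{\bar S}$    (416A)

$\bar J_z + A_z = \bar L_z + \bar S_z$    (417A)

$\bar L^2 = \bar{\mathcal L}(\bar{\mathcal L} + 1)$    (418A)

$\bar S^2 = \bar{\mathcal S}(\bar{\mathcal S} + 1)$    (419A)

$\bar L_z = \bar M_L = m_L \cos\theta_{ML}\, e^{j\theta_{ML}}$    (420A)

$\bar S_z = \bar M_S = m_S \cos\theta_{MS}\, e^{j\theta_{MS}}$    (421A)

$\bar J_z = \bar M_J = m_J \cos\theta_{MJ}\, e^{j\theta_{MJ}} \qquad m_J = m_L + m_S$    (422A)

$\bar J^2 = \bar{\mathcal J}(\bar{\mathcal J} + 1)$    (423A)

$\bar{\mathcal J} = \bar M'_J + j - |m_J|$    (424A)

$\bar{\mathcal L} = \bar M'_L + l - |m_L|$    (425A)

$\bar{\mathcal S} = \bar M'_S$    (426A)

where

$\bar M'_J = |m_J| \cos\theta_{MJ}\, e^{j\theta_{MJ}} \qquad \bar M_J = m_J \cos\theta_{MJ}\, e^{j\theta_{MJ}}$    (427)

$\bar M'_L = |m_L| \cos\theta_{ML}\, e^{j\theta_{ML}} \qquad \bar M_L = m_L \cos\theta_{ML}\, e^{j\theta_{ML}}$    (428)

$\bar M'_S = |m_S| \cos\theta_{MS}\, e^{j\theta_{MS}} \qquad \bar M_S = m_S \cos\theta_{MS}\, e^{j\theta_{MS}}$    (429)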

where $m_S = \pm 1/2$ and $|m_S| = 1/2$ in equation (429). Note that $\vec A$ is a scalar
in internal space. The unknown constants $A_z$ and $\theta_{MJ}$ are then calculated from
equations (401) through (403) in terms of $m_L$, $m_S$, $\theta_{ML}$ and $\theta_{MS}$ using the
following value of the function $f$

$f = (1/m_J)(m_L \sin\theta_{ML} \cos\theta_{ML} + m_S \sin\theta_{MS} \cos\theta_{MS})$    (430)


For the case of spin and orbital angular momentum $m_J \neq 0$ because spin is
half-integer and $m_J = m_L + m_S \neq 0$. For this case $A_z \neq 0$ always. From equation
(417A) it follows that

$m_J \cos\theta_{MJ}\, e^{j\theta_{MJ}} + A_z = m_L \cos\theta_{ML}\, e^{j\theta_{ML}} + m_S \cos\theta_{MS}\, e^{j\theta_{MS}}$    (431)

whose real and imaginary parts are given by

$m_J \cos^2\theta_{MJ} + A_z = m_L \cos^2\theta_{ML} + m_S \cos^2\theta_{MS}$    (432)

$m_J \sin\theta_{MJ} \cos\theta_{MJ} = m_L \sin\theta_{ML} \cos\theta_{ML} + m_S \sin\theta_{MS} \cos\theta_{MS}$    (433)

This gives $A_z$ and $\theta_{MJ}$ as in equations (401) and (402). Note that equation (422),
$m_J = m_L + m_S$, is a universal law that holds whether or not spacetime has broken
internal symmetries and must be used in conjunction with equations (430), (432)
and (433) to determine $A_z$ and $\theta_{MJ}$.
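
The determination of $A_z$ and $\theta_{MJ}$ from equations (432) and (433) can be sketched numerically. The following is a minimal illustration, not the author's procedure: the function names are hypothetical, the identity $\sin x \cos x = \tfrac{1}{2}\sin 2x$ is used to isolate $\theta_{MJ}$, and the principal branch of the arcsine is an assumption of the sketch.

```python
import cmath
import math


def solve_phase_and_offset(m_L, theta_ML, m_S, theta_MS):
    """Solve equations (432)-(433) for theta_MJ and A_z.

    Uses m_J = m_L + m_S (equation (422)) and picks the principal branch
    of the arcsine, which is an assumption of this sketch.
    """
    m_J = m_L + m_S
    if m_J == 0:
        raise ValueError("m_J = 0: equation (433) cannot be divided by m_J")
    # Equation (433) rewritten with sin(x)cos(x) = sin(2x)/2.
    rhs = (m_L * math.sin(2 * theta_ML) + m_S * math.sin(2 * theta_MS)) / m_J
    if abs(rhs) > 1:
        raise ValueError("no real theta_MJ satisfies equation (433)")
    theta_MJ = 0.5 * math.asin(rhs)
    # Equation (432) solved for A_z.
    A_z = (m_L * math.cos(theta_ML) ** 2 + m_S * math.cos(theta_MS) ** 2
           - m_J * math.cos(theta_MJ) ** 2)
    return theta_MJ, A_z


def residual_431(m_L, theta_ML, m_S, theta_MS, theta_MJ, A_z):
    """Left side minus right side of the complex equation (431)."""
    m_J = m_L + m_S
    lhs = m_J * math.cos(theta_MJ) * cmath.exp(1j * theta_MJ) + A_z
    rhs = (m_L * math.cos(theta_ML) * cmath.exp(1j * theta_ML)
           + m_S * math.cos(theta_MS) * cmath.exp(1j * theta_MS))
    return lhs - rhs
```

Substituting the returned pair back into `residual_431` checks that both the real part (432) and the imaginary part (433) are satisfied at once; when all phase angles vanish the sketch returns $\theta_{MJ} = 0$ and $A_z = 0$.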

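By writing the complex quantum numbers $\bar{\mathcal J}$, $\bar{\mathcal L}$ and $\bar{\mathcal S}$ as

$\bar{\mathcal J} = \mathcal J e^{j\theta_J} \qquad \bar{\mathcal L} = \mathcal L e^{j\theta_L} \qquad \bar{\mathcal S} = \mathcal S e^{j\theta_S}$    (434)

equations (424A), (425A) and (426A) can be used to obtain expressions for
$\mathcal J$, $\theta_J$, $\mathcal L$, $\theta_L$ and $\mathcal S$, $\theta_S$. From equation (424A) it follows that

$\mathcal J \cos\theta_J = |m_J| \cos^2\theta_{MJ} + j - |m_J|$    (435)

$\mathcal J \sin\theta_J = |m_J| \sin\theta_{MJ} \cos\theta_{MJ}$    (436)

which gives

$\tan\theta_J = (|m_J| \sin\theta_{MJ} \cos\theta_{MJ})/(j - |m_J| \sin^2\theta_{MJ})$    (437)

$\theta_J \approx |m_J|\, \theta_{MJ}/j$    (438)

$\mathcal J = j\,[1 - (|m_J|/j)(2 - |m_J|/j) \sin^2\theta_{MJ}]^{1/2}$    (439)

Equation (425A) gives

$\mathcal L \cos\theta_L = |m_L| \cos^2\theta_{ML} + l - |m_L|$    (440)

$\mathcal L \sin\theta_L = |m_L| \sin\theta_{ML} \cos\theta_{ML}$    (441)

which gives$^{7}$

$\tan\theta_L = (|m_L| \sin\theta_{ML} \cos\theta_{ML})/(l - |m_L| \sin^2\theta_{ML})$    (442)

$\theta_L \approx |m_L|\, \theta_{ML}/l$    (443)

$\mathcal L = l\,[1 - (|m_L|/l)(2 - |m_L|/l) \sin^2\theta_{ML}]^{1/2}$    (444)

Finally for the spin, equations (426A) and (429) give

$\mathcal S = |m_S| \cos\theta_{MS} = (1/2) \cos\theta_{MS}$    (445)

$\theta_S = \theta_{MS}$    (446)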
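
The closed forms for the magnitude and phase of the complex quantum number can be checked against the defining relation (424A). The sketch below is only an illustration under stated assumptions: the function names are hypothetical, and the complex quantum number is taken to be $|m_J| \cos\theta_{MJ}\, e^{j\theta_{MJ}} + j - |m_J|$, as implied by equations (435) and (436).

```python
import cmath
import math


def complex_quantum_number(j, m_J, theta_MJ):
    """Complex quantum number |m_J| cos(theta) e^{j theta} + j - |m_J|."""
    a = abs(m_J)
    return a * math.cos(theta_MJ) * cmath.exp(1j * theta_MJ) + j - a


def closed_form(j, m_J, theta_MJ):
    """Magnitude and phase from the closed-form expressions.

    The phase follows equation (437); the magnitude is the square-root
    expression that follows it in the text.
    """
    a = abs(m_J)
    s2 = math.sin(theta_MJ) ** 2
    magnitude = j * math.sqrt(1 - (a / j) * (2 - a / j) * s2)
    phase = math.atan2(a * math.sin(theta_MJ) * math.cos(theta_MJ), j - a * s2)
    return magnitude, phase
```

For $\theta_{MJ} = 0$ the complex quantum number reduces to the ordinary value $j$, which is a quick sanity check on both routines.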


Because  both  spin  and  orbital  angular  momentum  are  associated  with  rotations  it 
follows  that  for  external  motion 


°ML  %L 


0  =  -  0 
MS  4>S 


(447) 


where $\theta_{ML}$, $\theta_{MS}$ and $\theta_{MJ}$ are constants.


A theory of the addition of internal spin and internal orbital angular momentum
can be developed, except that for this case the basic representation in
equation  (394)  is  not  valid  when  0^  and  8^  are  variables  for  a  fixed  value  of  m. 
7. CONCLUSION. The broken symmetry of spacetime requires that the momentum,
angular momentum and energy operators are complex numbers in internal space.
These operators can be expressed in terms of the measured space and time
coordinates by noting that the measured coordinates are the real parts of the
complex number coordinates. The energy eigenvalues and eigenfunctions of the 3-D
linear harmonic oscillator are obtained for broken symmetry space and time and
it is found that the principal and magnetic quantum numbers are complex numbers
whose difference must be an integer. The momentum operator, energy operator
and Schrödinger's equation can be developed for the case of pure internal phase
rotations with constant coordinate magnitude, and the equations for an internal
space harmonic oscillator and for a particle in an internal space box can be
developed. A muthon particle must exist to account for the conservation of
angular momentum in space and time with a broken azimuthal symmetry. This particle
may play a role in the explanation of high temperature superconductivity in the
planar copper oxides where broken azimuthal symmetry may be associated with a
coherent time field and a raised level of the superconductivity energy gap.
GAUGE THEORY OF TIME


Richard  A.  Weiss 

U.  S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi  39180 


ABSTRACT. A gauge invariant differential equation is developed that relates
time to the pressure and energy density of a thermodynamic system. This
renormalization group equation for time is developed as a relativistic trace
equation that involves the same gauge parameters that are used in the
relativistic trace equation for the internal energy. In conjunction with the time
equation and the internal energy equation, two equations giving the dependence of
the dimensions of space and time on the local energy density and pressure are
developed. Four simultaneous equations must therefore be solved to determine
the reaction rates of processes that occur in real media. The results of these
calculations indicate that physical processes occurring in a pressure and energy
density field associated with an ambient background material will run more
slowly than the reaction rates predicted by conventional calculations. This is due
to the non-ideal character of the state equation of the background substance;
for ideal state equations the renormalized and conventionally calculated
reaction rates are equal. The scale and gauge invariance requirement of the
equation of time gives a set of equations that describe the time evolution of an
interacting thermodynamic system in terms of the fundamental gauge parameters
of the equation of time. The effects of mechanical or electromagnetic radiation
on the equation of time are treated by a perturbation technique. Applications
to solids, quantum liquids and real gases are considered. A theory of
coherent time states in matter is developed and applied to the development of
a theory of high temperature superconductivity in the planar copper oxides.
This theory predicts that the value of the normalized superconductivity energy
gap $2\Delta/kT_c$ should be $6/\pi$ times greater than the value of the normalized energy
gap predicted by the BCS theory for conventional superconductors.

1. INTRODUCTION. The nature of time is still a mystery today although
theories about time date back to the ancient civilizations.$^{1-27}$ The Babylonians,
Greeks and Romans believed in cyclic time and the theme of recurrence
appears repeatedly in their literature.$^{9-11}$ Aristotle and Plato both believed
in cyclic time. With the advent of Judaism and Christianity came the idea
that events occurred in linear time.$^{9,11}$ Only with the development of the
mechanical clock did time acquire an abstract nature of mathematically measured
sequences, and time as an abstract parameter was introduced by Galileo
in his studies of the motion of bodies.$^{9,11}$ Newton and Leibniz introduced
the concept of instantaneous velocity, and it was Newton who introduced the
idea of absolute time that flows uniformly without any relation to external
things.$^{11}$ However, it was Einstein who made the important discovery that time
is in fact local and dependent on the relative velocity of observers.$^{2-14,17,18,22-27}$
It was also Einstein who demonstrated that time slows down in the
presence of a gravitational field by predicting the gravitational red shift
of spectral lines in light from stars like the sun.$^{5,17}$ Time also plays an
important role in the rhythms and processes of biological systems.$^{5,17}$ In
thermodynamics time is closely related to the second law of thermodynamics for
irreversible processes where the concept of time's arrow appears.$^{1-4,7,8,22,24}$
Cosmology is intimately related to the concepts of time, and it has been
suggested that the expansion of the universe is the prime cause of the arrow of
time in thermodynamic systems.$^{7,11,16-18}$ In quantum gravity theory the concept
of time itself becomes nebulous and may not be definable in the early universe.$^{28}$
Time and thermodynamics are closely intertwined. A deficiency in the present
day concepts of time is the lack of a theory that describes the effects of
ambient bulk matter and energy on time, i.e., a theory that describes the effects
of pressure and temperature on time in gases, liquids and solids.

This paper develops a thermodynamic gauge theory of time that is based on
a previously developed relativistic gauge theory of thermodynamics.$^{29}$ This
gauge theory of bulk matter and energy is formulated by means of a relativistic
trace equation which can be written as follows$^{29}$

$E + \beta_E - 3\beta_P = E^a + \beta_E^a$    (1)

where the gauge functions $\beta_E$, $\beta_P$ and $\beta_E^a$ are given by

$\beta_E = (T/V)(\partial U/\partial T)_{PV} = (T/V)\, C_{PV}$    (2)

$\beta_P = \partial/\partial V\,(PV)_U = P - K_U$    (3)

$\beta_E^a = (T/V)(\partial U^a/\partial T)_{P^a V}$    (4)

and where $E$, $U$ and $P$ = renormalized energy density, internal energy and pressure
respectively; $E^a$, $U^a$ and $P^a$ = unrenormalized energy density, internal energy
and pressure respectively; $T$ and $V$ = absolute temperature and volume
respectively, and the heat capacity at constant $PV$ is given by

$C_{PV} = (\partial U/\partial T)_{PV}$    (5)

and where the bulk modulus at constant $U$ is given by

$K_U = -V(\partial P/\partial V)_U = n(\partial P/\partial n)_U$    (6)

where $n = 1/V$. Equation (1) requires the renormalized state equation to be
softer at high densities than the state equation predicted by the unrenormalized
calculation.$^{29}$ Figure 1 shows a comparison of the average energy per
particle for the renormalized and conventional neutron star matter state equations.
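
The gauge function $\beta_P$ of equation (3) can be evaluated numerically for a given state equation. The sketch below is an illustration only: the helper names, the finite-difference step, the units ($N = k = 1$) and the van der Waals toy model with $U = \tfrac{3}{2}T - a/V$ are all assumptions introduced here, not quantities taken from the paper. It evaluates $\partial(PV)/\partial V$ at constant $U$ and compares it with $P - K_U$ from equation (6).

```python
def partial(f, x, y, wrt, h=1e-6):
    """Central finite difference of f(x, y) with respect to 'x' or 'y'."""
    if wrt == "x":
        dx = h * abs(x)
        return (f(x + dx, y) - f(x - dx, y)) / (2 * dx)
    dy = h * abs(y)
    return (f(x, y + dy) - f(x, y - dy)) / (2 * dy)


def beta_P(P, U, T, V):
    """Equation (3): beta_P = d(PV)/dV taken at constant U."""
    def W(T_, V_):
        return P(T_, V_) * V_
    W_T, W_V = partial(W, T, V, "x"), partial(W, T, V, "y")
    U_T, U_V = partial(U, T, V, "x"), partial(U, T, V, "y")
    # Holding U fixed ties the temperature change to dT = -(U_V / U_T) dV.
    return W_V - W_T * U_V / U_T


def bulk_modulus_U(P, U, T, V):
    """Equation (6): K_U = -V (dP/dV) taken at constant U."""
    P_T, P_V = partial(P, T, V, "x"), partial(P, T, V, "y")
    U_T, U_V = partial(U, T, V, "x"), partial(U, T, V, "y")
    return -V * (P_V - P_T * U_V / U_T)


# Toy state equations in units with N = k = 1 (assumed for illustration).
def P_ideal(T, V):
    return T / V


def U_ideal(T, V):
    return 1.5 * T


def P_vdw(T, V, a=0.5, b=0.1):
    return T / (V - b) - a / V ** 2


def U_vdw(T, V, a=0.5):
    return 1.5 * T - a / V
```

For the ideal gas the sketch gives $\beta_P = 0$ to within the finite-difference error, in line with the null effect expected for ideal systems, while the van der Waals model gives a small nonzero value equal to $P - K_U$.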


A generalization of equation (1) to thermodynamic systems with broken
internal symmetry has been proposed for partially coherent matter in the form$^{30}$

$\bar E + \bar\beta_E - 3\bar\beta_P = E^a + \beta_E^a$    (7)
where the complex number gauge functions $\bar\beta_E$ and $\bar\beta_P$ are given by

$\bar\beta_E = (T/V)(\partial \bar U/\partial T)_{\bar P V} \qquad \bar\beta_P = \partial/\partial V\,(\bar P V)_{\bar U}$    (8)

and $\bar U$ and $\bar P$ = complex number internal energy and pressure respectively. The
vacuum equations corresponding to equations (1) and (7) are respectively$^{30}$


$E^v + \beta_E^v - 3\beta_P^v = 0$    (9)

$\bar E^v + \bar\beta_E^v - 3\bar\beta_P^v = 0$    (10)

Equation (9) determines $E^v$ and $P^v$ while equation (10) determines $E^v$, $\theta_{Ev}$ and
$P^v$, $\theta_{Pv}$. Thus $E^v$ and $P^v$ have finite values for the vacuum.$^{30}$ Equations (1),
(7), (9) and (10) also have zero-temperature forms which are written as follows


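$E_o - 3\beta_P^o = E_o^a \qquad \bar E_o - 3\bar\beta_P^o = E_o^a$    (11)

$E_o^v - 3\beta_P^{ov} = 0 \qquad \bar E_o^v - 3\bar\beta_P^{ov} = 0$    (12)

where$^{29,30}$

$\beta_P^o = (1 + \gamma_o)P_o - K_o \qquad \bar\beta_P^o = (1 + \bar\gamma_o)\bar P_o - \bar K_o$    (13)

$\beta_P^{ov} = (1 + \gamma_o^v)P_o^v - K_o^v \qquad \bar\beta_P^{ov} = (1 + \bar\gamma_o^v)\bar P_o^v - \bar K_o^v$    (14)

where $\gamma_o$, $K_o$ and $\bar\gamma_o$, $\bar K_o$ = scalar and complex number values of the
zero-temperature Grüneisen parameter and bulk modulus of matter respectively, and
$\gamma_o^v$, $K_o^v$ and $\bar\gamma_o^v$, $\bar K_o^v$ = corresponding values of these quantities for
the vacuum. The zero-temperature values of the bulk modulus are given by

$K_o = n\, dP_o/dn \qquad \bar K_o = n\, d\bar P_o/dn \qquad K_o^v = n\, dP_o^v/dn \qquad \bar K_o^v = n\, d\bar P_o^v/dn$    (15)

For $T = 0$ the value of $K_U$ in equation (6) is$^{29}$

$K_U^o = K_o - \gamma_o P_o \qquad \bar K_U^o = \bar K_o - \bar\gamma_o \bar P_o$    (16)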

The zero-temperature Grüneisen parameter $\gamma_o$ is the gauge parameter for the $T = 0$
theory.$^{29,30}$ For systems with $PV = aU$ or $\bar P V = a\bar U$ where $a$ = constant it follows
that

$\beta_E = 0 \qquad \beta_P = 0 \qquad E = E^a$

$\bar\beta_E = 0 \qquad \bar\beta_P = 0 \qquad \bar E = E^a = \text{real number}$    (17)
Equation (17) describes the ideal classical gas and the ideal nonrelativistic
and ultrarelativistic Fermi gases. It is plausible that the gauge theory of
time that will be developed in this paper must incorporate the functions $\beta_E$ and
$\beta_P$ as coefficients in the fundamental equation of time in a manner similar to
the way they are used in the energy trace equation (1).

It has been suggested that the complex number trace equation (7) with its
complex number internal energy and pressure implies that the coordinates of
space and time must also exhibit broken internal symmetries and be represented
as complex numbers.$^{30}$ The internal energy, pressure and spacetime coordinates
can then be written as$^{30}$

$\bar U = U e^{j\theta_U} \qquad \bar P = P e^{j\theta_P}$    (18)

$\bar x = x e^{j\theta_x} \qquad \bar y = y e^{j\theta_y}$    (19)

$\bar z = z e^{j\theta_z} \qquad \bar t = t e^{j\theta_t}$    (20)

where $U$, $\theta_U$, $P$, $\theta_P$, $x$, $\theta_x$, $y$, $\theta_y$, $z$, $\theta_z$ and $t$, $\theta_t$ = magnitudes and internal
phase angles of the internal energy, pressure, spatial coordinates and time
respectively.$^{30}$ Time varying processes are described in this paper as being
incoherent (or linear) if $dt \gg t\, d\theta_t$ and coherent (or circular) if $dt \ll t\, d\theta_t$. In
the general case $dt \sim t\, d\theta_t$ and time both stretches and rotates with changes in
the energy density and pressure of matter.
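
The distinction between incoherent and coherent time can be made explicit by differentiating the polar form of the complex time in equation (20). The short derivation below is a worked illustration added for clarity; it uses only the definition $\bar t = t e^{j\theta_t}$ and the product rule.

```latex
\begin{align}
  d\bar t &= e^{j\theta_t}\,dt + j\,t\,e^{j\theta_t}\,d\theta_t
           = e^{j\theta_t}\left(dt + j\,t\,d\theta_t\right), \\
  |d\bar t|^2 &= dt^2 + t^2\,d\theta_t^2 , \\
  dt \gg t\,d\theta_t &\;\Rightarrow\; d\bar t \approx e^{j\theta_t}\,dt
     \quad\text{(incoherent or linear time)}, \\
  dt \ll t\,d\theta_t &\;\Rightarrow\; d\bar t \approx j\,\bar t\,d\theta_t
     \quad\text{(coherent or circular time)}.
\end{align}
```

The first term stretches the magnitude of time at fixed phase, and the second rotates the phase at fixed magnitude, which is the sense in which time "both stretches and rotates" in the general case.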

Equation (1) with its gauge functions $\beta_E$ and $\beta_P$ was developed in part to
account for the need to have a softer equation of state of nuclear matter and
the neutron gas.$^{29}$ This paper will use the gauge functions $\beta_E$ and $\beta_P$ to develop
renormalization group equations for coherent and incoherent time whose solutions
determine the reduced rate of processes that occur within the energy density and
pressure fields of real (non-ideal) thermodynamic systems. Time is locally
dependent on the energy density and pressure of the ambient matter, $t = t(E,P)$ or
$\bar t = \bar t(E,P)$, and is determined by relativistic renormalization group equations.
These calculations do not change the rate of processes that occur within an ideal
ambient thermodynamic system. A theory of coherent (or circular) time is
developed and applied to high temperature superconductivity in the planar copper
oxides.

This paper is organized as follows: Section 2 develops a gauge theory of
scalar and complex number time and presents a basic partial differential
equation for time in bulk matter and the vacuum. Section 3 treats the differential
equations that determine the dimensions of space and time in ambient bulk
matter and energy. Section 4 considers scale and gauge invariance of the equation
of time and its connection with nonequilibrium thermodynamics. Section 5 studies
the time equation for solids, quantum liquids and real gases, Section 6
considers the effects of vibrations on the determination of time in matter, and
finally Section 7 investigates coherent time states in matter and their
application to high temperature superconductivity.
2. GAUGE THEORY OF TIME. This section develops a relativistic partial
differential equation of time that describes the dependence of time on the local
energy density and pressure of matter and energy. Both scalar and complex number
equations of time are developed which can be expressed in terms of the
temperature and particle number density of matter.

A.  Scalar  Equation  of  Time. 

The chosen form of the equation of time is based on five conditions: a) that
it be linear in time, b) that it contain the Minkowski spacetime signature, c) that
it involve the gauge functions $\beta_E$ and $\beta_P$ so that the time equation affects real
thermodynamic systems and produces a null effect for ideal systems in a manner
similar to that of the energy trace equation (1), d) that the effect in real gases
occur in the third and higher virial time coefficients in order to agree with
the effects of the energy trace equation (1) on the real gases, and e) that the
time equation be scale and gauge invariant. These conditions and an inspection
of the form of the energy trace equation (1) suggest that the proper equation
for incoherent (scalar) time is

$t - \beta_E\, \partial t/\partial E + 3\beta_P\, \partial t/\partial P = t^a - \beta_E^a\, \partial t^a/\partial E^a$    (21)

where $t$ = renormalized time, and $t^a$ = unrenormalized time. Equation (21) can be
written in operator form as

$\mathcal C t = t^a - \beta_E^a\, \partial t^a/\partial E^a$    (22)

where the time operator $\mathcal C$ is given by

$\mathcal C = 1 - \beta_E\, \partial/\partial E + 3\beta_P\, \partial/\partial P$    (23)

Equation (21) is a renormalization group equation of time that treats time as a
function of the local energy density and pressure, $t = t(E,P)$. The
zero-temperature version of the time equation (21) is written as

$t_o + 3\beta_P^o\, dt_o/dP_o = t_o^a$    (24)

where $t_o$ and $t_o^a$ = zero-temperature values of the renormalized and unrenormalized
time, $P_o$ = zero-temperature value of the pressure, and $\beta_P^o$ is given by equation
(13). From equation (21) it follows that


$t > t^a \qquad \beta_P < 0$    high density and temperature    (25)

$t < t^a \qquad \beta_P > 0$    low density and temperature    (26)

while equation (24) gives

$t_o > t_o^a \qquad \beta_P^o < 0$    high density    (27)

$t_o < t_o^a \qquad \beta_P^o > 0$    low density    (28)


Figure 2 shows the relative behaviour of $t$ and $t^a$ in terms of density. For ideal
systems $\beta_E = 0$, $\beta_P = 0$ and $\beta_E^a = 0$ so that equation (21) gives $t = t^a$ and
equation (24) gives $t_o = t_o^a$ for an ideal $T = 0$ system.

According to equation (21) the renormalized time depends on the local energy
density and pressure of matter and energy. At high densities and temperatures
of interacting matter and energy the renormalized time slows down (the time
interval between two events is larger) relative to the unrenormalized time evaluated
at the same energy density and pressure. Accordingly, the rates of processes
that occur at high pressures and temperatures are expected to be slower than is
conventionally predicted. Equation (21) gives the renormalized time in the
presence of ambient matter and energy. For the vacuum equation (21) becomes

$t^v - \beta_E^v\, \partial t^v/\partial E^v + 3\beta_P^v\, \partial t^v/\partial P^v = t^{va}$    (29)

where the renormalized vacuum energy density $E^v$ and pressure $P^v$ are obtained
from equation (9) as the homogeneous solution to the energy trace equation.
Because $t^{va}$ = constant, equation (29) can be rewritten as

$t^{v\prime} - \beta_E^v\, \partial t^{v\prime}/\partial E^v + 3\beta_P^v\, \partial t^{v\prime}/\partial P^v = 0$    (30)

by a change of variable $t^{v\prime} = t^v - t^{va}$. Equations (9) and (30) show that $t^v$,
$E^v$ and $P^v$ have nonzero values for the vacuum. The general solution of equation
(21) can be written as

$t = t_p(E, P, t^a) + t^{v\prime}(E^v, P^v)$    (31)

where $t_p$ = particular solution of equation (21). The energy trace equation (1)
can be used to determine $E(n,T)$ and $P(n,T)$, and if $t^a = t^a(n,T)$ and $t^v = t^v(n,T)$
it follows that

$t = t_p(n,T) + t^{v\prime}(n,T)$    (32)

where $n$ = particle number density of ambient matter. The constant $t^{va}$ =
characteristic time for a process occurring when the ambient medium is the vacuum.

B.  Complex  Number  Time  Equation. 


It has been suggested that time is associated with an internal phase and
must be written as in equation (20). Therefore the simplest generalization of
the scalar time equation (21) is the following complex number equation for
partially coherent time

$\bar t - \beta_E\, \partial \bar t/\partial E + 3\beta_P\, \partial \bar t/\partial P = t^a - \beta_E^a\, \partial t^a/\partial E^a$    (33)

The solution of equation (33) gives $t$ and $\theta_t$ in terms of energy density and
pressure. The complex number equation of time for the vacuum is obtained from
equation (33) with $t^{va}$ = constant to be

$\bar t^{v\prime} - \beta_E^v\, \partial \bar t^{v\prime}/\partial E^v + 3\beta_P^v\, \partial \bar t^{v\prime}/\partial P^v = 0$    (34)

where the constant value of $t^{va}$ is introduced as $\bar t^{v\prime} = \bar t^v - t^{va}$. The solution
to equation (34) gives $t^v$ and $\theta_t^v$. The general solution of equation (33) can
be written as the sum of a particular and a homogeneous (vacuum) solution as
follows

$\bar t = \bar t_p(n,T) + \bar t^{v\prime}(n,T)$    (35)

A more complicated generalization of equation (21) would incorporate complex
number energy density and pressure as follows

$\bar t - \bar\beta_E\, \partial \bar t/\partial \bar E + 3\bar\beta_P\, \partial \bar t/\partial \bar P = t^a - \beta_E^a\, \partial t^a/\partial E^a$    (36)

Correspondingly, a more complicated vacuum time equation is obtained from
equation (36) to be

$\bar t^{v\prime} - \bar\beta_E^v\, \partial \bar t^{v\prime}/\partial \bar E^v + 3\bar\beta_P^v\, \partial \bar t^{v\prime}/\partial \bar P^v = 0$    (37)

Neither equation (36) nor (37) is considered in this paper. The complex number
time equation (33) will be used in Section 7 to introduce the concept of
coherent time states in matter and to describe high temperature superconductivity
in the planar copper oxides.

C.  State  Equation  for  Time  and  Alternate  Form  of  Time  Equation. 

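In this subsection the dependence of time on density and temperature is
considered and the equation of time is rewritten in terms of these parameters.
For instance, the chain rule for derivatives gives

$\partial t/\partial n = \partial t/\partial E\; \partial E/\partial n + \partial t/\partial P\; \partial P/\partial n$    (38)

$\partial t/\partial T = \partial t/\partial E\; \partial E/\partial T + \partial t/\partial P\; \partial P/\partial T$    (39)

where $n$ = particle number density and $T$ = absolute temperature. In fact $E$ and
$P$ are more fundamental quantities than $n$ and $T$, but $n$ and $T$ are conventionally
used to write the state equations for matter. From equations (38) and (39) it
follows that

$\partial t/\partial P = e\, \partial t/\partial n - f\, \partial t/\partial T$    (40)

$\partial t/\partial E = h\, \partial t/\partial T - g\, \partial t/\partial n$    (41)

where

$e = (1/D_e)\, \partial E/\partial T \qquad f = (1/D_e)\, \partial E/\partial n$    (42)

$h = (1/D_e)\, \partial P/\partial n \qquad g = (1/D_e)\, \partial P/\partial T$    (43)

$D_e = \partial P/\partial n\; \partial E/\partial T - \partial P/\partial T\; \partial E/\partial n$    (44)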

Combining equations (40) and (41) with the equations of time (21) and (33) gives
the following alternative forms for the equations of time

$t - q_1\, \partial t/\partial T + s_1\, \partial t/\partial n = t^a - q^a\, \partial t^a/\partial T + s^a\, \partial t^a/\partial n$    (45A)

$\bar t - q_1\, \partial \bar t/\partial T + s_1\, \partial \bar t/\partial n = t^a - q^a\, \partial t^a/\partial T + s^a\, \partial t^a/\partial n$    (45B)

where

$q_1 = h\beta_E + 3f\beta_P$    (46)

$s_1 = g\beta_E + 3e\beta_P$    (47)

$q^a = h^a \beta_E^a$    (48)

$s^a = g^a \beta_E^a$    (49)

where $h^a$ and $g^a$ are calculated in a manner identical to that in equations (43)
and (44) except that the superscript "a" is added to $E$ and $P$. Solutions to
equations (45A) and (45B) are useful for astrophysical and geophysical problems
because having a state equation for time of the form $t = t(n,T)$ and $\theta_t = \theta_t(n,T)$
leads to a calculation of the rate of change of time with radial distance as
follows

$\partial t/\partial r = \partial t/\partial T\; \partial T/\partial r + \partial t/\partial n\; \partial n/\partial r$    (50A)

$\partial \theta_t/\partial r = \partial \theta_t/\partial T\; \partial T/\partial r + \partial \theta_t/\partial n\; \partial n/\partial r$    (50B)

where $\partial T/\partial r$ and $\partial n/\partial r$ are calculated from stellar equilibrium equations.
Equation (45B) is used in Section 7 to describe coherent time states and high
temperature superconductivity.
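
The change of variables from $(E, P)$ to $(n, T)$ in equations (38) through (47) amounts to inverting a 2×2 Jacobian. The following sketch is an illustration only, with hypothetical function names and arbitrary numerical inputs: it builds the coefficients $e$, $f$, $h$, $g$ and $D_e$, recovers $\partial t/\partial E$ and $\partial t/\partial P$ from the $(n, T)$ derivatives, and forms $q_1$ and $s_1$.

```python
def inverse_chain_coefficients(E_n, E_T, P_n, P_T):
    """Coefficients e, f, h, g and determinant D_e of equations (42)-(44)."""
    D_e = P_n * E_T - P_T * E_n
    if D_e == 0:
        raise ValueError("E(n,T) and P(n,T) are not independent at this point")
    return E_T / D_e, E_n / D_e, P_n / D_e, P_T / D_e, D_e


def to_EP_derivatives(t_n, t_T, E_n, E_T, P_n, P_T):
    """Equations (40)-(41): recover dt/dE and dt/dP from dt/dn and dt/dT."""
    e, f, h, g, _ = inverse_chain_coefficients(E_n, E_T, P_n, P_T)
    t_P = e * t_n - f * t_T
    t_E = h * t_T - g * t_n
    return t_E, t_P


def q1_s1(beta_E, beta_P, E_n, E_T, P_n, P_T):
    """Equations (46)-(47): q_1 = h beta_E + 3 f beta_P, s_1 = g beta_E + 3 e beta_P."""
    e, f, h, g, _ = inverse_chain_coefficients(E_n, E_T, P_n, P_T)
    return h * beta_E + 3 * f * beta_P, g * beta_E + 3 * e * beta_P
```

With these coefficients the left side of equation (21), $t - \beta_E\, \partial t/\partial E + 3\beta_P\, \partial t/\partial P$, and the left side of equation (45A), $t - q_1\, \partial t/\partial T + s_1\, \partial t/\partial n$, agree term by term, which is a convenient consistency check when the state equations are supplied numerically.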

D.  Formal  Solution  for  the  Zero-Temperature  Scalar  Time  Equation. 

The solutions of the time equations (21) and (33) or (45A) and (45B) are
generally difficult to obtain. However, the solution to the zero-temperature
time equation (24) can be obtained formally because of its simple form.
Standard methods show that equation (24) has the following solution$^{31}$

$t_o = c_o\, e^{\phi(n)} + \tfrac{1}{3}\, e^{\phi(n)} \int (G_o'/H_o')\, dn'/n'$    (51A)

where

$G_o' = t_o^a(n')\, e^{-\phi(n')} \qquad H_o' = \beta_P^o(n')/K_o(n')$    (51B)

$\phi(n) = -\tfrac{1}{3} \int (1/H_o)\, dn/n \qquad H_o = \beta_P^o(n)/K_o(n)$    (52)

where the zero-temperature bulk modulus $K_o$ is given by equation (15) and where
from equation (13)

$H_o = \beta_P^o(n)/K_o(n) = (1 + \gamma_o)P_o/K_o - 1$    (53)

The zero-temperature vacuum solution is

$t_o^v = c_o\, e^{\phi(n)} + t_o^{va}$    (54)

Finally, because of the appearance of the renormalized quantities $P_o$, $K_o$ and $\gamma_o$
in equation (51) it is clear that a prerequisite for the solution of the
zero-temperature time equation (24) is the solution of the zero-temperature energy
trace equation (11).
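
The formal solution can be illustrated numerically. Using $dP_o = K_o\, dn/n$ from equation (15), the zero-temperature time equation (24) becomes the first-order linear equation $t_o + 3H_o\, n\, dt_o/dn = t_o^a$ with $H_o = \beta_P^o/K_o$, which is solved by an integrating factor. The sketch below is a minimal illustration under stated assumptions: the function name, the trapezoid rule, the lower limit $n_0$ of both integrals, and the toy inputs for $H_o(n)$ and $t_o^a(n)$ are all choices made here rather than taken from the paper.

```python
import math


def solve_zero_temperature_time(H, t_a, n0, n1, c0, steps=20000):
    """Evaluate the integrating-factor solution of t + 3 H n dt/dn = t_a.

    H(n)   -- the ratio beta_P^o / K_o of equation (53)
    t_a(n) -- unrenormalized zero-temperature time
    c0     -- integration constant; equals t_o at n = n0 in this sketch
    """
    dn = (n1 - n0) / steps
    phi = 0.0        # phi(n) = -(1/3) * integral of dn / (H n), equation (52)
    integral = 0.0   # integral of t_a * exp(-phi) / H * dn / n
    n = n0
    f_phi = -1.0 / (3.0 * H(n) * n)
    f_int = t_a(n) * math.exp(-phi) / (H(n) * n)
    for _ in range(steps):
        n_next = n + dn
        f_phi_next = -1.0 / (3.0 * H(n_next) * n_next)
        phi_next = phi + 0.5 * dn * (f_phi + f_phi_next)   # trapezoid rule
        f_int_next = t_a(n_next) * math.exp(-phi_next) / (H(n_next) * n_next)
        integral += 0.5 * dn * (f_int + f_int_next)
        n, phi, f_phi, f_int = n_next, phi_next, f_phi_next, f_int_next
    return math.exp(phi) * (c0 + integral / 3.0)
```

For constant $H_o$ and constant $t_o^a$ the integrals can be done by hand, giving $t_o = t_o^a + (c_o - t_o^a)(n/n_0)^{-1/(3H_o)}$, which provides a simple check on the numerical routine.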

E.  Renormalized  Process  Rates. 


The simplest calculation of renormalized process rates is to assume $t$ =
characteristic renormalized time for a process, and $t^a$ = characteristic
unrenormalized time for a process. Then the corresponding rates are given by

$R = 1/t \qquad R^a = 1/t^a$    (55)

where $R$ = renormalized process rate, and $R^a$ = rate given by conventional
calculation. Placing equation (55) into equation (21) gives

$(1/R^2)(R + \beta_E\, \partial R/\partial E - 3\beta_P\, \partial R/\partial P) = (1/R^a)^2 (R^a + \beta_E^a\, \partial R^a/\partial E^a)$    (56)

Equation (56) gives the renormalized reaction rate of a process in terms of the
conventionally calculated reaction rate. The reaction rate solution of
equation (56) is of the form $R = R(E,P)$. For an ideal state equation of the
ambient matter in which the reaction is occurring $\beta_E = 0$ and $\beta_P = 0$ so that for
this case $R = R^a$. For $T = 0$ the reaction rate equation (56) becomes

$(1/R_o^2)(R_o - 3\beta_P^o\, dR_o/dP_o) = 1/R_o^a$    (57)
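
The step from the time equation to the rate equation is a direct substitution. The short derivation below is added as a worked illustration; it uses only $R = 1/t$ and $R^a = 1/t^a$ from equation (55) together with the scalar time equation (21).

```latex
\begin{align}
  t = \frac{1}{R} \;&\Rightarrow\;
    \frac{\partial t}{\partial E} = -\frac{1}{R^2}\frac{\partial R}{\partial E},
    \qquad
    \frac{\partial t}{\partial P} = -\frac{1}{R^2}\frac{\partial R}{\partial P}, \\
  t - \beta_E \frac{\partial t}{\partial E} + 3\beta_P \frac{\partial t}{\partial P}
    &= \frac{1}{R^2}\left(R + \beta_E \frac{\partial R}{\partial E}
       - 3\beta_P \frac{\partial R}{\partial P}\right), \\
  t^a - \beta_E^a \frac{\partial t^a}{\partial E^a}
    &= \frac{1}{(R^a)^2}\left(R^a + \beta_E^a \frac{\partial R^a}{\partial E^a}\right).
\end{align}
```

Equating the last two lines reproduces the rate equation (56); the same substitution in the zero-temperature time equation (24) gives equation (57).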


From equation (56) it follows that

$R < R^a$    high density interacting Fermi gases; high temperature real gases    (58)

$R > R^a$    low density Fermi gases; low temperature real gases    (59)

Equation (57) gives

$R_o < R_o^a$    high density $T = 0$ interacting Fermi gas    (60)

$R_o > R_o^a$    low density $T = 0$ interacting Fermi gas    (61)

For ideal systems $R = R^a$ and $R_o = R_o^a$. An analysis similar to that given in
equations (38) through (44) gives


$\partial R/\partial P = e\, \partial R/\partial n - f\, \partial R/\partial T$    (62)

$\partial R/\partial E = h\, \partial R/\partial T - g\, \partial R/\partial n$    (63)

so that the reaction rate equation (56) can be written as

$(1/R^2)(R + q_1\, \partial R/\partial T - s_1\, \partial R/\partial n) = (1/R^a)^2 (R^a + q^a\, \partial R^a/\partial T - s^a\, \partial R^a/\partial n)$    (64)

where $q_1$, $s_1$, $q^a$ and $s^a$ are given by equations (46) through (49) respectively.
The solution of equation (64) gives the renormalized reaction rate as $R = R(n,T)$.
Finally, it should be pointed out that the definition of reaction rate given by
equation (55) is a simplified version of the more general definition

$R = dN/dt \qquad R^a = dN/dt^a$    (65)

where $N$ = species particle number. From equation (65) it follows that the
renormalized incoherent reaction rate is

$R = R^a/(dt/dt^a)$    (66)

where $t$ is given by the solution of the time equations (21) or (45). Equation
(64) is useful when only a characteristic time is known for a process from
which the rates can be estimated by an inverse time as in equation (55).
Equation (66) is useful when a formal reaction rate function $R^a$ given in equation
(65) is known for a process. Figure 3 shows the relative behaviour of $R$ and $R^a$
in terms of particle number density. The complex number rate equations
analogous to equations (56) and (64) are obtained from the replacement $R \to \bar R$. For
this case equation (66) becomes

$\bar R = R^a/(d\bar t/dt^a)$    (66A)

For completely coherent time variation $d\bar t = j\,\bar t\, d\theta_t$ and

$\bar R = R^a/(j\,\bar t\, d\theta_t/dt^a)$    (66B)

a result which will be examined in Section 7.
3. DIMENSIONS OF SPACE AND TIME. This section considers the possibility
of the dependence of the basic dimensions of space and time on the energy
density and pressure of the ambient matter and energy. It is not a priori clear
that the dimensions of space and time are independent of the local pressure and
energy density. This section develops a set of four coupled partial differential
equations for the energy density, time, space dimension, and time dimension.
At low energy density and pressure common experience suggests that

$D_s \sim 3 \qquad D_t \sim 1$    (67)

where $D_s$ = renormalized space dimension and $D_t$ = renormalized time dimension.
Equation (67) may not be valid for high densities and temperatures. In this
paper it will be assumed that $D_s = D_s(n,T)$ and $D_t = D_t(n,T)$ are unknown
functions of particle number density and temperature that need to be determined
along with $E(n,T)$ and $t(n,T)$ by a set of four coupled renormalization group
equations that are similar in form to the energy trace equation (1) and the
equation of time (21). It is sometimes convenient to write

$D_s = 3 + d_s \qquad D_t = 1 + d_t$    (68)

$D_s^a = 3 + d_s^a \qquad D_t^a = 1 + d_t^a$    (69)

where $D_s^a$ and $D_t^a$ = unrenormalized space and time dimensions respectively. Space
and time are fractal when $d_s < 0$ and $d_t < 0$.$^{30}$

For reasons similar to those used for the development of equations (1) and
(21) it is suggested that the four coupled energy density, time, time dimension,
and space dimension equations are

$E + D_t\beta_E - D_s\beta_P = E^a + D_t^a\beta_E^a$    (70)

$t - D_t\beta_E\, \partial t/\partial E + D_s\beta_P\, \partial t/\partial P = t^a - D_t^a\beta_E^a\, \partial t^a/\partial E^a$    (71)

$D_t + D_t\beta_E\, \partial D_t/\partial E - D_s\beta_P\, \partial D_t/\partial P = D_t^a + D_t^a\beta_E^a\, \partial D_t^a/\partial E^a$    (72)

$D_s + D_t\beta_E\, \partial D_s/\partial E - D_s\beta_P\, \partial D_s/\partial P = D_s^a + D_t^a\beta_E^a\, \partial D_s^a/\partial E^a$    (73)

The four unknown quantities that are determined by these equations are $E$, $t$,
$D_t$ and $D_s$. For low density matter $D_t = D_t^a = 1$ and $D_s = D_s^a = 3$ and in this
limit the left hand sides of equations (70) through (73) have the Minkowski
signature $(1, -1, -1, -1)$. Using equations (38) through (44) allows
equations (70) through (73) to be written as

E  +  Dc0e  -  Ds0p  =  E3  +  D30a  (74) 

t  -  q2  9t/9T  +  s2  9t/9n  =  t3  -  q3  9t3/9T  +  s3  9t3/9n  (75) 
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Dt  +  q2  3Dt/3T  "  s2  3Dt/3n  “  Dt  +  qD  3Dt/3T  “  SD  3Dt/3n  (76) 

D  +  q  3D  /3T  -  s  3D  /3n  -  Da  +  qa  3D3/3T  -  s3  3DS/3n  (77) 

®  z  s  4  s  s  D  s  [)  s 

where 

$q_2 = hD_t\beta_E + fD_s\beta_P = h(1 + d_t)\beta_E + f(3 + d_s)\beta_P \qquad q_D^a = D_t^a q^a$  (78)

$s_2 = gD_t\beta_E + eD_s\beta_P = g(1 + d_t)\beta_E + e(3 + d_s)\beta_P \qquad s_D^a = D_t^a s^a$  (79)
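
As a minimal numerical sketch of equations (78) and (79), the generalized coefficients can be evaluated once e, f, g, h and the gauge parameters are known. The function name and the sample numbers are illustrative assumptions and do not come from the paper.

```python
def generalized_coefficients(e, f, g, h, beta_E, beta_P, d_t=0.0, d_s=0.0):
    """Return (q2, s2) of equations (78) and (79).

    d_t and d_s are the deviations of the time and space dimensions
    from 1 and 3, so that D_t = 1 + d_t and D_s = 3 + d_s.
    """
    D_t = 1.0 + d_t
    D_s = 3.0 + d_s
    q2 = h * D_t * beta_E + f * D_s * beta_P
    s2 = g * D_t * beta_E + e * D_s * beta_P
    return q2, s2


# Hypothetical sample values; with d_t = d_s = 0 the 3+1 forms are recovered.
q2, s2 = generalized_coefficients(e=0.5, f=0.2, g=0.3, h=0.7,
                                  beta_E=2.0, beta_P=1.0)
print(q2, s2)
```

Setting `d_t` and `d_s` to nonzero values shows directly how a departure from the 3+1 dimensions rescales the two gauge contributions.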

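The rate equations (56) and (64) with space and time dimensions $D_s$ and $D_t$ respectively
become

$(1/R^2)(R + D_t\beta_E\,\partial R/\partial E - D_s\beta_P\,\partial R/\partial P) = (1/R^a)^2(R^a + D_t^a\beta_E^a\,\partial R^a/\partial E^a)$  (80)

$(1/R^2)(R + q_2\,\partial R/\partial T - s_2\,\partial R/\partial n) = (1/R^a)^2(R^a + q_D^a\,\partial R^a/\partial T - s_D^a\,\partial R^a/\partial n)$  (81)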

Equations  (80)  and  (81)  can  replace  equations  (71)  and  (75)  respectively. 

For systems with $PV = aU$, such as the ideal classical gas, the ideal nonrelativistic
Fermi gas and the ideal ultrarelativistic Fermi gas, $\beta_E = \beta_E^a = 0$ and
$\beta_P = \beta_P^a = 0$, and it follows that

$E = E^a \qquad t = t^a \qquad R = R^a$  (82)

$D_t = D_t^a = 1 \qquad D_s = D_s^a = 3$  (83)

$d_t = d_t^a = 0 \qquad d_s = d_s^a = 0$  (84)

The choice $D_t \sim 1$ and $D_s \sim 3$ for low density and pressure systems is due to common
experience. From equations (70) through (73) it follows that


$E = E^a \quad t = t^a \quad R = R^a \quad D_t = D_t^a = 1 \quad D_s = D_s^a = 3$    zero density  (85)

$E > E^a \quad t < t^a \quad R > R^a \quad D_t > D_t^a \quad D_s > D_s^a$    low density  (86)

$E < E^a \quad t > t^a \quad R < R^a \quad D_t < D_t^a \quad D_s < D_s^a$    high density  (87)

$E = E^a \quad t = t^a \quad R = R^a \quad D_t = D_t^a \quad D_s = D_s^a$    infinite density (asymptotic freedom)  (88)


Figures 4 and 5 show the relative variation of $D_t$ and $D_t^a$ and of $D_s$ and $D_s^a$ respectively.
These figures assume a compactification of dimensions to the values
$D_s^a = 3$ and $D_t^a = 1$ in the zero density limit. At high densities the renormalized
dimensions satisfy $D_t < D_t^a$ and $D_s < D_s^a$, so that even at high densities the
renormalized dimensions $D_s$ and $D_t$ may not be too different from the 3+1 Minkowski
spacetime.


The  vacuum  is  a  real  system  with  the  properties 


$E^{va} = 0 \qquad D_s^{va} = 3$  (89)

$t^{va} = \text{constant} \qquad D_t^{va} = 1$  (90)


For  the  vacuum  equations  (70)  through  (73)  become 


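$E^v + D_t^v\beta_E^v - D_s^v\beta_P^v = 0$  (91)

$t^v - D_t^v\beta_E^v\,\partial t^v/\partial E^v + D_s^v\beta_P^v\,\partial t^v/\partial P^v = t^{va}$  (92)

$D_t^v + D_t^v\beta_E^v\,\partial D_t^v/\partial E^v - D_s^v\beta_P^v\,\partial D_t^v/\partial P^v = D_t^{va}$  (93)

$D_s^v + D_t^v\beta_E^v\,\partial D_s^v/\partial E^v - D_s^v\beta_P^v\,\partial D_s^v/\partial P^v = D_s^{va}$  (94)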

The vacuum is a real system that exists at finite values of density and temperature.
In this paper the renormalized vacuum is simply the set of solutions to
equations (70) through (73) with $E^a = 0$ in equation (70). The corresponding
vacuum equations for equations (74) through (77) are

$E^v + D_t^v\beta_E^v - D_s^v\beta_P^v = 0$  (91A)

$t^v - q_2^v\,\partial t^v/\partial T + s_2^v\,\partial t^v/\partial n = t^{va}$  (92A)

$D_t^v + q_2^v\,\partial D_t^v/\partial T - s_2^v\,\partial D_t^v/\partial n = 1$  (93A)

$D_s^v + q_2^v\,\partial D_s^v/\partial T - s_2^v\,\partial D_s^v/\partial n = 3$  (94A)


which allow the calculation of renormalized vacuum parameters. Equations (93),
(94), (93A) and (94A) can be made homogeneous by writing

$D_t^v = 1 + d_t^v \qquad D_s^v = 3 + d_s^v$  (95)

It should be noted that $D_t^{va} = 1$ and $D_s^{va} = 3$. Finally, the vacuum
time equations (92) and (92A) can be made homogeneous by writing $t^{v'} = t^v - t^{va}$
as was done for equation (30). The constant $t^{va}$ = characteristic time for a
reaction occurring when the ambient medium is the vacuum. The time $t^{va}$ depends
only on physical constants and the kinematic parameters of the reaction. The
renormalized vacuum values $E^v$, $t^v$, $D_t^v$ and $D_s^v$ depend on density and temperature.
The homogeneous forms of the vacuum equations (91) through (94) are

$E^v + (1 + d_t^v)\beta_E^v - (3 + d_s^v)\beta_P^v = 0$  (91B)

$t^{v'} - (1 + d_t^v)\beta_E^v\,\partial t^{v'}/\partial E^v + (3 + d_s^v)\beta_P^v\,\partial t^{v'}/\partial P^v = 0$  (92B)

$d_t^v + (1 + d_t^v)\beta_E^v\,\partial d_t^v/\partial E^v - (3 + d_s^v)\beta_P^v\,\partial d_t^v/\partial P^v = 0$  (93B)

$d_s^v + (1 + d_t^v)\beta_E^v\,\partial d_s^v/\partial E^v - (3 + d_s^v)\beta_P^v\,\partial d_s^v/\partial P^v = 0$  (94B)

while  the  homogeneous  forms  of  the  vacuum  equations  (91A)  through  (94A)  are 

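$E^v + (1 + d_t^v)\beta_E^v - (3 + d_s^v)\beta_P^v = 0$  (91C)

$t^{v'} - q_2^v\,\partial t^{v'}/\partial T + s_2^v\,\partial t^{v'}/\partial n = 0$  (92C)

$d_t^v + q_2^v\,\partial d_t^v/\partial T - s_2^v\,\partial d_t^v/\partial n = 0$  (93C)

$d_s^v + q_2^v\,\partial d_s^v/\partial T - s_2^v\,\partial d_s^v/\partial n = 0$  (94C)

where from equations (46), (47), (78) and (79) it follows that

$q_2^v = hD_t^v\beta_E^v + fD_s^v\beta_P^v = h(1 + d_t^v)\beta_E^v + f(3 + d_s^v)\beta_P^v$  (95A)

$s_2^v = gD_t^v\beta_E^v + eD_s^v\beta_P^v = g(1 + d_t^v)\beta_E^v + e(3 + d_s^v)\beta_P^v$  (95B)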


The  zero-temperature  forms  of  equations  (70)  through  (73)  are 

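$E^o - D_s^o\beta_P^o = E_o^a$  (96)

$t^o + D_s^o\beta_P^o\,\partial t^o/\partial P^o = t_o^a$  (97)

$D_t^o - D_s^o\beta_P^o\,\partial D_t^o/\partial P^o = D_{to}^a$  (98)

$D_s^o - D_s^o\beta_P^o\,\partial D_s^o/\partial P^o = D_{so}^a$  (99)

where sometimes it is convenient to use the following notation

$D_t^o = 1 + d_t^o \qquad D_{to}^a = 1 + d_{to}^a$  (100)

$D_s^o = 3 + d_s^o \qquad D_{so}^a = 3 + d_{so}^a$  (101)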


Equations (96) through (99) are four equations for determining the renormalized
values $E^o$, $t^o$, $D_t^o$ and $D_s^o$ for zero-temperature solids and quantum liquids. As
described in Reference 29 an additional equation is required to determine $\gamma_o$
that occurs in equation (96). From equations (96) through (99) it follows that
$E^o$, $t^o$, $D_t^o$ and $D_s^o$ have the same variation with density as is shown in equations
(85) through (87) and in Figures 1, 2, 4 and 5 respectively. The solution
of equations (96) through (99) is generally difficult because of their
coupled nature. An approximate solution for equations (98) and (99) can be found
by first taking $D_s^o = 3$ in equation (96) and solving for $E^o$ and $P^o$ (and $\gamma^o$, with
an additional differential equation coming from a power series expansion in the
temperature as in Reference 29). Then the solutions to equations (98) and (99)
can be written approximately as31

$D_t^o \sim E^o e^{-\phi(n)} - \tfrac{1}{3}\,e^{-\phi(n)} \int G_{Dt}^{o'}/H'\; dn'$  (102A)

$D_s^o \sim F^o e^{-\phi(n)} - \tfrac{1}{3}\,e^{-\phi(n)} \int G_{Ds}^{o'}/H'\; dn'$  (102B)

where

$G_{Ds}^{o'} = e^{+\phi(n')}\,D_{so}^a(n')$  (103)

where $\phi(n)$ and $H$ are given in equations (52) and (51B) respectively. The zero-temperature
vacuum equations are obtained from equations (96) through (99) to be


$E^{ov} - D_s^{ov}\beta_P^{ov} = 0$  (104)

$t^{ov} + D_s^{ov}\beta_P^{ov}\,\partial t^{ov}/\partial P^{ov} = t_o^{va}$  (105)

$D_t^{ov} - D_s^{ov}\beta_P^{ov}\,\partial D_t^{ov}/\partial P^{ov} = 1$  (106)

$D_s^{ov} - D_s^{ov}\beta_P^{ov}\,\partial D_s^{ov}/\partial P^{ov} = 3$  (107)

where $t_o^{va}$ = constant = characteristic time for a process to occur in the vacuum
at zero temperature. Equations (104) through (107) can be made into homogeneous
equations by writing $t^{ov'} = t^{ov} - t_o^{va}$ and

$D_t^{ov} = 1 + d_t^{ov} \qquad D_{to}^{va} = 1 \qquad d_{to}^{va} = 0$  (108)

$D_s^{ov} = 3 + d_s^{ov} \qquad D_{so}^{va} = 3 \qquad d_{so}^{va} = 0$  (109)

and therefore from equations (106) through (109)

$d_t^{ov} \sim E^o e^{-\phi(n)} \qquad d_s^{ov} \sim F^o e^{-\phi(n)}$  (110)

Note that the renormalized zero-temperature vacuum does not have the 3 + 1
geometry. Equations (104) through (107) are four equations for $E^{ov}$, $t^{ov}$,
$D_t^{ov}$ and $D_s^{ov}$ for the renormalized zero-temperature vacuum.

4.  SCALE  AND  GAUGE  INVARIANCE  AND  NONEQUILIBRIUM  THERMODYNAMICS.  This 
section  develops  simple  expressions  for  the  time  evolution  of  an  interacting 
thermodynamic  system  by  considering  the  scale  and  gauge  invariance  of  the  left 
hand  sides  of  the  time  equations  (21)  and  (33).  Suppose  that  for  fixed  energy 
density  and  pressure  a  scale  and  gauge  transformation  of  time  is  made  of  the 
form

$t' = te^{-\phi} = t(1 - \phi + \cdots)$  (111)

$t' = te^{-j\phi} = t(1 - j\phi + \cdots)$  (112)

in  the  left  hand  sides  of  the  time  equations  (21)  and  (33)  respectively  by 
changing  some  physical  parameters  such  as  interparticle  interaction  parameters. 
Using  equation  (111)  and  the  assumed  scale  invariance  of  the  left  hand  side  of 
equation  (21)  gives 

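$(1 - \beta_E'\,\partial/\partial E + 3\beta_P'\,\partial/\partial P)\,te^{-\phi} = e^{-\phi}(1 - \beta_E\,\partial/\partial E + 3\beta_P\,\partial/\partial P)\,t$  (113)

Some elementary calculations show that equation (113) is equivalent to

$\beta_E'(-\partial t/\partial E + t\,\partial\phi/\partial E) + 3\beta_P'(\partial t/\partial P - t\,\partial\phi/\partial P) = -\beta_E\,\partial t/\partial E + 3\beta_P\,\partial t/\partial P$  (114)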
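
The elementary step between equations (113) and (114) can be sketched as follows. Primes denote the transformed gauge parameters, and the only input is the product rule applied to the scaled time $te^{-\phi}$.

```latex
\begin{aligned}
\frac{\partial}{\partial E}\bigl(t e^{-\phi}\bigr)
  &= e^{-\phi}\Bigl(\frac{\partial t}{\partial E} - t\,\frac{\partial \phi}{\partial E}\Bigr),
\qquad
\frac{\partial}{\partial P}\bigl(t e^{-\phi}\bigr)
   = e^{-\phi}\Bigl(\frac{\partial t}{\partial P} - t\,\frac{\partial \phi}{\partial P}\Bigr), \\[4pt]
\Bigl(1 - \beta_E'\frac{\partial}{\partial E} + 3\beta_P'\frac{\partial}{\partial P}\Bigr) t e^{-\phi}
  &= e^{-\phi}\Bigl[\,t
     - \beta_E'\Bigl(\frac{\partial t}{\partial E} - t\frac{\partial \phi}{\partial E}\Bigr)
     + 3\beta_P'\Bigl(\frac{\partial t}{\partial P} - t\frac{\partial \phi}{\partial P}\Bigr)\Bigr].
\end{aligned}
```

Equating this with the right hand side of equation (113), cancelling the common factor $e^{-\phi}$ and the common term $t$ leaves equation (114).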

Assuming  separability,  equation  (114)  can  be  written  as 

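$\beta_E' = \beta_E/[1 - (t\,\partial\phi/\partial E)/(\partial t/\partial E)] \sim \beta_E[1 + (t\,\partial\phi/\partial E)/(\partial t/\partial E)]$  (115)

$\beta_P' = \beta_P/[1 - (t\,\partial\phi/\partial P)/(\partial t/\partial P)] \sim \beta_P[1 + (t\,\partial\phi/\partial P)/(\partial t/\partial P)]$  (116)

which relate the transformed gauge parameters to their original values. The
approximations in equations (115) and (116) are valid for short time scales.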


A power series expansion of $\beta_E'$ and $\beta_P'$ gives

$\beta_E' = \beta_E + (d\beta_E/dt)(t' - t) + \cdots$  (117)

$\beta_P' = \beta_P + (d\beta_P/dt)(t' - t) + \cdots$  (118)

Using equation (111) in equations (117) and (118) gives

$\beta_E' = \beta_E - (d\beta_E/dt)\,\phi t + \cdots$  (119)

$\beta_P' = \beta_P - (d\beta_P/dt)\,\phi t + \cdots$  (120)

Combining equations (119) and (120) with equations (115) and (116) respectively
gives the following time evolution equations for the gauge parameters

$(d\beta_E/dt)_- = -[(\beta_E/\phi)(\partial\phi/\partial E)/(\partial t/\partial E)]/[1 - t(\partial\phi/\partial E)/(\partial t/\partial E)]$  (121)

$(d\beta_P/dt)_- = -[(\beta_P/\phi)(\partial\phi/\partial P)/(\partial t/\partial P)]/[1 - t(\partial\phi/\partial P)/(\partial t/\partial P)]$  (122)

where to obtain equations (121) and (122) a negative exponent sign was chosen
in equation (111). If a positive sign had been chosen in equation (111) the
time evolution equations would be

$(d\beta_E/dt)_+ = -[(\beta_E/\phi)(\partial\phi/\partial E)/(\partial t/\partial E)]/[1 + t(\partial\phi/\partial E)/(\partial t/\partial E)]$  (123)

$(d\beta_P/dt)_+ = -[(\beta_P/\phi)(\partial\phi/\partial P)/(\partial t/\partial P)]/[1 + t(\partial\phi/\partial P)/(\partial t/\partial P)]$  (124)

where the gauge parameters $\beta_E$ and $\beta_P$ are given in equations (2) and (3) respectively.

Because the time evolution equations must be independent of the choice of
the sign of the exponent in equation (111), the rate of change of the gauge parameters
must be written as

$d\beta_E/dt = \tfrac{1}{2}[(d\beta_E/dt)_- + (d\beta_E/dt)_+]$  (125)

$d\beta_P/dt = \tfrac{1}{2}[(d\beta_P/dt)_- + (d\beta_P/dt)_+]$  (126)

Combining equations (121) through (126) gives

$d\beta_E/dt = -(\beta_E\psi_E/\phi)/(1 - t^2\psi_E^2)$  (127)

$d\beta_P/dt = -(\beta_P\psi_P/\phi)/(1 - t^2\psi_P^2)$  (128)

where

$\psi_E = (\partial\phi/\partial E)/(\partial t/\partial E)$  (129)

$\psi_P = (\partial\phi/\partial P)/(\partial t/\partial P)$  (130)
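
The averaging step that leads from equations (121), (123) and (125) to equation (127) can be checked numerically. The following sketch uses arbitrary sample values for $\beta_E$, $\phi$, $\psi_E$ and $t$; the function names are illustrative only.

```python
def rate_minus(beta, phi, psi, t):
    """Equation (121): rate for the negative exponent in equation (111)."""
    return -(beta * psi / phi) / (1.0 - t * psi)


def rate_plus(beta, phi, psi, t):
    """Equation (123): rate for the positive exponent in equation (111)."""
    return -(beta * psi / phi) / (1.0 + t * psi)


def rate_symmetric(beta, phi, psi, t):
    """Equation (127): sign-independent rate."""
    return -(beta * psi / phi) / (1.0 - (t * psi) ** 2)


# Arbitrary sample state (assumed values, chosen so that t*psi != 1).
beta, phi, psi, t = 1.3, 0.4, 0.25, 1.5
average = 0.5 * (rate_minus(beta, phi, psi, t) + rate_plus(beta, phi, psi, t))
closed_form = rate_symmetric(beta, phi, psi, t)
print(average, closed_form)
```

The two printed numbers agree to rounding error because $\tfrac{1}{2}[1/(1 - x) + 1/(1 + x)] = 1/(1 - x^2)$ with $x = t\psi_E$.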


Because $\phi$ appears symmetrically in equations (127) and (128) a positive or negative
choice for the sign of $\phi$ does not alter the result. A similar analysis
using the gauge transformation given in equation (112) gives the result

$d\beta_E/dt = -(\beta_E\psi_E/\phi)/(1 + t^2\psi_E^2)$  (131)

$d\beta_P/dt = -(\beta_P\psi_P/\phi)/(1 + t^2\psi_P^2)$  (132)

where

$\psi_E = (\partial\phi/\partial E)/(\partial t/\partial E)$  (133)

$\psi_P = (\partial\phi/\partial P)/(\partial t/\partial P)$  (134)

Again, because $\phi$ appears symmetrically in equations (131) and (132) it follows
that these equations are independent of the sign in the exponent of equation
(112). For coherent time variation with $dt = jt\,d\theta_t$ it follows from equations
(131) and (132) that

$(1/t)\,d\beta_E/d\theta_t = -(\beta_E\psi_E'/\phi)/(1 - \psi_E'^{\,2})$  (135)

$(1/t)\,d\beta_P/d\theta_t = -(\beta_P\psi_P'/\phi)/(1 - \psi_P'^{\,2})$  (136)

where

$\psi_E' = (\partial\phi/\partial E)/(\partial\theta_t/\partial E)$  (137)

$\psi_P' = (\partial\phi/\partial P)/(\partial\theta_t/\partial P)$  (138)

A similar analysis can be done for equations (45A) and (45B) to obtain expressions
for $dq_1/dt$ and $ds_1/dt$; however, a simpler way is to combine equations
(46) and (47) with equations (127) and (128).

Equations (127) and (128) become simpler for short time scales

$d\beta_E/dt = -\beta_E\psi_E/\phi$  (139)

$d\beta_P/dt = -\beta_P\psi_P/\phi$  (140)

By writing $\phi$ as a product $\phi = f(P)g(E)$, it is easy to show that the solution
to equations (139) and (140) is

$\phi = c(\beta_E\beta_P)^{-1} = c[(T/V)\,C_{PV}(P - K_U)]^{-1}$  (141)

where c = constant. The value of c can be chosen by evaluating equation (141)
at a specific thermodynamic state with $\phi = \phi_1$, $T = T_1$ and $n = n_1$. Equations
(127) and (128) describe the time evolution of an interacting thermodynamic
system in terms of the gauge parameters $\beta_P$ and $\beta_E$ and in terms of a potential
function $\phi(E,P)$. Thus the gauge parameters $\beta_P$ and $\beta_E$ are the basic physical
quantities of nonequilibrium thermodynamics. Equation (128) has a proper T = 0
limit

$d\beta_P^o/dt = -(\beta_P^o\psi_P^o/\phi^o)/[1 - (t^o)^2(\psi_P^o)^2]$  (142)

where $\beta_P^o$ is given by equation (13) and $\psi_P^o$ is obtained from equation (130) to be

$\psi_P^o = (\partial\phi^o/\partial P^o)/(\partial t^o/\partial P^o)$  (143)

The basic time evolution equations for interacting thermodynamic systems can
be obtained from the scale and gauge invariance of the fundamental equations
of time (21) and (33) respectively.


5.  THERMODYNAMIC  PARAMETERS  FOR  SOLIDS,  QUANTUM  LIQUIDS  AND  REAL  GASES. 
This section evaluates the thermodynamic parameters $e$, $f$, $g$, $h$, $\beta_E$ and $\beta_P$ that
appear in the equations of time (45A) and (45B) and the reaction rate equations
(64) and (66).

A.  Solids  and  Quantum  Liquids. 

The  state  equation  of  a  solid  or  Fermi  liquid  in  which  a  nuclear,  atomic 
or  molecular  reaction  is  occurring  is  assumed  to  have  the  following  simple 
closed  form29 

$E = E_o + E_j T^j$  (144)

$P = P_o + P_j T^j$  (145)

where $E$ and $P$ = renormalized energy density and pressure respectively, $E_o$ and
$P_o$ = corresponding zero-temperature values of the energy density and pressure,
and $E_j$ and $P_j$ = thermal coefficients for energy density and pressure respectively,
and where29

j = 1      high temperature solid
j = 2      low temperature Fermi gas
j = 5/2    low temperature molecular Bose gas
j = 4      low temperature solid
Combining  equations  (44),  (144)  and  (145)  gives 

$D_e = jT^{j-1}(\ell_j + m_j T^j)$  (146)

$1/D_e = (j\ell_j T^{j-1})^{-1}[1 - (m_j/\ell_j)T^j + \cdots]$  (147)

where

$\ell_j = E_j\,\partial P_o/\partial n - P_j\,\partial E_o/\partial n$  (148)

$m_j = E_j\,\partial P_j/\partial n - P_j\,\partial E_j/\partial n$  (149)

Then equations (42) and (43) are used to calculate the coefficients $e$, $f$, $g$,
and $h$ as follows

$e = E_j/(\ell_j + m_j T^j)$  (150)

$f = (\partial E_o/\partial n + \partial E_j/\partial n\; T^j)/D_e$  (151)

$g = P_j/(\ell_j + m_j T^j)$  (152)

$h = (\partial P_o/\partial n + \partial P_j/\partial n\; T^j)/D_e$  (153)
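
Equations (146) through (153) can be evaluated directly once the zero-temperature and thermal coefficients and their density derivatives are known. The sketch below uses made-up numbers and a hypothetical function name. The final check, $eh - fg = 1/D_e$, is not stated in the text but follows algebraically from equations (146) and (148) through (153).

```python
def solid_coefficients(T, j, dE0_dn, dP0_dn, E_j, P_j, dEj_dn, dPj_dn):
    """Return (e, f, g, h, D_e) for E = E_o + E_j*T**j, P = P_o + P_j*T**j."""
    l_j = E_j * dP0_dn - P_j * dE0_dn                  # equation (148)
    m_j = E_j * dPj_dn - P_j * dEj_dn                  # equation (149)
    D_e = j * T ** (j - 1) * (l_j + m_j * T ** j)      # equation (146)
    e = E_j / (l_j + m_j * T ** j)                     # equation (150)
    f = (dE0_dn + dEj_dn * T ** j) / D_e               # equation (151)
    g = P_j / (l_j + m_j * T ** j)                     # equation (152)
    h = (dP0_dn + dPj_dn * T ** j) / D_e               # equation (153)
    return e, f, g, h, D_e


# Illustrative low-temperature solid (j = 4) with assumed coefficient values.
e, f, g, h, D_e = solid_coefficients(T=2.0, j=4,
                                     dE0_dn=1.0, dP0_dn=2.0,
                                     E_j=0.5, P_j=0.3,
                                     dEj_dn=0.1, dPj_dn=0.2)
consistency = e * h - f * g - 1.0 / D_e   # should vanish to rounding error
print(e, f, g, h, D_e, consistency)
```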


For small T equations (150) through (153) can be expanded in a power series in
T. These equations determine $e$, $f$, $g$ and $h$ for state equations of the form
given in equations (144) and (145). The gauge functions are required for the
solution of the time and rate equations (45) and (64) or (66) respectively. The
functions $\beta_E$ and $\beta_P$ are defined in equations (2) and (3) which combined with
the state equations (144) and (145) give29,30

$\beta_E = j[1 + \gamma_o P_o/(P_o - K_o)]E_j T^j$  (154)

$\beta_P = (1 + \gamma_o)P_o - K_o + E_j V\,(d\gamma_o/dV)\,T^j$  (155)

For the conventional state equation of the form

$E^a = E_o^a + E_j^a T^j$  (156)

$P^a = P_o^a + P_j^a T^j$  (157)

it follows that

$D_e^a = jT^{j-1}(\ell_j^a + m_j^a T^j)$  (158)

$\ell_j^a = E_j^a\,\partial P_o^a/\partial n - P_j^a\,\partial E_o^a/\partial n$  (159)

$m_j^a = E_j^a\,\partial P_j^a/\partial n - P_j^a\,\partial E_j^a/\partial n$  (160)

$g^a = P_j^a/(\ell_j^a + m_j^a T^j)$  (161)

$h^a = (\partial P_o^a/\partial n + \partial P_j^a/\partial n\; T^j)/D_e^a$  (162)

$\beta_E^a = j[1 + \gamma_o^a P_o^a/(P_o^a - K_o^a)]E_j^a T^j$  (163)

which are the quantities required to evaluate the right hand sides of the time
equation (45) and the rate equation (64). Before equations (45) and (64) can
be solved, the energy trace equation (1) must be solved to determine the renormalized
functions $E_o$, $P_o$, $E_j$, $P_j$, $\beta_E$ and $\beta_P$ in terms of the corresponding
unrenormalized quantities $E_o^a$, $P_o^a$, $E_j^a$, $P_j^a$, $\beta_E^a$ and $\beta_P^a$. Finally, the
unrenormalized reaction rate $R^a$ that appears in equations (64) and (66) for
processes that occur within solids or quantum liquids may be of the form

$R^a = R_o^a + \cdots$  (164)

where a = constant. Equation (164) allows processes to occur at T = 0 due to
pressure.


B. Real Gases.

For real gases the renormalized state equations are given by a solution of
the relativistic energy trace equation (1) as29

$P = nR_G T(1 + Bn + Cn^2 + Dn^3 + \cdots)$  (165)

$E = nR_G T(3/2 - T\,dB/dT\; n - \tfrac{1}{2}T\,dC/dT\; n^2 - \tfrac{1}{3}T\,dD/dT\; n^3 - \cdots)$  (166)

where $R_G$ = gas constant. The corresponding unrenormalized state equations are
written as2

$P^a = nR_G T(1 + B^a n + C^a n^2 + D^a n^3 + \cdots)$  (167)

$E^a = nR_G T(3/2 - T\,dB^a/dT\; n - \tfrac{1}{2}T\,dC^a/dT\; n^2 - \tfrac{1}{3}T\,dD^a/dT\; n^3 - \cdots)$  (168)

where Reference 29 gives the connection between the renormalized and unrenormalized
virial coefficients by solving equation (1) out to third order. Combining
equations (44), (165) and (166) gives after some algebra

$D_e = n^2 R_G^2 T(a_o + a_1 n + a_2 n^2 + \cdots)$  (169)

where

$a_o = \tfrac{3}{2}B - T^2\,d^2B/dT^2 - \tfrac{3}{2}T\,dB/dT$  (170)

$a_1 = 3C - \tfrac{1}{2}\,d/dT(T^2\,dC/dT) - 2B\,d/dT(T^2\,dB/dT) + 2T\,dB/dT\; d/dT(TB)$  (171)

$a_2 = \tfrac{9}{2}D - \tfrac{5}{6}T\,dD/dT - \tfrac{1}{3}T^2\,d^2D/dT^2 - 4CT\,dB/dT - 3CT^2\,d^2B/dT^2$
$\qquad - \tfrac{1}{2}BT\,dC/dT + \tfrac{7}{2}T^2\,dB/dT\; dC/dT - BT^2\,d^2C/dT^2$  (172)

Note that $D_e$ given by equation (169) begins with a second order term $n^2$ whereas
the pressure and energy density begin with linear terms in $n$ as shown in equations
(165) and (166).
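
The leading coefficients of equation (169) can be checked numerically. The sketch below assumes that equation (44), which is not reproduced in this section, defines $D_e$ as the Jacobian $\partial E/\partial T\,\partial P/\partial n - \partial P/\partial T\,\partial E/\partial n$; this is the form consistent with equations (146) and (169). The van der Waals-like virial coefficient $B(T) = b - a/T$, the constant $C$, the reduced units with $R_G = 1$ and all function names are illustrative assumptions.

```python
R_G = 1.0                      # gas constant in reduced units (assumption)
A, B0, C0 = 0.3, 0.1, 0.02     # illustrative constants, not from the paper


def B(T):
    return B0 - A / T          # assumed second virial coefficient


def dB(T):
    return A / T ** 2          # dB/dT


def d2B(T):
    return -2.0 * A / T ** 3   # d^2B/dT^2


def P(n, T):
    """Equation (165) truncated after the third virial coefficient."""
    return n * R_G * T * (1.0 + B(T) * n + C0 * n ** 2)


def E(n, T):
    """Equation (166) with dC/dT = 0 and D = 0."""
    return n * R_G * T * (1.5 - T * dB(T) * n)


def central(fun, x, h=1e-6):
    """Central finite-difference derivative."""
    return (fun(x + h) - fun(x - h)) / (2.0 * h)


def jacobian_De(n, T):
    E_T = central(lambda x: E(n, x), T)
    E_n = central(lambda x: E(x, T), n)
    P_T = central(lambda x: P(n, x), T)
    P_n = central(lambda x: P(x, T), n)
    return E_T * P_n - P_T * E_n


n, T = 1.0e-2, 1.0
a0 = 1.5 * B(T) - T ** 2 * d2B(T) - 1.5 * T * dB(T)              # equation (170)
a1 = (3.0 * C0                                                   # equation (171), constant C
      - 2.0 * B(T) * (2.0 * T * dB(T) + T ** 2 * d2B(T))
      + 2.0 * T * dB(T) * (B(T) + T * dB(T)))
estimate = jacobian_De(n, T) / (n ** 2 * R_G ** 2 * T)
print(estimate, a0 + a1 * n)
```

The two printed values differ only at order $n^2$, which illustrates the cancellation of the linear terms noted above.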


The  values  of  e  ,  f  ,  g  and  h  for  the  real  gases  are  then  calculated  using 
equations  (42)  through  (44)  as  follows 


$e = (nR_G/D_e)[3/2 - d/dT(T^2\,dB/dT)\,n - \tfrac{1}{2}\,d/dT(T^2\,dC/dT)\,n^2 - \cdots]$  (173)
$\quad = [1/(nR_G T a_o)](3/2 - e_1 n + e_2 n^2 - \cdots)$

where

$e_1 = \tfrac{3}{2}a_1/a_o + d/dT(T^2\,dB/dT)$  (174)

$e_2 = \tfrac{3}{2}[(a_1/a_o)^2 - a_2/a_o] + (a_1/a_o)\,d/dT(T^2\,dB/dT) - \tfrac{1}{2}\,d/dT(T^2\,dC/dT)$  (175)

$f = (R_G T/D_e)(3/2 - 2T\,dB/dT\; n - \tfrac{3}{2}T\,dC/dT\; n^2 - \cdots)$  (176)

$\quad = [1/(n^2 R_G a_o)](3/2 - f_1 n + f_2 n^2 - \cdots)$  (177)

where

$f_1 = \tfrac{3}{2}a_1/a_o + 2T\,dB/dT$  (178)

$f_2 = \tfrac{3}{2}[(a_1/a_o)^2 - a_2/a_o] + 2(a_1/a_o)\,T\,dB/dT - \tfrac{3}{2}T\,dC/dT$  (179)

$g = (nR_G/D_e)[1 + d/dT(TB)\,n + d/dT(TC)\,n^2 + \cdots]$  (180)
$\quad = [1/(nR_G T a_o)](1 + g_1 n + g_2 n^2 + \cdots)$

where

$g_1 = d/dT(TB) - a_1/a_o$  (181)

$g_2 = (a_1/a_o)^2 - a_2/a_o + d/dT(TC) - (a_1/a_o)\,d/dT(TB)$  (182)

$h = (R_G T/D_e)(1 + 2Bn + 3Cn^2 + 4Dn^3 + \cdots)$  (183)
$\quad = [1/(n^2 R_G a_o)](1 + h_1 n + h_2 n^2 + \cdots)$

where

$h_1 = 2B - a_1/a_o$  (184)

$h_2 = 3C - 2Ba_1/a_o + (a_1/a_o)^2 - a_2/a_o$  (185)
(185) 
The corresponding unrenormalized coefficients $g^a$ and $h^a$ that appear in the right
hand sides of equations (45) and (64) are obtained in a similar way with the
substitutions $B \to B^a$, $C \to C^a$ and $D \to D^a$ in equations (180) through (185) and
in equations (170) through (172).

The values of $\beta_E$ and $\beta_P$ that appear in the time equation (45) and the rate
equations (64) and (66) have already been evaluated for the real gases and are
given by29

$\beta_P = [\tfrac{2}{3}(\beta - T\,d\beta/dT) - \beta]n^2 + \cdots$  (186)
$\quad = -R_G T(B + \tfrac{2}{3}T\,dB/dT)n^2 + \cdots$

$\beta_E = nT(Y - nZ)$  (187)

where

$Y = R_G[3/2 - (1/\beta)(\beta - T\,d\beta/dT)]$  (188)
$\quad = R_G[3/2 + (T/B)\,dB/dT]$

$Z = T\,d^2\beta/dT^2 - (2R_G\Gamma/\beta^2)(\beta - T\,d\beta/dT) + (1/\beta)(d\beta/dT)(\beta - T\,d\beta/dT) + (R_G/\beta)(\Gamma - T\,d\Gamma/dT)$  (189)
$\quad = R_G[2T\,dB/dT + T^2\,d^2B/dT^2 + (2C/B^2)\,T\,dB/dT - (1 + (T/B)\,dB/dT)\,T\,dB/dT - (T/B)\,dC/dT]$

and where the two forms of the second and third virial coefficients are related
by29

$\beta = R_G TB \qquad \Gamma = R_G TC$  (190)

The values of $\beta_P^a$ and $\beta_E^a$ are obtained by analogous formulas with the superscript
"a" added. The connection between $B^a$, $C^a$ and $B$, $C$ is given in Reference 29.
The reaction rate of processes that occur in real gases can be written in the
form

$R^a = g^a(n)T^\kappa$  (191)

where $\kappa$ = constant. Thus all of the quantities necessary to solve the reaction
rate equations (64) or (66) have been determined for processes occurring in the
real gases.
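
The two forms of equations (186) and (188) are related through equation (190), and their agreement can be confirmed numerically. The sketch below assumes an illustrative second virial coefficient $B(T) = b - a/T$ in reduced units with $R_G = 1$; these values and the function names are not taken from the paper.

```python
R_G = 1.0            # gas constant in reduced units (assumption)
A, B0 = 0.3, 0.1     # illustrative constants


def B(T):
    return B0 - A / T                      # assumed second virial coefficient


def dB(T):
    return A / T ** 2                      # dB/dT


def beta_virial(T):
    return R_G * T * B(T)                  # equation (190): beta = R_G * T * B


def dbeta_virial(T):
    return R_G * (B(T) + T * dB(T))        # d(beta)/dT by the product rule


T = 1.7
bracket = beta_virial(T) - T * dbeta_virial(T)

# Equation (188) in both forms.
Y_first = R_G * (1.5 - bracket / beta_virial(T))
Y_second = R_G * (1.5 + T / B(T) * dB(T))

# Coefficient of n**2 in equation (186) in both forms.
bp_first = (2.0 / 3.0) * bracket - beta_virial(T)
bp_second = -R_G * T * (B(T) + (2.0 / 3.0) * T * dB(T))
print(Y_first, Y_second, bp_first, bp_second)
```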

6. INCOHERENT TIME IN A RADIATION FIELD. This section considers the effects
of radiation (mechanical or electromagnetic) on the local time in matter
or energy. The effects can be calculated by considering the presence of radiation
as producing a perturbation in the time calculated by equation (45) as
follows

$t + t_r - (q_1 + q_{1r})\,\partial/\partial T\,(t + t_r) + (s_1 + s_{1r})\,\partial/\partial n\,(t + t_r)$
$\quad = t^a + t_r^a - (q^a + q_r^a)\,\partial/\partial T\,(t^a + t_r^a) + (s^a + s_r^a)\,\partial/\partial n\,(t^a + t_r^a)$  (192)

Subtracting equation (45) from equation (192) and retaining only first order
terms gives the renormalization group equation for time in a radiation field in
matter or energy as

$t_r - q_1\,\partial t_r/\partial T + s_1\,\partial t_r/\partial n - q_{1r}\,\partial t/\partial T + s_{1r}\,\partial t/\partial n = t_r^a + H^a$  (193)

where

$q_{1r} = h_r\beta_E + h\beta_{Er} + 3f_r\beta_P + 3f\beta_{Pr}$  (194)

$s_{1r} = g_r\beta_E + g\beta_{Er} + 3e_r\beta_P + 3e\beta_{Pr}$  (195)

$H^a = -\beta_E^a h^a\,\partial t_r^a/\partial T + \beta_E^a g^a\,\partial t_r^a/\partial n - q_r^a\,\partial t^a/\partial T + s_r^a\,\partial t^a/\partial n$  (196)

and where

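$h_r = (\partial P_r/\partial n - hZ_r)/D_e$  (197)

$f_r = (\partial E_r/\partial n - fZ_r)/D_e$  (198)

$g_r = (\partial P_r/\partial T - gZ_r)/D_e$  (199)

$e_r = (\partial E_r/\partial T - eZ_r)/D_e$  (200)

$Z_r = \partial P_r/\partial n\;\partial E/\partial T + \partial E_r/\partial T\;\partial P/\partial n - \partial P_r/\partial T\;\partial E/\partial n - \partial E_r/\partial n\;\partial P/\partial T$  (201)

$q_r^a = h_r^a\beta_E^a + h^a\beta_{Er}^a$  (202)

$s_r^a = g_r^a\beta_E^a + g^a\beta_{Er}^a$  (203)

where $D_e$ is given by equation (44) and where

$\beta_{Er} = (T/V)\{[d/dT(U + U_r)]_{(P+P_r)V} - (dU/dT)_{PV}\}$  (204)

$\beta_{Pr} = d/dV[(P + P_r)V]_{U+U_r} - d/dV(PV)_U$  (205)

The value of $\beta_{Er}^a$ is obtained from equation (204) with the superscript "a"
added to $U$ and $U_r$. Finally, combining equations (194), (195) and (197)
through (203) gives

$q_{1r} = q_1(f\,\partial P_r/\partial T - h\,\partial E_r/\partial T) - s_1(f\,\partial P_r/\partial n - h\,\partial E_r/\partial n) + h\beta_{Er} + 3f\beta_{Pr}$  (206)

$s_{1r} = q_1(e\,\partial P_r/\partial T - g\,\partial E_r/\partial T) - s_1(e\,\partial P_r/\partial n - g\,\partial E_r/\partial n) + g\beta_{Er} + 3e\beta_{Pr}$  (207)

All the necessary quantities for solving the radiation time equation (193) have
now been determined.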
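
The reduction of equations (194) and (195) to the closed forms (206) and (207) can be checked with arbitrary numbers. The sketch below assumes that equations (42) through (44), which are not reproduced here, define $e = (\partial E/\partial T)/D_e$, $f = (\partial E/\partial n)/D_e$, $g = (\partial P/\partial T)/D_e$, $h = (\partial P/\partial n)/D_e$ with $D_e$ the Jacobian, and that equations (46) and (47) give $q_1 = h\beta_E + 3f\beta_P$ and $s_1 = g\beta_E + 3e\beta_P$. All numerical values are made up for illustration.

```python
# Partial derivatives of the matter state equation (assumed sample values).
E_T, E_n, P_T, P_n = 1.9, 0.7, 0.4, 1.3
# Partial derivatives of the radiation contributions E_r and P_r.
Er_T, Er_n, Pr_T, Pr_n = 0.11, -0.05, 0.08, 0.02
# Gauge parameters and their radiation perturbations.
beta_E, beta_P, beta_Er, beta_Pr = 0.6, 0.9, 0.03, -0.04

D_e = E_T * P_n - P_T * E_n                       # assumed form of equation (44)
e, f, g, h = E_T / D_e, E_n / D_e, P_T / D_e, P_n / D_e
q1 = h * beta_E + 3.0 * f * beta_P                # assumed form of equation (46)
s1 = g * beta_E + 3.0 * e * beta_P                # assumed form of equation (47)

Z_r = Pr_n * E_T + Er_T * P_n - Pr_T * E_n - Er_n * P_T   # equation (201)
h_r = (Pr_n - h * Z_r) / D_e                      # equation (197)
f_r = (Er_n - f * Z_r) / D_e                      # equation (198)
g_r = (Pr_T - g * Z_r) / D_e                      # equation (199)
e_r = (Er_T - e * Z_r) / D_e                      # equation (200)

# Definitions, equations (194) and (195).
q1r_def = h_r * beta_E + h * beta_Er + 3.0 * f_r * beta_P + 3.0 * f * beta_Pr
s1r_def = g_r * beta_E + g * beta_Er + 3.0 * e_r * beta_P + 3.0 * e * beta_Pr

# Closed forms, equations (206) and (207).
q1r_closed = (q1 * (f * Pr_T - h * Er_T) - s1 * (f * Pr_n - h * Er_n)
              + h * beta_Er + 3.0 * f * beta_Pr)
s1r_closed = (q1 * (e * Pr_T - g * Er_T) - s1 * (e * Pr_n - g * Er_n)
              + g * beta_Er + 3.0 * e * beta_Pr)
print(q1r_def - q1r_closed, s1r_def - s1r_closed)
```

Both differences vanish to rounding error; the reduction relies on the identity $eh - fg = 1/D_e$.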

7. COHERENT TIME AND HIGH TEMPERATURE SUPERCONDUCTIVITY. This section
develops a relativistic equation for coherent time and considers applications
to the theory of high temperature superconductivity. First, the complex number
time equation is written in its general form where both the magnitude and the
internal phase angle of time vary with temperature and pressure. Secondly, the
case of slow processes is considered where time varies linearly, the internal
phase angle can be set to zero, and the scalar incoherent time equation is
regained. Thirdly, the case of ultrafast time changes is considered in which time
coherently rotates in internal space. Finally, applications to high temperature
superconductivity are considered.32-38

A.  General  Form  of  the  Complex  Number  Time  Equation. 

The complex number time equations (33) and (45A) can be written as

$\mathcal{L}t = t(1 + L_{inc} + jL_{coh}) = t^a - \beta_E^a\,\partial t^a/\partial E^a$  (208)

where $\mathcal{L}$ = time operator given by equation (23), and where $L_{inc}$ and $L_{coh}$ = incoherent
and coherent time components respectively which are given by

$L_{inc} = (1/t)(-\beta_E\,\partial t/\partial E + 3\beta_P\,\partial t/\partial P)$  (209)

$\quad = (1/t)(-q_1\,\partial t/\partial T + s_1\,\partial t/\partial n)$  (210)

$L_{coh} = -\beta_E\,\partial\theta_t/\partial E + 3\beta_P\,\partial\theta_t/\partial P$  (211)

$\quad = -q_1\,\partial\theta_t/\partial T + s_1\,\partial\theta_t/\partial n$  (212)

where in order to obtain the coherent components in equations (211) and (212)
the following coherent time rotation is used

$dt = jt\,d\theta_t = jte^{j\theta_t}\,d\theta_t$  (213)

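The real and imaginary components of equation (208) are obtained using equations
(209) and (211) to be

$t\cos\theta_t - \beta_E\sec\theta_{tE}\cos(\theta_t + \theta_{tE})\,\partial t/\partial E$
$\quad + 3\beta_P\sec\theta_{tP}\cos(\theta_t + \theta_{tP})\,\partial t/\partial P = t^a - \beta_E^a\,\partial t^a/\partial E^a$  (214)

$t\sin\theta_t - \beta_E\sec\theta_{tE}\sin(\theta_t + \theta_{tE})\,\partial t/\partial E$
$\quad + 3\beta_P\sec\theta_{tP}\sin(\theta_t + \theta_{tP})\,\partial t/\partial P = 0$  (215)

where

$\tan\theta_{tE} = (t\,\partial\theta_t/\partial E)/(\partial t/\partial E)$  (216)

$\tan\theta_{tP} = (t\,\partial\theta_t/\partial P)/(\partial t/\partial P)$  (217)

Similarly, the real and imaginary parts of equation (208) can also be written
using equations (210) and (212) as follows

$t\cos\theta_t - q_1\sec\theta_{tT}\cos(\theta_t + \theta_{tT})\,\partial t/\partial T$
$\quad + s_1\sec\theta_{tn}\cos(\theta_t + \theta_{tn})\,\partial t/\partial n = t^a - q^a\,\partial t^a/\partial T + s^a\,\partial t^a/\partial n$  (218)

$t\sin\theta_t - q_1\sec\theta_{tT}\sin(\theta_t + \theta_{tT})\,\partial t/\partial T$
$\quad + s_1\sec\theta_{tn}\sin(\theta_t + \theta_{tn})\,\partial t/\partial n = 0$  (219)

where

$\tan\theta_{tT} = (t\,\partial\theta_t/\partial T)/(\partial t/\partial T)$  (220)

$\tan\theta_{tn} = (t\,\partial\theta_t/\partial n)/(\partial t/\partial n)$  (221)

These are the equations that describe partially coherent time.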

B.  Incoherent  Time  Equation. 

If the processes occur relatively slowly then the change in linear time
is much greater than the rotated component, $dt \gg t\,d\theta_t$ or $L_{inc} \gg L_{coh}$, and equation
(208) becomes

$\mathcal{L}t \sim (1 + L_{inc})t = t^a - \beta_E^a\,\partial t^a/\partial E^a$  (222)

which gives $\theta_t = 0$ and yields the scalar equation

$t - \beta_E\,\partial t/\partial E + 3\beta_P\,\partial t/\partial P = t^a - \beta_E^a\,\partial t^a/\partial E^a$  (223)

which is the equation for incoherent time presented in the previous sections.


C.  Coherent  Time  Equation. 

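For an ultrafast process, time changes mainly coherently by rotation in
internal space so that the rotational time change is much greater than the
linear time change, $t\,d\theta_t \gg dt$ and $L_{coh} \gg L_{inc}$, in equation (208) which becomes

$\mathcal{L}t \sim t(1 + jL_{coh}) = t^a - \beta_E^a\,\partial t^a/\partial E^a$  (224)

which can be rewritten as either of the following two equations

$t(1 - j\beta_E\,\partial\theta_t/\partial E + 3j\beta_P\,\partial\theta_t/\partial P) = t^a - \beta_E^a\,\partial t^a/\partial E^a$  (225)

$t(1 - jq_1\,\partial\theta_t/\partial T + js_1\,\partial\theta_t/\partial n) = t^a - q^a\,\partial t^a/\partial T + s^a\,\partial t^a/\partial n$  (226)

Combining equation (20) with equations (225) and (226) and taking the real and
imaginary parts of these two equations yields the two equations for the coherent
time state

$\tan\theta_t = \beta_E\,\partial\theta_t/\partial E - 3\beta_P\,\partial\theta_t/\partial P$  (227)
$\quad = q_1\,\partial\theta_t/\partial T - s_1\,\partial\theta_t/\partial n$

$t = \cos\theta_t\,(t^a - \beta_E^a\,\partial t^a/\partial E^a)$  (228)
$\quad = \cos\theta_t\,(t^a - q^a\,\partial t^a/\partial T + s^a\,\partial t^a/\partial n)$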

Equation (227) is a nonlinear differential equation that determines the internal
phase angle of time $\theta_t$, and equation (228) determines the renormalized
incoherent linear time in terms of the unrenormalized time $t^a$. The measured
incoherent linear time is given by

$t_m = t\cos\theta_t = \cos^2\theta_t\,(t^a - \beta_E^a\,\partial t^a/\partial E^a)$  (229)

Coherent time becomes important when processes occur so fast on a linear time
scale that $dt \ll t\,d\theta_t$ and the rotated time describes the time in a coherent
state of matter.
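
The passage from equations (225) and (226) to equations (227) through (229) can be illustrated with complex arithmetic. The sketch below assumes that equation (20), which is not reproduced in this section, writes the complex time as $te^{j\theta_t}$ with real magnitude $t$, and it treats $L_{coh}$ and the real right hand side as given numbers. The function name and the sample values are illustrative.

```python
import cmath
import math


def coherent_time(L_coh, rhs):
    """Solve t*exp(j*theta)*(1 + j*L_coh) = rhs for real t and theta.

    rhs stands for the real right hand side t^a - beta_E^a * dt^a/dE^a.
    """
    theta = math.atan(-L_coh)        # equation (227): tan(theta_t) = -L_coh
    t = math.cos(theta) * rhs        # equation (228)
    return t, theta


L_coh, rhs = -0.4, 2.5               # assumed sample values
t, theta = coherent_time(L_coh, rhs)

# Substitute back into the complex equation; the result should be real and equal rhs.
lhs = t * cmath.exp(1j * theta) * (1.0 + 1j * L_coh)
t_measured = t * math.cos(theta)     # equation (229)
print(lhs, t_measured)
```

The imaginary part of `lhs` vanishes because $1 + jL_{coh} = e^{-j\theta_t}/\cos\theta_t$ when $\tan\theta_t = -L_{coh}$.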

D.  High  Temperature  Superconductivity. 

In the accompanying paper on electromagnetism it is shown that the broken
symmetry form of Ohm's law is

$R_{\alpha m} = R_{\alpha c}\{1 - \tan\theta_t\tan[2(\theta_\alpha - \theta_t)]\}\cos^2\theta_t$  (230)

for $\alpha = x, y, z$ and where $R_{\alpha m}$ = measured resistance in the $\alpha$ direction, and
$R_{\alpha c}$ = conventionally calculated resistance in the $\alpha$ direction. For the high
temperature superconducting state $R_{\alpha m} = 0$, which gives $\theta_t = \pi/2 - 2(\theta_\alpha - \theta_t)$,
which combined with the free s-pair condition $\theta_\alpha = 2\theta_t$ gives $\theta_t = \pi/6$ and
$\theta_\alpha = \pi/3$. Therefore, whereas the standard Bardeen-Cooper-Schrieffer (BCS)
theory predicts superconductivity to occur when $R_{\alpha c} = 0$, the high-Tc superconducting
state is associated with $R_{\alpha m} = 0$, which occurs when a coherent time
state exists within the planar copper oxides with $\theta_t = \pi/6$ when $T < T_c$.

For the coherent time state with $\theta_t = \pi/6$ equation (227) gives

$1/\sqrt{3} = q_1(\partial\theta_t/\partial T)_{T_c} - s_1(\partial\theta_t/\partial n)_{T_c}$  (231)

which determines the critical temperature $T_c$ in terms of the particle number
density of matter, $T_c = T_c(n)$, provided the internal phase angle $\theta_t(T,n)$ is
known. Equivalently, equation (231) determines the pressure dependence of the
critical transition temperature $T_c = T_c(P)$. For a coherent time superconducting
state, equation (228) gives the relativistic linear time as

$t = (\sqrt{3}/2)(t^a - \beta_E^a\,\partial t^a/\partial E^a)$  (232)
$\quad = (\sqrt{3}/2)(t^a - q^a\,\partial t^a/\partial T + s^a\,\partial t^a/\partial n)$

The measured linear time for the high-Tc superconducting state is given by

$t_m = (3/4)(t^a - \beta_E^a\,\partial t^a/\partial E^a)$  (233)

In a high-Tc superconductor processes run faster than is predicted by conventional
calculations.
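
The numerical factors in equations (230) through (233) follow from $\theta_t = \pi/6$ and $\theta_\alpha = 2\theta_t$ and can be confirmed with a few lines of code. The function name below is illustrative.

```python
import math


def resistance_ratio(theta_t, theta_alpha):
    """R_alpha_m / R_alpha_c from equation (230)."""
    return ((1.0 - math.tan(theta_t) * math.tan(2.0 * (theta_alpha - theta_t)))
            * math.cos(theta_t) ** 2)


theta_t = math.pi / 6.0
theta_alpha = 2.0 * theta_t                      # free pair condition

ratio = resistance_ratio(theta_t, theta_alpha)   # vanishes in the coherent time state
tan_factor = math.tan(theta_t)                   # left hand side of equation (231)
linear_factor = math.cos(theta_t)                # prefactor of equation (232)
measured_factor = math.cos(theta_t) ** 2         # prefactor of equation (233)
print(ratio, tan_factor, linear_factor, measured_factor)
```

For comparison, `resistance_ratio(0.0, 0.0)` returns 1, which recovers the conventional resistance when the internal phase angles vanish.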

Two renormalized time scales must be distinguished in matter: a) linear
time or incoherent time $t$, and b) circular or coherent time $t\theta_t$ for a coherent
time state. Therefore in a coherent time state physical processes,
such as electron-muthon scattering, occur during the time $t\theta_t$, while in an
incoherent time state physical processes, such as the electron-phonon scattering
of the conventional BCS theory, occur on the linear time scale $t$.
These two time scales are associated with corresponding energy scales. The
characteristic relative energy scale for superconductors is the normalized superconductivity
energy gap32,38

$\Delta' = 2\Delta/(kT_c)$  (234)

where $\Delta$ = superconductivity energy gap and $T_c$ = transition temperature. For a
coherent time state high-Tc superconductor the energy gap is designated by $\Delta_{ct}$,
while the superconductivity energy gap for an incoherent time (conventional
BCS) superconductor is designated by $\Delta_{it}$. Then using the Heisenberg uncertainty
principle with the normalized superconductivity energy gaps gives

$\Delta'_{ct}\,t\theta_t = \Delta'_{it}\,t = \cdots$  (235)

so that

$\Delta'_{ct} = \Delta'_{it}/\theta_t = (6/\pi)\,\Delta'_{it} \sim 1.91\,\Delta'_{it}$  (236)

because $\theta_t = \pi/6$ for the high-Tc superconducting state. The larger normalized
superconductivity energy gap predicted for high temperature superconductors by
equation (236) has been experimentally observed.30

8. CONCLUSION. A renormalization group equation for time is developed which suggests that the reaction rates of processes occurring in an ambient thermodynamic medium depend on the state equation of the medium. Physical processes run more slowly when they occur in a real medium with $PV \neq aU$ than they would if they occurred in an ideal thermodynamic medium or vacuum. These results follow from a gauge theory of time that is based on the Minkowski space-time metric. The dimensions of space and time also depend on the thermodynamic state equation of the ambient matter. Theoretical predictions of reaction rates and the dimensions of space and time can be made for solids, Fermi liquids, and the real classical gases by solving the coupled renormalization group equations for energy, time and the dimensions of space and time. The renormalization group equation for coherent time is developed and applied to the problem of describing the high-Tc superconducting state. It is concluded that high-Tc superconductors represent a coherent time state of matter which can be described by the renormalization group equation for coherent time. This time state occurs in matter with free pairs of electrons when the internal phase angle of time is given by $\theta_t = \pi/6$. This implies that the normalized superconductivity energy gap for high-Tc planar copper oxides should be $6/\pi$ times the magnitude of the normalized energy gap of conventional BCS superconductors.
Figure 1. Sketch of renormalized (<) and unrenormalized (<*) average energy per particle for a neutron gas in terms of particle number density.

Figure 2. Sketch of renormalized (t) and unrenormalized (t*) time in terms of particle number density.

Figure 3. Sketch of renormalized (R) and unrenormalized (R*) reaction rate in terms of particle number density.

Figure 4. Sketch of renormalized (Dt) and unrenormalized (Dt*) dimension of time in terms of particle number density.

Figure 5. Sketch of renormalized (Ds) and unrenormalized (Ds*) dimension of space in terms of particle number density.
THERMAL RADIATION OF HIGH-TC SUPERCONDUCTORS
Richard  A.  Weiss 

U.  S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi  39180 


ABSTRACT. The theory of a photon gas with broken internal symmetry is developed. The broken symmetry of time induces a broken symmetry in the photon frequency so that Planck's heat radiation law must be written in terms of a complex number spectral energy density. Over long distances the broken symmetry of time can be produced either by gravity or by some special nuclear, atomic or molecular structure that causes a coherent time state to exist in matter. For the case where the gravity of a star or planet produces the broken symmetry of time the internal phase angle of the frequency is essentially constant, the thermal radiation of matter is incoherent, and the Stefan-Boltzmann law is valid. But for the case of a high-Tc superconductor with T < Tc the presence of an incoherent time state for the crystal lattice and a coherent time state for the electron (hole) pairs requires the emitted thermal energy to have both an incoherent blackbody radiation component and a coherent non-blackbody component that is calculated by integrating over the internal phase angles of the frequency. This paper calculates the thermal energy spectrum and thermal energy density for high-Tc superconductors and suggests that the measured thermal energy density of high-Tc superconductors may serve as a test for the theory that the planar copper oxides in their superconducting state represent matter in a coherent time state.

1. INTRODUCTION. Broken symmetries play a major role in the understanding of basic phenomena in both particle and bulk matter physics. The broken symmetry vacuum is similar to the ground state of a many-body system.2 The physical vacuum has non-vanishing fields, and is analogous to the ground state of a ferromagnet. In both cases the Hamiltonians describing these systems are rotationally invariant. This is the case of spontaneously broken symmetry where the ground state does not have the symmetry of the Hamiltonian.2 The vacuum is thought to have the Higgs field extending throughout all space and having a non-zero value for the ground state. It is similar to the broken symmetry associated with the Ginzburg-Landau order parameter that describes the broken symmetry superconducting state in which the Cooper electron pairs break the symmetry of the ground state and produce a macroscopic coherent state of matter.2

The broken symmetry of the vacuum affects the state equations of interacting bulk matter by requiring the state functions such as pressure, internal energy and entropy to be complex numbers having internal phase angles.3,4 The effects of the broken symmetry vacuum on matter and energy are determined by solving a complex number relativistic trace equation.14 The broken symmetry of the vacuum state requires that the coordinates of space and time be complex numbers.4 This requirement affects the calculations of such diverse scientific disciplines as mechanics, electromagnetism and thermodynamics. In particular, it affects the calculation of the gravitational equilibrium of stars and planets through an intimate connection between the internal phase angle of the pressure and the internal phase angles of the space and time coordinates.4 The broken symmetry of the time coordinate requires that the frequency of light in matter also has an internal phase angle so that the elementary formulas of atomic physics, such as Planck's radiation law, must have asymmetric forms and exhibit broken internal symmetries.4 The broken symmetries of the space and time coordinates are induced over macroscopic distances in incoherent matter mainly by gravity, but atomic and molecular structure can also create a coherent state of space and time coordinates of macroscopic dimensions, as perhaps in the case of high-Tc superconductors. This paper describes the broken symmetry forms of Planck's law and the Stefan-Boltzmann law for the thermal radiation of matter in a gravitational field and for the thermal radiation from the surface of a high-Tc superconductor.

The effects of the broken symmetry of the Minkowski vacuum on the thermodynamic state functions of matter are calculated from the following complex number relativistic trace equation4

$$\bar U + T\left(\frac{\partial \bar U}{\partial T}\right)_{\bar P V} - 3V\frac{\partial}{\partial V}\left(\bar P V\right)_{\bar U} = U^a + T\left(\frac{\partial U^a}{\partial T}\right)_{P^a V} \tag{1}$$

where $\bar U$ and $\bar P$ = complex number renormalized values of the internal energy and pressure respectively, $U^a$ and $P^a$ = unrenormalized values of the internal energy and pressure respectively, $T$ = absolute temperature and $V$ = volume of a fixed number of particles. The renormalized internal energy and pressure are written as

$$\bar U = U e^{j\theta_U} \qquad \bar P = P e^{j\theta_P} \tag{2}$$

A many-body theory is used to determine $U^a(V,T)$ and $P^a(V,T)$ for an interacting system, and then equation (1) is used to determine the renormalized pressure $P$, $\theta_P$ and internal energy $U$, $\theta_U$. For a noninteracting system $P^aV = c_1U^a$ where $c_1$ = constant, and it follows from equation (1) that $U = U^a$, $P = P^a$, $\theta_U = 0$ and $\theta_P = 0$. The complex number values of the internal energy and pressure require that the coordinates of space and time are also complex numbers which can be written as4

$$\bar\alpha = \alpha e^{j\theta_\alpha} \qquad \bar t = t e^{j\theta_t} \tag{3}$$

for $\alpha = x, y, z$. The internal phase angles of the space and time coordinates $\theta_\alpha$ and $\theta_t$ are determined from the laws of mechanics such as Euler's equation of fluid motion.4 From the broken symmetry representation of time in equation (3) it follows that the period and frequency of vibrations can be written as

$$\bar{\mathcal T} = \mathcal T e^{j\theta_{\mathcal T}} \qquad \bar\nu = \nu e^{j\theta_\nu} \tag{4}$$

where because $\bar\nu = 1/\bar{\mathcal T}$ it follows that

$$\nu = 1/\mathcal T \qquad \theta_\nu = -\theta_{\mathcal T} = -\theta_t \tag{5}$$
For thermal radiation in matter with broken internal symmetry, due either to gravity or to nuclear, atomic or molecular structure, the procedure for calculating the renormalized radiation density is simple in principle but difficult in practice. The first step is the calculation of the unrenormalized energy of the photon-matter system, in which the radiation internal energy is written as $U_r^a = VE_r^a = VE_r^a(n,T)$ and the matter internal energy as $U^a(n,T)$ where $n$ = particle number density. The energy density of radiation is now a function of $n$ and $T$. The trace equation (1) is then solved with $U^a(n,T) + U_r^a(n,T)$ used as the source term, with the result that the renormalized internal energies $U(n,T)$, $\theta_U$, $U_r(n,T)$, $\theta_{Ur}$ are obtained along with the renormalized values of the Grüneisen parameters $\gamma(n,T)$, $\theta_\gamma$, $\gamma_r$ and $\theta_{\gamma r}$ for matter and radiation respectively.4 The calculation of the unrenormalized radiation internal energy $U_r^a(n,T)$ is very complicated if gravitational effects in matter are included. Even the calculation of the unrenormalized internal energy $U_r^a(T)$ for thermal radiation in empty space with gravitation is very complicated.
An approximate solution to the problem of calculating the energy of thermal radiation in matter with broken internal symmetries has been suggested that utilizes the implication of the trace equation (1) that the renormalized radiation density must be a complex number.4 The broken symmetry nature of time and frequency as described by equations (3) and (4) respectively suggests the following complex number generalization of Planck's radiation law4

$$\bar E_\nu = \bar A/[e^{h\bar\nu/(kT)} - 1] \qquad \bar A = 8\pi h\bar\nu^3/c^3 \tag{11}$$

where $\bar E_\nu$ = complex number value of the renormalized spectral energy density. Placing equation (4) into (11) gives4

$$\bar E_\nu = E_\nu e^{j\theta_{E\nu}} = A(B + jC)/D \tag{12}$$

$$E_\nu = (A/D)(B^2 + C^2)^{1/2} \tag{13}$$

$$\tan\theta_{E\nu} = C/B \tag{14}$$

where $A = |\bar A| = 8\pi h\nu^3/c^3$ is given by equation (6) and where4

$$D = e^{2x} - 2\cos y\, e^{x} + 1 \tag{15}$$

$$B = \cos(3\theta_\nu)(\cos y\, e^{x} - 1) + \sin(3\theta_\nu)\sin y\, e^{x} \tag{16}$$

$$C = \sin(3\theta_\nu)(\cos y\, e^{x} - 1) - \cos(3\theta_\nu)\sin y\, e^{x} \tag{17}$$

$$x = \frac{h\nu}{kT}\cos\theta_\nu \tag{18}$$

$$y = \frac{h\nu}{kT}\sin\theta_\nu \tag{19}$$

When $\theta_\nu = 0$ equation (13) reduces to the standard Planck spectral energy density.
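The decomposition in equations (12) and (15) through (19) can be checked numerically. The following is a minimal sketch, assuming the dimensionless variable u = hν/(kT) and working with the ratio of the complex spectral density to the real prefactor A; the function names are illustrative and do not come from the paper.

```python
import cmath
import math


def planck_complex(u, theta):
    """Direct evaluation of E_nu / A = e^{3j*theta} / (exp(u e^{j*theta}) - 1),
    where u = h*nu/(k*T) and theta is the internal phase angle of the frequency."""
    return cmath.exp(3j * theta) / (cmath.exp(u * cmath.exp(1j * theta)) - 1.0)


def planck_bcd(u, theta):
    """The same ratio assembled from B, C and D of equations (15)-(19)."""
    x = u * math.cos(theta)                      # equation (18)
    y = u * math.sin(theta)                      # equation (19)
    ex = math.exp(x)
    d = math.exp(2.0 * x) - 2.0 * math.cos(y) * ex + 1.0              # (15)
    b = (math.cos(3 * theta) * (math.cos(y) * ex - 1.0)
         + math.sin(3 * theta) * math.sin(y) * ex)                    # (16)
    c = (math.sin(3 * theta) * (math.cos(y) * ex - 1.0)
         - math.cos(3 * theta) * math.sin(y) * ex)                    # (17)
    return complex(b, c) / d                     # equation (12) divided by A


if __name__ == "__main__":
    for u, theta in [(0.5, -math.pi / 6), (2.0, -math.pi / 12), (1.5, 0.0)]:
        print(u, theta, planck_complex(u, theta), planck_bcd(u, theta))
```

For a zero phase angle the second function reduces to 1/(e^u - 1), the ordinary Planck factor, in line with the remark following equation (19).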

The complex number integrated energy density is given by4

$$\bar E_r = E_r e^{j\theta_{Er}} = \int_0^\infty \bar E_\nu\, d\bar\nu = \int_0^\infty E_\nu \sec\beta_{\nu\nu}\, e^{j(\theta_{E\nu} + \theta_\nu + \beta_{\nu\nu})}\, d\nu \tag{20}$$

where

$$\tan\beta_{\nu\nu} = \nu\, d\theta_\nu/d\nu \tag{21}$$

$$d\bar\nu = e^{j\theta_\nu}(d\nu + j\nu\, d\theta_\nu) = \sec\beta_{\nu\nu}\, d\nu\, e^{j(\theta_\nu + \beta_{\nu\nu})} \tag{22}$$

The complex number thermal radiation energy density given by equation (20) has the following real and imaginary parts

$$E_{rR} = E_r\cos\theta_{Er} = \int E_\nu \sec\beta_{\nu\nu}\cos\phi_\nu\, d\nu \tag{23}$$

$$E_{rI} = E_r\sin\theta_{Er} = \int E_\nu \sec\beta_{\nu\nu}\sin\phi_\nu\, d\nu \tag{24}$$

where

$$\phi_\nu = \theta_{E\nu} + \theta_\nu + \beta_{\nu\nu} \tag{25}$$

and where $E_\nu$ and $\theta_{E\nu}$ are given by equations (13) and (14) respectively. The magnitude and internal phase angle of the integrated radiation density are given by

$$E_r = (E_{rR}^2 + E_{rI}^2)^{1/2} \tag{26}$$

$$\tan\theta_{Er} = E_{rI}/E_{rR} \tag{27}$$

The measured thermal radiation density is given by the real part of the complex number integrated radiation density so that

$$E_{rm} = E_{rR} \tag{28}$$

and therefore the integral in equation (23) must be evaluated.

Further insight into the meaning of the complex number integral in equation (20) can be obtained by combining equations (20) and (22) to get

$$\bar E_r = \bar E_r^{inc} + \bar E_r^{coh} \tag{29}$$

where

$$\bar E_r^{inc} = \int_0^\infty \bar E_\nu\, e^{j\theta_\nu}\, d\nu \tag{30}$$

$$\bar E_r^{coh} = j\int \bar E_\nu\, \bar\nu\, d\theta_\nu \tag{31}$$

where $\bar E_r^{inc}$ = incoherent radiation energy density and $\bar E_r^{coh}$ = coherent radiation energy density. The complex number radiation density given in equation (20) has not been evaluated for arbitrary values of the frequency internal phase angle $\theta_\nu(\nu)$ due to the difficulty of evaluating the integral. The same is true for the measured energy density given in equation (23). Also there is the question of what the function $\theta_\nu(\nu)$ should be for radiation in matter with broken internal symmetries due to gravity or structure. In this paper two special cases of equation (20) are considered. The first case is that of incoherent radiation with $\theta_\nu = \theta_\nu^c$ = constant and the frequency $\nu$ being the integration variable; the second case is that of coherent radiation with $\nu = \nu_c$ = constant and the internal phase angle $\theta_\nu$ being the variable of integration. For these two cases equation (20), or equivalently equations (30) and (31), can be written as

$$\bar E_r^{inc} = e^{j\theta_\nu^c}\int_0^\infty \bar E_\nu\, d\nu \qquad \theta_\nu = \theta_\nu^c = \text{const.} \qquad d\bar\nu = e^{j\theta_\nu^c}\, d\nu \tag{32}$$

$$\bar E_r^{coh} = j\nu_c\int \bar E_\nu\, e^{j\theta_\nu}\, d\theta_\nu \qquad \nu = \nu_c = \text{const.} \qquad d\bar\nu = j\bar\nu\, d\theta_\nu \tag{33}$$

This paper assumes that the incoherent radiation from ordinary incoherent matter in a gravitational field can be described by equation (32), that coherent and incoherent time states exist in high-Tc superconductors, and that the calculation of the radiation energy density for these materials with T < Tc includes contributions from both equations (32) and (33). Equation (32) gives the radiation contribution from the time incoherent lattice of the high-Tc superconductors, and equation (33) gives the radiation contribution from the coherent time states of the electron (hole) pair condensate. Other forms of matter may require $\theta_\nu(\nu)$ to have a more general behaviour, thus making equations (20) and (23) difficult to evaluate.

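The general form of the integral in equation (20) can be rewritten by introducing the following change of variables

$$\bar\xi = \xi e^{j\theta_\xi} = h\bar\nu/(kT) \tag{34}$$

$$\xi = h\nu/(kT) \qquad \theta_\xi = \theta_\nu = -\theta_t \tag{35}$$

Then equation (20) can be written as

$$\bar E_r = \frac{8\pi k^4T^4}{c^3h^3}\, \bar I \tag{36}$$

where

$$\bar I = \int \bar\xi^3 (e^{\bar\xi} - 1)^{-1}\, d\bar\xi \tag{37}$$

The standard technique for evaluating the integral in equation (37) begins by using the following expansion

$$(e^{\bar\xi} - 1)^{-1} = e^{-\bar\xi} + e^{-2\bar\xi} + \cdots = \sum_{n=1}^{\infty} e^{-n\bar\xi} \tag{38}$$

Then

$$\bar I = \sum_{n=1}^{\infty} \bar I^n \tag{39}$$

where $n$ = integer and

$$\bar I^n = \int \bar\xi^3 e^{-n\bar\xi}\, d\bar\xi \tag{40}$$

Then the thermal radiation energy density is

$$\bar E_r = \frac{8\pi k^4T^4}{c^3h^3} \sum_{n=1}^{\infty} \bar I^n \tag{41}$$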
Thus  far  no  assumption  of  incoherent  or  coherent  radiation  has  been  made. 

The thermal radiation energy density for matter with broken internal symmetries is calculated both for the incoherent and coherent cases. Section 2 determines the incoherent thermal radiation energy density for the case of constant internal phase angle of time, as is found for example in a uniform gravitational field, and the coherent thermal radiation which is associated with the coherent time states of the electron (hole) pair fluid in high-Tc superconductors. For coherent time superconductors with T < Tc the thermal radiation density is a weighted average of a blackbody Stefan-Boltzmann component associated with the atomic lattice, and a coherent non-blackbody component which is associated with the electron (hole) Cooper pairs.

2. ASYMMETRIC THERMAL RADIATION. This section calculates the integrated thermal radiation density for: a) the case of incoherent radiation with constant internal phase angle of time, which perhaps describes radiation in the presence of a gravity field, and b) the case where electrons and radiation are in a coherent time state in which transitions occur so fast that a change in time occurs as a rotation in time, as perhaps in high-Tc superconductors, rather than as a linear time variation as in ordinary matter and conventional superconductors.

A. Incoherent Thermal Radiation.

The integral in equation (20) can be easily evaluated for the case when $\theta_\nu = -\theta_t$ = constant. This can be done by evaluating the integral in equation (37) for the special case $\theta_\xi = \theta_\xi^c = \theta_\nu^c = -\theta_t^c$ = constant, and $d\bar\xi = d\xi\, e^{j\theta_\xi^c}$. In this case the integral in equation (40) can be written as

$$\bar I^n = e^{j4\theta_\xi^c}\int_0^\infty \xi^3 e^{-\bar n\xi}\, d\xi \tag{42}$$

where $\bar n = n e^{j\theta_\xi^c}$ and $\bar I^n$ is a real number and

$$\bar I^n = I^n = \int_0^\infty \xi^3 e^{-n\xi}\, d\xi = 6/n^4 \tag{43}$$

Then equation (39) gives the following standard result5-9

$$\bar I = I = \sum_{n=1}^{\infty} 6/n^4 = \pi^4/15 \tag{44}$$

Therefore for $\theta_\xi = -\theta_t$ = constant the integrals in equations (36) and (37) are real numbers and the expression for the incoherent thermal energy density for broken symmetry radiation is

$$E_r^{inc} = \sigma T^4 \qquad \sigma = \frac{8\pi^5k^4}{15c^3h^3} \tag{45}$$

which is the standard Stefan-Boltzmann law. For this case the integral in equation (32) is a real number giving the result in equation (45).
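The statement that the integral along a ray of constant phase angle is real, and that the sum in equation (44) equals π⁴/15, can be checked with a short numerical sketch. It assumes a composite Simpson rule and a finite upper cutoff on ξ (both choices of this illustration, not of the paper) and uses only the Python standard library.

```python
import cmath
import math


def simpson(f, a, b, n=20000):
    """Composite Simpson rule with n (even) subintervals; f may be complex-valued."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0


def ray_integral(n, theta, xi_max=80.0):
    """Integral of xi_bar^3 * exp(-n * xi_bar) d(xi_bar) along the ray
    xi_bar = xi * e^{j*theta}, 0 <= xi <= xi_max (cutoff assumed large enough)."""
    phase = cmath.exp(1j * theta)

    def integrand(xi):
        xi_bar = xi * phase
        return xi_bar ** 3 * cmath.exp(-n * xi_bar) * phase

    return simpson(integrand, 0.0, xi_max)


def series_sum(terms=10000):
    """Partial sum of 6/n^4, which converges to pi^4/15."""
    return sum(6.0 / n ** 4 for n in range(1, terms + 1))


if __name__ == "__main__":
    print(ray_integral(1, -math.pi / 6))   # expected to be close to 6 + 0j
    print(series_sum(), math.pi ** 4 / 15)
```

The ray integral is independent of the phase angle as long as the exponential decays along the ray, which is why the constant-phase case reproduces the usual Stefan-Boltzmann constant.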

B. Thermal Radiation Spectrum of Coherent Time States.

The accompanying papers on electromagnetism and the gauge theory of time suggest that high-Tc superconductors are coherent time states of matter. In a manner analogous to the Planck concept of radiation being composed of harmonic oscillators, the radiation in a high-Tc superconductor is envisioned as being composed of complex number frequency harmonic oscillators each having internal degrees of freedom. For radiation in high-Tc superconductors, each oscillator of a fixed frequency undergoes internal vibrations. The calculation of the thermal radiation energy density for high-Tc superconductors with T < Tc therefore involves an integration over the internal degrees of freedom (the internal phase angles of the frequency). For these internal vibrations $\nu = \nu_c$ = constant and from equations (34) and (35) $\bar\xi = \xi_c \exp(j\theta_\xi)$ with $\xi_c$ = constant and

$$\xi_c = h\nu_c/(kT) \qquad d\bar\xi = j\bar\xi\, d\theta_\xi \qquad d\theta_\xi = d\theta_\nu \tag{46}$$

The integral in equation (40) for these conditions can then be written as

$$\bar I^n = j\xi_c^4\int_0^{-\pi/6} e^{j4\theta_\xi} e^{-n\bar\xi}\, d\theta_\xi = \xi_c^4\int_0^{-\pi/6} e^{-n\xi_c\cos\theta_\xi}(F_s + jF_c)\, d\theta_\xi = \xi_c^4(I_1^n + jI_2^n) \tag{47}$$

where

$$I_1^n = \int_0^{-\pi/6} e^{-n\xi_c\cos\theta_\xi} F_s\, d\theta_\xi \tag{48}$$

$$I_2^n = \int_0^{-\pi/6} e^{-n\xi_c\cos\theta_\xi} F_c\, d\theta_\xi \tag{49}$$

and where

$$F_s = -\sin(4\theta_\xi - n\xi_c\sin\theta_\xi) = \sin(4\theta_t - n\xi_c\sin\theta_t) = \cos(4\theta_\xi)\sin(n\xi_c\sin\theta_\xi) - \sin(4\theta_\xi)\cos(n\xi_c\sin\theta_\xi) \tag{50}$$

$$F_c = \cos(4\theta_\xi - n\xi_c\sin\theta_\xi) = \cos(4\theta_t - n\xi_c\sin\theta_t) = \cos(4\theta_\xi)\cos(n\xi_c\sin\theta_\xi) + \sin(4\theta_\xi)\sin(n\xi_c\sin\theta_\xi) \tag{51}$$

The limits of integration in equation (47) will be explained subsequently. The integrals in equations (48) and (49) can be rewritten as

$$I_1^n = G_1^n - G_2^n \tag{52}$$

$$I_2^n = G_3^n + G_4^n \tag{53}$$

where

$$G_1^n(\xi_c) = \int_0^{-\pi/6} e^{-n\xi_c\cos\theta_\xi}\cos(4\theta_\xi)\sin(n\xi_c\sin\theta_\xi)\, d\theta_\xi \tag{54}$$

$$G_2^n(\xi_c) = \int_0^{-\pi/6} e^{-n\xi_c\cos\theta_\xi}\sin(4\theta_\xi)\cos(n\xi_c\sin\theta_\xi)\, d\theta_\xi \tag{55}$$

$$G_3^n(\xi_c) = \int_0^{-\pi/6} e^{-n\xi_c\cos\theta_\xi}\cos(4\theta_\xi)\cos(n\xi_c\sin\theta_\xi)\, d\theta_\xi \tag{56}$$

$$G_4^n(\xi_c) = \int_0^{-\pi/6} e^{-n\xi_c\cos\theta_\xi}\sin(4\theta_\xi)\sin(n\xi_c\sin\theta_\xi)\, d\theta_\xi \tag{57}$$

Using the phase angle relationship given in equation (35) allows equations (54) through (57) to be written as

$$G_1^n(\xi_c) = \int_0^{\pi/6} e^{-n\xi_c\cos\theta_t}\cos(4\theta_t)\sin(n\xi_c\sin\theta_t)\, d\theta_t \tag{58}$$

$$G_2^n(\xi_c) = \int_0^{\pi/6} e^{-n\xi_c\cos\theta_t}\sin(4\theta_t)\cos(n\xi_c\sin\theta_t)\, d\theta_t \tag{59}$$

$$G_3^n(\xi_c) = -\int_0^{\pi/6} e^{-n\xi_c\cos\theta_t}\cos(4\theta_t)\cos(n\xi_c\sin\theta_t)\, d\theta_t \tag{60}$$

$$G_4^n(\xi_c) = -\int_0^{\pi/6} e^{-n\xi_c\cos\theta_t}\sin(4\theta_t)\sin(n\xi_c\sin\theta_t)\, d\theta_t \tag{61}$$

where $\xi_c$ is given by equation (46).

The limits of integration in equations (54) through (57) need some explanation. In the accompanying paper on electromagnetism and gravity it is shown that the broken symmetry form of Ohm's law gives the measured resistance of a body as
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{l  -  tan  0^  tan[2(0a  -  0t)])cos^ 


where $\alpha = x, y$ and $z$ specifies the orientation of the conductor, and where $R_{\alpha m}$ = measured resistance in the $\alpha$ direction, and $R_\alpha = W_{\alpha m}/I_{\alpha m}$ = conventionally calculated resistance in the $\alpha$ direction, where $W_{\alpha m}$ and $I_{\alpha m}$ = measured voltage and measured current respectively in the $\alpha$ direction. The high-Tc superconducting state is described by $R_{\alpha m} = 0$ due to the broken symmetry portion of equation (62), which occurs when $\theta_t + 2(\theta_\alpha - \theta_t) = \pi/2$. This condition combined with the free s-pair condition $\theta_\alpha = 2\theta_t$ gives the result $\theta_t = \pi/6$ and $\theta_\alpha = \pi/3$ for the coherent time high-Tc superconducting state which exists when T < Tc. In this scheme conventional superconductivity occurs when $R_\alpha = 0$. Conventional and high-Tc superconductors are distinct; for instance, experiments show that the normalized superconductivity energy gap for high-Tc superconductors is about twice the normalized energy gap for conventional superconductors.1


From equations (41), (46) and (47) it follows that the coherent thermal radiation energy density for high-Tc superconductors with T < Tc is given by

$$\bar E_r^{coh}(\nu_c, T) = \frac{8\pi h\nu_c^4}{c^3}\, \bar B_{coh}(\xi_c) \tag{63}$$

where

$$\bar B_{coh}(\xi_c) = \sum_{n=1}^{\infty}(I_1^n + jI_2^n) = \sum_{n=1}^{\infty}\left[G_1^n - G_2^n + j(G_3^n + G_4^n)\right] \tag{64}$$

The real and imaginary parts and magnitude of $\bar B_{coh}$ are given by

$$B_{cohR} = \sum_{n=1}^{\infty}(G_1^n - G_2^n) \tag{65}$$

$$B_{cohI} = \sum_{n=1}^{\infty}(G_3^n + G_4^n) \qquad B_{coh} = (B_{cohR}^2 + B_{cohI}^2)^{1/2} \tag{66}$$

The choice of the constant frequency $\nu_c$ is related to the superconducting transition temperature by

$$h\nu_c = kT_c \tag{67}$$

so that equations (35) and (67) give

$$\xi_c = T_c/T \tag{68}$$

and equation (63) can be rewritten as

$$\bar E_r^{coh}(T_c, T) = \frac{15\sigma}{\pi^4}\, T_c^4\, \bar B_{coh}(T_c/T) \tag{69}$$

which gives the coherent-time portion of the thermal radiation energy density that is associated with the Cooper pairs of electrons and holes.


The total thermal radiation energy density emitted by a high-Tc superconductor for T < Tc is the weighted average of the incoherent-time radiation energy density from the solid lattice of the copper oxide superconductor and the coherent-time radiation energy density from the condensate of Cooper electron (hole) pairs and therefore

$$\bar E_r = a_{inc}E_r^{inc} + a_{coh}\bar E_r^{coh} = \sigma T^4\left[a_{inc} + a_{coh}\frac{15}{\pi^4}\left(\frac{T_c}{T}\right)^4 \bar B_{coh}(T_c/T)\right] \tag{70}$$

where $a_{inc}$ = particle number density fraction of lattice atoms (ions) and where $a_{coh}$ = particle number density fraction of electron (hole) pairs, which satisfy the following relations

$$a_{coh}/a_{inc} = n_p = n_e/2 \qquad a_{inc} + a_{coh} = 1 \tag{71}$$

$$a_{inc} = (1 + n_p)^{-1} = 2(2 + n_e)^{-1} \tag{72}$$

$$a_{coh} = n_p(1 + n_p)^{-1} = n_e(2 + n_e)^{-1} \tag{73}$$

where $n_p$ = average number of electron (hole) pairs per lattice site, and $n_e$ = average number of electrons (holes) per lattice site. Therefore the radiant energy density for a high-Tc superconductor with T < Tc is the weighted average of a blackbody term associated with the lattice of atoms and ions, and a non-blackbody term associated with the coherent time state of the electron (hole) pair fluid. From equation (70) it follows that

$$E_{rR} = a_{inc}\sigma T^4 + a_{coh}\frac{15\sigma}{\pi^4}T_c^4 B_{cohR}(T_c/T) \tag{74}$$

$$E_{rI} = a_{coh}\frac{15\sigma}{\pi^4}T_c^4 B_{cohI}(T_c/T) \tag{75}$$

$$E_r = (E_{rR}^2 + E_{rI}^2)^{1/2} \tag{76}$$

$$\tan\theta_{Er} = E_{rI}/E_{rR} \tag{77}$$

The measured value is the real part of the thermal radiation density given in equation (74).

Equation (70) is valid for a blackbody high-Tc superconductor. For real copper oxide materials a surface emissivity factor $\epsilon$ needs to be introduced into equations (70) through (77). Thus equation (74) becomes for T < Tc

$$E_{rR} = \epsilon\sigma T^4\left[a_{inc} + a_{coh}\frac{15}{\pi^4}\left(\frac{T_c}{T}\right)^4 B_{cohR}(T_c/T)\right] = \epsilon_{eff}\,\sigma T^4 \tag{78}$$

where the emissivity $\epsilon$ may be temperature dependent, and where the effective emissivity for high-Tc superconductors with T < Tc is given by

$$\epsilon_{eff} = \epsilon\left[a_{inc} + a_{coh}\frac{15}{\pi^4}\left(\frac{T_c}{T}\right)^4 B_{cohR}(T_c/T)\right] \tag{79}$$

For matter in an incoherent time state the thermal radiation energy density is given by

$$E_r^{inc} = \epsilon\sigma T^4 \tag{80}$$

which is also the case for high-Tc copper oxides with T > Tc. Therefore from equations (78) and (80) it follows that at the transition temperature T = Tc there is a sudden drop in the thermal radiation energy density as the temperature is lowered across the transition temperature given by

$$\Delta E_r = \left[E_{rR} - E_r^{inc}\right]_{T_c} = -\epsilon\sigma T_c^4\left[1 - a_{inc} - a_{coh}\frac{15}{\pi^4}B_{cohR}(1)\right] \tag{81}$$

where from equations (58), (59) and (65) $B_{cohR}(1) < 0$. Equivalently, at the transition temperature there is an abrupt drop in the value of the emissivity given by

$$\Delta\epsilon = \left[\epsilon_{eff} - \epsilon\right]_{T_c} = -\epsilon\left[1 - a_{inc} - a_{coh}\frac{15}{\pi^4}B_{cohR}(1)\right] \tag{82}$$

For T < Tc equations (78) and (79) show that the thermal radiation energy density and emissivity are given approximately by

$$E_{rR} \sim a_{inc}\,\epsilon\sigma T^4 \tag{83}$$

$$\epsilon_{eff} \sim a_{inc}\,\epsilon \tag{84}$$

because the exponential functions in $B_{cohR}(\xi_c)$ attenuate the factor $\xi_c^4 B_{cohR}(\xi_c)$ rapidly for T < Tc. The value of the emissivity approaches the value $a_{inc}\,\epsilon(0)$ for T → 0.
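The coherent factor of equation (64) and the effective emissivity of equation (79) can be evaluated numerically. The sketch below is one possible way to do so, not the author's procedure: it assumes Simpson quadrature, a truncation of the series at a fixed number of terms, and treats the pair density n_p as a free input. The helper names are hypothetical. As a consistency check it also integrates the unexpanded integrand of equation (37) along the arc from 0 to -π/6, which should agree with the series.

```python
import cmath
import math

THETA_MAX = math.pi / 6.0   # coherent-state phase angle of time, theta_t = pi/6


def simpson(f, a, b, n=400):
    """Composite Simpson rule with n (even) subintervals; f may be complex-valued."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0


def g_integrals(n, xi_c):
    """G_1^n .. G_4^n in the theta_t form of equations (58)-(61)."""
    def damp(t):
        return math.exp(-n * xi_c * math.cos(t))

    def arg(t):
        return n * xi_c * math.sin(t)

    g1 = simpson(lambda t: damp(t) * math.cos(4 * t) * math.sin(arg(t)), 0.0, THETA_MAX)
    g2 = simpson(lambda t: damp(t) * math.sin(4 * t) * math.cos(arg(t)), 0.0, THETA_MAX)
    g3 = -simpson(lambda t: damp(t) * math.cos(4 * t) * math.cos(arg(t)), 0.0, THETA_MAX)
    g4 = -simpson(lambda t: damp(t) * math.sin(4 * t) * math.sin(arg(t)), 0.0, THETA_MAX)
    return g1, g2, g3, g4


def b_coh_series(xi_c, n_terms=60):
    """Complex B_coh(xi_c) from the series of equation (64), truncated at n_terms."""
    total = 0j
    for n in range(1, n_terms + 1):
        g1, g2, g3, g4 = g_integrals(n, xi_c)
        total += complex(g1 - g2, g3 + g4)
    return total


def b_coh_direct(xi_c):
    """Same quantity without the series: j * integral over theta_xi from 0 to -pi/6
    of e^{4j*theta} / (exp(xi_c * e^{j*theta}) - 1)."""
    def integrand(theta):
        return cmath.exp(4j * theta) / (cmath.exp(xi_c * cmath.exp(1j * theta)) - 1.0)

    return 1j * simpson(integrand, 0.0, -THETA_MAX)


def emissivity_ratio(t_over_tc, n_p):
    """eps_eff / eps from equation (79); n_p (pairs per lattice site) is an assumed input."""
    a_inc = 1.0 / (1.0 + n_p)          # equation (72)
    a_coh = n_p / (1.0 + n_p)          # equation (73)
    xi_c = 1.0 / t_over_tc             # equation (68)
    return a_inc + a_coh * (15.0 / math.pi ** 4) * xi_c ** 4 * b_coh_series(xi_c).real


if __name__ == "__main__":
    print(b_coh_series(1.0), b_coh_direct(1.0))
    print(emissivity_ratio(0.5, 0.1))
```

Agreement between the two routines is a check on the signs in equations (50) through (61); the sign and size of the real part at T = Tc can then be read off from the printed value rather than assumed.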


3. CONCLUSION. It has been suggested that a high-Tc superconductor with T < Tc is composed of two states of matter: a) a time-incoherent lattice of atoms and ions, and b) a fluid of Cooper pairs of electrons (or holes) which in this paper is assumed to be in a coherent-time state. The thermal radiation from such a two-state system is itself separable into a universal incoherent blackbody component arising from the atomic and ionic lattice, and a coherent non-blackbody radiation component which arises from the time-coherent electron or hole pairs. The coherent non-blackbody radiation component is not universal because it depends on the particular transition temperatures of the planar copper oxide compounds. This coherent thermal energy density is determined by integrating the broken symmetry Planck spectrum over the internal phase angles of time. Thus gauge rotated or circular time describes the internal phase motion of oscillators in a coherent time state. A measurement of the non-blackbody thermal radiation component for T < Tc in the planar copper oxides would represent proof that high-Tc superconductivity is described by a coherent time state of matter and is completely different from conventional BCS superconductivity.
ABSTRACT.

The aim of this note is to describe briefly some recent developments in the numerical analysis of the finite element method applied to partial integro-differential equations.


1. INTRODUCTION.

Consider the following linear partial integro-differential problem of "parabolic" type for u = u(t,x),

$$\begin{aligned} u_t - \Delta u &= \int_0^t B(t,s)u(s)\,ds + f(t,x), \quad t > 0,\ x \in \Omega, \\ u(t) &= 0 \ \text{on } \partial\Omega, \\ u(0) &= v \ \text{given}. \end{aligned} \tag{1.1}$$

Here $\Omega$ is a bounded domain in Euclidean space with sufficiently smooth boundary, $\Delta$ is the Laplace operator (for simplicity) and B(t,s) is a second order (at most) partial differential operator in the spatial variables, with smooth coefficients for now. As a general reference to this and similar problems we give RENARDY, HRUSA and NOHEL [8].
Let $S_h$, 0 < h < 1, be a family of finite element spaces on $\Omega$ of the "usual" kind in $H_0^1(\Omega)$. Using the weak formulation of (1.1) one seeks the semidiscrete (time-continuous) approximation $u_h(t)$ in $S_h$ by the formula

$$\begin{aligned} (u_{h,t}, \chi) + D(u_h, \chi) &= \int_0^t B(t,s; u_h(s), \chi)\,ds + (f, \chi), \quad \text{for } \chi \in S_h, \\ u_h(0) &= v_h \in S_h. \end{aligned} \tag{1.2}$$

Here D(v,w) is the Dirichlet form $(\nabla v, \nabla w)$ with $(v,w) = \int_\Omega v\,w\,dx$ the $L_2$-inner product. B(t,s;v,w) denotes the bilinear form on $H_0^1 \times H_0^1$ obtained in the natural way from the partial differential operator B(t,s) via integration by parts, if necessary.

Numerical  solution  of  the  problem  (1.1)  via  finite  differences  goes  back  to 
DOUGLAS  and  JONES  [3].  In  the  present  finite  element  context  a  seminal  paper  is  that 
of  YANIK  and  FAIRWEATHER  [12],  cf.  GREENWELL  [5].  The  techniques  used  there 
treated  the  case  when  the  partial  differential  operator  is  of  at  most  first  order. 

THOMÉE and ZHANG [10] adapted the techniques of WHEELER [11] from the parabolic case, B = 0. With $R_hu$ the (fictitious) Ritz projection into $S_h$ defined by

$$D(R_hu - u, \chi) = 0, \quad \text{for } \chi \in S_h, \tag{1.3}$$

one proceeds to write an equation for $\theta = u_h - R_hu$. (One knows a lot about the error in $R_hu - u$.) The equation for $\theta$ turns out to be fairly complicated and far from easy to handle, but Thomée and Zhang managed. To describe a typical result, assume that the finite element spaces $S_h$ have optimal approximation order r, r ≥ 2 integer, i.e.,

$$\min_{\chi \in S_h} \|v - \chi\|_{L_2(\Omega)} \le C h^r \|v\|_{W_2^r(\Omega)} \tag{1.4}$$

for $v \in W_2^r \cap H_0^1$. Here r-1 can be thought of as the basic piecewise polynomial degree of the finite element shape functions and h as the diameter of a typical element. Then, assuming that the solution u of (1.1) is smooth enough, it is shown in [10] that

$$\|(u_h - u)(t)\|_{L_2(\Omega)} \le C(T)\,h^r, \quad \text{for } 0 \le t \le T. \tag{1.5}$$

I.e., we have an error estimate in $L_2$ of optimal order.

2.  RITZ-VOLTERRA  PROJECTIONS. 

CANNON  and  LIN  [1]  proposed  an  alternative  mode  of  analysis  to  that  in  [10].  An 
operator $V_h(t)$ into $S_h$ is now introduced via the prescription

(2.1)  $D(V_h(t)u - u(t), \chi) = \int_0^t B(t,s; V_h(s)u - u(s), \chi)\,ds$, for $\chi \in S_h$.

It turns out that the equation for $u_h - V_h u$ is more manageable than that for $u_h - R_h u$.
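A short derivation sketch may make this plausible. It is not taken from [1]; it only assumes that (1.1) has the weak form $(u_t,\chi) + D(u,\chi) = \int_0^t B(t,s;u(s),\chi)\,ds + (f,\chi)$ and that the Ritz-Volterra projection satisfies $D(V_h u - u, \chi) = \int_0^t B(t,s; V_h u(s) - u(s), \chi)\,ds$. The names $\theta$ and $\eta$ are introduced here for illustration only.

```latex
% Error splitting: u_h - u = \theta + \eta, with
%   \theta = u_h - V_h u \in S_h,   \eta = V_h u - u.
% Subtracting the weak form of (1.1) from (1.2) gives, for \chi \in S_h,
\[
(\theta_t + \eta_t, \chi) + D(\theta + \eta, \chi)
  = \int_0^t B(t,s;\theta(s) + \eta(s), \chi)\,ds .
\]
% By the defining relation of V_h, the D-term and the B-term in \eta cancel, leaving
\[
(\theta_t, \chi) + D(\theta, \chi)
  = \int_0^t B(t,s;\theta(s), \chi)\,ds - (\eta_t, \chi),
  \qquad \chi \in S_h .
\]
```

With the Ritz projection $R_h$ in place of $V_h$ only the $D$-term cancels, so a memory term involving $R_h u - u$ would remain on the right-hand side.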

Of course, some more work now has to be done in analyzing the error in $V_h u - u$, and
typically this is split as $V_h u - u = (V_h u - R_h u) + (R_h u - u)$ and an equation for the first
part on the right is derived and analyzed. It is fair to say that the introduction of $V_h$
splits the analysis into more "balanced" and manageable parts than merely
considering $R_h$.

The technique just outlined is applied, e.g., to deriving optimal order pointwise error
estimates in LIN, THOMEE and WAHLBIN [7]. (This paper also introduced the name
Ritz-Volterra projection for $V_h$; the original paper [1] called it a "nonclassical $H^1$"
projection.)

There are few changes in applying the technique to a "hyperbolic"
integro-differential equation,

(2.2)  $u_{tt} - \Delta u = \int_0^t B(t,s)u(s)\,ds + f$,
       $u(t) = 0$ on $\partial\Omega$,
       $u(0) = v$, $u_t(0) = w$.

Again the introduction of the Ritz-Volterra projection cuts the analysis into balanced
parts.

The Ritz-Volterra projection can in some cases in itself be viewed as the finite
element solution of a Sobolev (also known as pseudoparabolic) equation. For, assume that
$B(t,s; \cdot, \cdot)$ in (2.1) actually does not depend on $t$. Differentiating (2.1) with respect to $t$,

(2.3)  $D((V_h u)_t - u_t, \chi) = B(t; V_h u - u, \chi)$, for $\chi \in S_h$.

In this way we see the connection with finite element approximation of the typical Sobolev
equation

(2.4)  $\Delta u_t = B(t)u$, $t > 0$, $x \in \Omega$,
       $u(t) = 0$ on $\partial\Omega$,
       $u(0) = v$.

For  an  early  analysis  of  finite  element  methods  in  such  problems,  see  FORD  [6]. 

In CHEN, THOMEE and WAHLBIN [2] the Ritz-Volterra technique is applied to
a problem with a singular kernel,

(2.5)  $u_t - \Delta u = \int_0^t K(t-s)\,B u(s)\,ds + f$,

where $|K(t)| \sim t^{-\alpha}$, $0 < \alpha < 1$. Results matching provable regularity for the solution $u$ are
derived.

3.  DISCRETIZATION  IN  TIME. 

So  far  we  have  been  considering  the  semidiscrete  (time-continuous)  approximation 
given  by  the  system  of  Volterra  ordinary  integro— differential  equations  (1.2).  This  is 
clearly an intermediate step in the analysis since that system needs to be further
discretized  in  time  to  arrive  at  a  practical  method. 

To outline some issues, let us assume that we have in mind as a "basic"
time-stepping method the backward Euler method (for simplicity in writing). Then write,
as a preliminary step, with $U^n \approx u_h(nk)$, $k$ the (uniform) timestep,

(3.1)  $\left(\frac{U^n - U^{n-1}}{k}, \chi\right) + D(U^n, \chi) = \int_0^{nk} B(nk,s; u_h(s), \chi)\,ds + (f(nk), \chi)$, for $\chi \in S_h$.

The integral in time on the right needs to be further discretized. It seems natural to let
that discretization involve only time levels $t_j = jk$. However, note that in general all time
levels used in the quadrature of the integral need to be stored. With each time level
involving perhaps $10^5$ - $10^6$ degrees of freedom (in many space dimensions) it may be
prudent not to involve all previous levels. Rather, use a quadrature rule which is of higher
order than the basic backward Euler rule. This being first order, $O(k)$, use, as an example,
Simpson's rule for the quadrature of the integral. This in turn being second order, $O(k_1^2)$ in
the step $k_1$ used, it is natural to try to match $k \approx k_1^2$. Of course, if $nk$ is not an integral
multiple of $k_1$, something special has to be done for a short ($< k_1$) last segment of the
integral.

It is easy to see that the storage requirement for such a combination is $O(k^{-1/2})$,
as opposed to $O(k^{-1})$ if all previous time levels were used in the quadrature.
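The storage argument can be illustrated on a scalar model problem. The sketch below is an assumption-laden toy, not the finite element method of the text: it applies backward Euler to $u' + \lambda u = \int_0^t K(t,s)u(s)\,ds + f(t)$ with a hypothetical kernel and data chosen so that $u(t) = e^{-t}$, and it swaps in the composite trapezoidal rule (which is second order in its step $k_1$) on a coarse grid with $k_1 \approx k^{1/2}$, plus a left rectangle rule on the short last segment. The function name `solve_sparse_memory` and all parameters are illustrative.

```python
import math

def solve_sparse_memory(k, T=1.0, lam=2.0):
    """Backward Euler for u' + lam*u = int_0^t K(t,s) u(s) ds + f(t), u(0) = 1.

    Hypothetical test problem (not from the text): K(t,s) = exp(-2(t-s)),
    f(t) = exp(-2t), lam = 2, for which the exact solution is u(t) = exp(-t).
    """
    kernel = lambda t, s: math.exp(-2.0 * (t - s))
    f = lambda t: math.exp(-2.0 * t)

    n_steps = round(T / k)
    m = max(1, round(k ** -0.5))   # coarse stride, so k1 = m*k is about sqrt(k)
    k1 = m * k

    coarse = [1.0]   # U^0, U^m, U^{2m}, ... kept for the trapezoidal rule
    tail = [1.0]     # fine-grid values since (and including) the last coarse level
    u = 1.0
    max_stored = len(coarse) + len(tail)

    for n in range(1, n_steps + 1):
        t_n = n * k
        q = len(coarse) - 1          # last coarse level has time index q*m <= n-1
        quad = 0.0
        if q >= 1:
            # Composite trapezoidal rule with step k1 on [0, q*k1].
            for i, val in enumerate(coarse):
                w = 0.5 if i == 0 or i == q else 1.0
                quad += w * k1 * kernel(t_n, i * k1) * val
        # Left rectangle rule with step k on the last segment [q*k1, t_n].
        for j, val in enumerate(tail):
            quad += k * kernel(t_n, (q * m + j) * k) * val
        # Backward Euler step; the memory term uses already computed values only.
        u = (u + k * (quad + f(t_n))) / (1.0 + k * lam)
        if n % m == 0:
            coarse.append(u)
            tail = [u]
        else:
            tail.append(u)
        max_stored = max(max_stored, len(coarse) + len(tail))

    return u, max_stored
```

Calling `solve_sparse_memory(1e-3)` returns the approximation at $T = 1$ together with the largest number of past values held at any time; with $k = 10^{-3}$ this count stays of the order of $2k^{-1/2}$ rather than $k^{-1}$.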

The hard part is now to show that the resulting method is of $O(h^r + k)$ accuracy, cf.
SLOAN and THOMEE [9].

Various  other  combinations  of  basic  time-stepping  methods  and  "thinned" 
quadrature  formulae  are  given  in  ZHANG  [13]. 

In [2] the quadrature of the singular kernel in (2.5) is accomplished by a product
integration rule.

In the special case of $B(t,s) = \Delta$ in (1.1), differentiation with respect to time leads
to the strongly damped wave equation,

(3.2)  $u_{tt} - \Delta u_t = \Delta u + f$.

A study of finite element approximations for this can be found in LARSSON, THOMEE
and  WAHLBIN  [6].  An  interesting  point  is  that  while  the  corresponding  semigroup  is 
analytic  in  time,  it  is  only  mildly  smoothing  in  space  (and  then  only  with  respect  to 
compatibility  conditions,  not  with  respect  to  regularity).  The  consequences  of  this  for 
space  and  time  discretizations  are  elucidated. 

4.  CONCLUSION. 

The partial integro-differential equation (1.1), although called of "parabolic" type
by "analogy", differs in several respects from the heat equation, $B = 0$. A satisfactory
mathematical  theory  for  its  numerical  approximation  has  to  proceed  along  lines  specific  to 
such  equations.  The  introduction  of  Ritz— Volterra  projections  as  an  intermediate  step  in 
the  analysis  appears  promising. 

In  practice,  questions  of  storage  limitations  will  have  to  be  addressed  by  the  theory 
for  numerical  solution  of  partial  integro-differential  equations. 
On Solving Cauchy Singular Integral Equations
by  Using  General  Quadrature-Collocation  Nodes 

R.  P.  Srivastav  and  Fenggang  Zhang 

Department  of  Applied  Mathematics  and  Statistics, 

SUNY  at  Stony  Brook.  Stony  Brook,  New  York  11794 

Abstract. We show that special nodes are not necessary for solving Cauchy singular integral equations
by quadrature-collocation methods. The solvability of the discrete system is proved for an arbitrary
selection of quadrature and collocation nodes. We also propose several special choices of these nodes.
In particular, a weighted minimum norm-least square method is discussed.


1.  Introduction 

The classical theory for solving Cauchy Singular Integral Equations (CSIE) is based on the
properties of sectionally holomorphic functions, which enable us to reduce the singular equation to a
Fredholm equation of the second kind. (See N. I. Muskhelishvili [1] and F. D. Gakhov [2].) In
many physical problems, when numerical solution is necessary, direct methods are often preferable.
A method is called direct if the singular integral is replaced by a numerical approximation without
resorting to regularization. Such methods, initiated by F. Erdogan [3] and F. Erdogan and G. D.
Gupta [4], have been developed subsequently by F. Erdogan, G. D. Gupta and T. S. Cook [5], and P. S.
Theocaris and N. I. Ioakimidis [8].

Suppose  the  CSIE  has  the  form 

(1.1)  $a(x)g(x) + \frac{b(x)}{\pi}\int_{-1}^{1}\frac{g(t)}{t-x}\,dt + \int_{-1}^{1} k(t,x)g(t)\,dt = f(x)$, $-1 < x < 1$.

All these numerical methods use Gauss-type formulae after expressing the unknown singular function
as the product of a weight function $(1-x)^{\alpha-1}(1+x)^{\beta-1}$ and a smooth function to be computed; $\alpha$
and $\beta$ are determined by Noether's index theorems.

Let $P_n^{(\alpha,\beta)}(x)$ be the Jacobi polynomial of degree $n$ orthogonal with respect to the weight function
$(1-x)^{\alpha-1}(1+x)^{\beta-1}$. The zeros of $P_n^{(\alpha,\beta)}(x)$ are used as quadrature nodes in the Gauss-Jacobi
integral formula and the zeros of another related Jacobi polynomial are used as the corresponding
collocation points. For example, when $\alpha = \beta = \frac{1}{2}$ the Jacobi polynomial is $T_n(x)$, the
Chebyshev polynomial of the first kind, and the related polynomial is $U_{n-1}(x)$, the Chebyshev
polynomial of the second kind.

There are three factors which influence the size of the error in the computed solution: (a)
accuracy of the quadrature formula, (b) choice of the collocation nodes, (c) "condition" of the linear
algebraic system. The rate of convergence depends on the "Lebesgue constants" of the collocation
and quadrature nodes, and the smoothness of the functions. Certain sets of quadrature-collocation
nodes are inadequate to represent the intrinsic features of the problems, especially of the problems
arising from the oscillatory behavior of $f(x)$ or the kernel $k(t,x)$, or the problems due to large
derivatives of these functions. For example, for the function


$f(x) = \frac{e^x}{(x-c)^2 + a^2}$


Research supported in part by NSF Grant No. DMS-8901880 and US Army Research Office Grant No. DAAL03-90-G-0019




for  very  small  values  of  a,  a  collocation  point  in  the  immediate  vicinity  of  c  is  essential.  As  discussed 
by  Gerasoulis  and  Srivastav  in  [7],  the  methods  based  on  orthogonal  polynomials  will  require  the 
quadrature  rules  of  an  excessively  high  degree,  leading  to  extremely  large  systems  of  linear  algebraic 
equations. 

The paper by Tsamasphyros and Theocaris [11] appears to be the first to ask the rhetorical question
"Are special collocation nodes necessary for the numerical solution of singular integral equations?".
Examples are given there showing that they are not, for Gauss-Chebyshev quadrature and
collocation. Our objective in this paper is to analyse the linear algebraic systems for solvability. The
approximation characteristics of the computed solution will be discussed in a subsequent paper.

In order to make the paper self-contained, some well-known results are included here. We organise
the sections as follows:

Section 2 develops the theory of two related sequences of orthogonal polynomials. Similar results can be
found in S. Welstead's Ph.D. thesis [12].

In  Section  3  we  prove  the  solvability  of  the  system  of  equations  derived  from  general  quadrature- 
collocation  nodes. 

Section 4 discusses several special choices of these nodes.

The  paper  is  concluded  with  a  numerical  example  in  section  5. 

2. Two Related Sequences of Orthogonal Polynomials

Consider the dominant part of the Cauchy singular integral operator $U$:

(2.1)  $U g(x) := a\,g(x) + \frac{b}{\pi}\int_{-1}^{1}\frac{g(t)}{t-x}\,dt$,

where $a$ and $b$ are assumed constants and $|a| \ne |b|$. The general case of variable $a(x)$ and $b(x)$
is discussed by Welstead in [12]. Here, we only recall some related results.

The operator $U$ is defined on the space of Hölder continuous functions. Using sectionally
holomorphic functions, we introduce

$\Phi(z) = \frac{1}{2\pi i}\int_{-1}^{1}\frac{g(t)}{t-z}\,dt$,

where $z = x + iy$, $x$ and $y$ are real numbers. Let

$\Phi^+(x) = \lim_{y \to 0+} \Phi(x + iy)$,

$\Phi^-(x) = \lim_{y \to 0-} \Phi(x + iy)$.

We have

$g(x) = \Phi^+(x) - \Phi^-(x)$,

$\frac{1}{\pi i}\int_{-1}^{1}\frac{g(t)}{t-x}\,dt = \Phi^+(x) + \Phi^-(x)$

and

$U g(x) = (a + ib)\Phi^+(x) - (a - ib)\Phi^-(x)$.


Denote

$G = \frac{a - ib}{a + ib} = \frac{a^2 - b^2 - i2ab}{a^2 + b^2} = e^{-i2\theta}$;

we have

$|G| = 1$, $\tan\theta = \frac{b}{a}$, $0 < |\theta| < \frac{\pi}{2}$, $|\theta| \ne \frac{\pi}{4}$.

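Let

$\Gamma(x) = \frac{\theta}{\pi}\log\frac{1+x}{1-x}$,

$Z(x) = (1-x)^{\lambda_1}(1+x)^{\lambda_{-1}} e^{\Gamma(x)} = (1-x)^{\lambda_1 - \theta/\pi}(1+x)^{\lambda_{-1} + \theta/\pi}$.

If $a > 0$ and $b > 0$, then $0 < \theta/\pi < 1/2$. $\lambda_1$ and $\lambda_{-1}$ can be chosen as follows so that both $Z(x)$ and
$Z^{-1}(x)$ are integrable on $(-1, 1)$:

(1) $\lambda_1 = 0$ and $\lambda_{-1} = -1$;

(2) $\lambda_1 = 0$ and $\lambda_{-1} = 0$, or $\lambda_1 = 1$ and $\lambda_{-1} = -1$;

(3) $\lambda_1 = 1$ and $\lambda_{-1} = 0$.

The index of the operator $U$ is defined as $\kappa = -(\lambda_1 + \lambda_{-1})$.

In the case of index 1,

(2.2)  $Z(x) = (1-x)^{-\theta/\pi}(1+x)^{-1+\theta/\pi} = \frac{1}{\sqrt{1-x^2}}\left(\frac{1-x}{1+x}\right)^{1/2-\theta/\pi}$.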

Let $\{p_n(x)\}_0^\infty$ be a sequence of monic orthogonal polynomials with respect to the weight function
$Z(x)$ on the interval $(-1, 1)$, i.e.

(2.3)  $\int_{-1}^{1} Z(x)p_n(x)p_m(x)\,dx = c_n\delta_{nm}$,

where $c_n$ is a constant and $\delta_{nm}$ is the Kronecker delta. According to the general properties of
orthogonal polynomials, there is a recurrence formula relating every three consecutive polynomials:

$p_0(x) = 1$,

$p_1(x) = (x - \alpha_0)p_0(x)$,

$p_2(x) = (x - \alpha_1)p_1(x) - \beta_1 p_0(x)$,

$\cdots$

$p_{n+1}(x) = (x - \alpha_n)p_n(x) - \beta_n p_{n-1}(x)$,

where $\{\alpha_k\}_0^\infty$ and $\{\beta_k\}_1^\infty$ are two sequences of numbers. Now, we can construct a new sequence of
monic polynomials $\{q_n(x)\}_0^\infty$ by using the shifted sequences $\{\alpha_n\}_1^\infty$ and $\{\beta_n\}_2^\infty$:

$q_0(x) = 1$,

$q_1(x) = (x - \alpha_1)q_0(x)$,

$q_2(x) = (x - \alpha_2)q_1(x) - \beta_2 q_0(x)$,

$\cdots$

$q_{n+1}(x) = (x - \alpha_{n+1})q_n(x) - \beta_{n+1}q_{n-1}(x)$.

Our main results are

Proposition 2.1.

$U(Z(x)p_0(x)) = 0$,

$U(Z(x)p_k(x)) = \rho\,q_{k-1}(x)$, for $k = 1, 2, \ldots$,

where

(2.4)  $\rho = \sqrt{a^2 + b^2}$.

Proposition 2.2. $\{q_n(x)\}_0^\infty$ is a sequence of orthogonal polynomials with respect to the weight
function $Z^{-1}(x)$ on $(-1, 1)$.

Consider the adjoint operator $V$ of $U$:

(2.5)  $V g(x) := a\,g(x) - \frac{b}{\pi}\int_{-1}^{1}\frac{g(t)}{t-x}\,dt$.

We have a result for $V$ similar to Proposition 2.1.

Proposition 2.3. For $k = 1, 2, \ldots$

$V\!\left(\frac{q_{k-1}(x)}{Z(x)}\right) = \rho\,p_k(x)$.
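As a sanity check, the statements above can be specialised to $a = 0$, $b = 1$ with index 1, the limiting case $\theta = \pi/2$. The block below is only a worked illustration using classical Chebyshev identities; it assumes that Proposition 2.1 reads $U(Z p_k) = \rho\,q_{k-1}$ and Proposition 2.3 reads $V(q_{k-1}/Z) = \rho\,p_k$, and all integrals are principal values.

```latex
% Special case a = 0, b = 1, index 1: \theta = \pi/2, \rho = 1.
\[
Z(x) = (1-x^2)^{-1/2}, \qquad
p_0 = 1,\quad p_k = 2^{1-k}\,T_k \ (k \ge 1), \qquad
q_k = 2^{-k}\,U_k \ (k \ge 0).
\]
% Recurrence coefficients of the monic polynomials p_k:
\[
\alpha_n = 0, \qquad \beta_1 = \tfrac12, \qquad \beta_n = \tfrac14 \ (n \ge 2),
\]
% so the shifted recurrence q_{n+1} = x q_n - \tfrac14 q_{n-1}
% is exactly that of the monic second-kind polynomials.
% Propositions 2.1 and 2.3 then reduce to
\[
\frac{1}{\pi}\int_{-1}^{1} \frac{T_k(t)}{\sqrt{1-t^2}\,(t-x)}\,dt = U_{k-1}(x), \qquad
-\frac{1}{\pi}\int_{-1}^{1} \frac{\sqrt{1-t^2}\;U_{k-1}(t)}{t-x}\,dt = T_k(x),
\qquad k \ge 1 .
\]
```

For instance $p_2 = x^2 - \tfrac12$ and $q_2 = x^2 - \tfrac14$, which agree with $T_2/2$ and $U_2/4$.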

3. The Discrete Scheme of General Quadrature Nodes and Collocation Points

For the problem with index 1, we shall find the solution $g(x)$ satisfying

(3.1)  $a\,g(x) + \frac{b}{\pi}\int_{-1}^{1}\frac{g(t)}{t-x}\,dt = f(x)$,
       $\frac{1}{\pi}\int_{-1}^{1} g(x)\,dx = c$.

Let $g(x) = Z(x)\phi(x)$. We choose $x_1 < x_2 < \cdots < x_n$ as quadrature nodes and $y_1, y_2, \ldots, y_{n-1}$ as
collocation points. The $\{x_j\}_1^n$ and the $\{y_k\}_1^{n-1}$ are independent of the weight function $Z(x)$.
Denote

(3.2)  $X(x) = \prod_{j=1}^{n}(x - x_j)$,

(3.3)  $Y(y) = \prod_{k=1}^{n-1}(y - y_k)$.

The quadrature coefficients are defined as

(3.4)  $W_j = \int_{-1}^{1}\frac{Z(x)X(x)\,dx}{(x - x_j)X'(x_j)}$,

(3.5)  $V_k = \int_{-1}^{1}\frac{Y(y)\,dy}{Z(y)(y - y_k)Y'(y_k)}$.
We can construct the system of equations as follows:

(3.6)  $A\phi = f$,

where

$\phi = (\phi(x_1), \phi(x_2), \ldots, \phi(x_n))^T$,

$f = (f(y_1), f(y_2), \ldots, f(y_{n-1}), c)^T$

and $A = (a_{kj})_{n \times n}$,

(3.7)  $a_{kj} = \frac{1}{x_j - y_k}\left(\frac{b}{\pi}W_j - \frac{1}{X'(x_j)}\,U(Z(x)X(x))\big|_{x=y_k}\right)$, for $1 \le k \le n-1$, $1 \le j \le n$,
       $a_{nj} = \frac{1}{\pi}W_j$, for $1 \le j \le n$.

We can also construct the corresponding matrix $B$, $B = (b_{kj})_{n \times n}$,

(3.8)  $b_{kj} = \frac{1}{x_k - y_j}\left(\frac{b}{\pi}V_j + \frac{1}{Y'(y_j)}\,V\!\left(\frac{Y(y)}{Z(y)}\right)\Big|_{y=x_k}\right)$, for $1 \le k \le n$, $1 \le j \le n-1$,
       $b_{kn} = b\rho$, for $1 \le k \le n$.

In order to prove the solvability of (3.6) and to find the inverse of (3.7), we first introduce two general
lemmas.

Lemma 3.1. If $A = (a_{kj})$ is an $n \times n$ matrix satisfying

$\sum_{j=1}^{n} a_{kj}\phi(x_j) = U(Z(x)\phi(x))\big|_{x=y_k}$, $k = 1, 2, \ldots, n-1$,

$\sum_{j=1}^{n} a_{nj}\phi(x_j) = \frac{1}{\pi}\int_{-1}^{1} Z(x)\phi(x)\,dx$

for any polynomial $\phi(x)$ of degree $\le n-1$, then $A$ is invertible.

Lemma 3.2. If $B = (b_{kj})$ is an $n \times n$ matrix satisfying

$\sum_{j=1}^{n-1} b_{kj}\phi(y_j) = V\!\left(\frac{\phi(y)}{Z(y)}\right)\Big|_{y=x_k}$, $k = 1, 2, \ldots, n$,

for any polynomial $\phi(y)$ of degree $\le n-2$ and $A$ is an $n \times n$ matrix satisfying the conditions of
Lemma 3.1, then

$AB = \rho^2 I$.

That is,

$A^{-1} = \frac{1}{\rho^2}B$.

Next, we need to check that the matrix of (3.7) satisfies the conditions of Lemma 3.1 and the
matrix of (3.8) satisfies the conditions of Lemma 3.2.

Proposition 3.3. The matrix $A$ in (3.7) satisfies the conditions of Lemma 3.1. $A$ is invertible.

Proposition 3.4. The matrix $B$ in (3.8) satisfies the conditions of Lemma 3.2. The inverse of the
matrix $A$ in (3.7) has the closed form $\frac{1}{\rho^2}B$.
4. Several Special Choices of the Quadrature and Collocation Nodes

Sometimes we need a simpler form of the system of equations, or we require a more accurate
quadrature rule, or we want the two groups of nodes to be the same. We can make various
choices of the quadrature and the collocation nodes. In this section we provide four different
choices of these nodes.

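1. Gauss-Jacobi Scheme. Let $\{x_1, x_2, \ldots, x_n\} = \{\xi_1, \xi_2, \ldots, \xi_n\}$ and $\{y_1, y_2, \ldots, y_{n-1}\} =
\{\eta_1, \eta_2, \ldots, \eta_{n-1}\}$, where $\{\xi_k\}_1^n$ and $\{\eta_k\}_1^{n-1}$ are the zeros of $p_n(x)$ and $q_{n-1}(y)$ respectively.

We have $X(x) = p_n(x)$ and $Y(y) = q_{n-1}(y)$.

Denote

(4.1)  $W_j^* = \int_{-1}^{1}\frac{Z(x)p_n(x)\,dx}{(x - \xi_j)p_n'(\xi_j)}$,

(4.2)  $V_j^* = \int_{-1}^{1}\frac{q_{n-1}(y)\,dy}{Z(y)(y - \eta_j)q_{n-1}'(\eta_j)}$.

The matrix $A$ in (3.7) takes the form

(4.3)  $A = (a_{kj})_{n \times n}$,

where

$a_{kj} = \frac{1}{\xi_j - \eta_k}\left(\frac{b}{\pi}W_j^* - \frac{\rho\,q_{n-1}(\eta_k)}{p_n'(\xi_j)}\right) = \frac{b}{\pi}\,\frac{W_j^*}{\xi_j - \eta_k}$, for $1 \le k \le n-1$, $1 \le j \le n$.

The matrix $B$ in (3.8) also has a simpler form,

(4.4)  $B = (b_{kj})_{n \times n}$,

where

$b_{kj} = \frac{1}{\xi_k - \eta_j}\left(\frac{b}{\pi}V_j^* + \frac{1}{q_{n-1}'(\eta_j)}\,V\!\left(\frac{q_{n-1}(y)}{Z(y)}\right)\Big|_{y=\xi_k}\right)$
$\quad = \frac{1}{\xi_k - \eta_j}\left(\frac{b}{\pi}V_j^* + \frac{\rho\,p_n(\xi_k)}{q_{n-1}'(\eta_j)}\right) = \frac{b}{\pi}\,\frac{V_j^*}{\xi_k - \eta_j}$, for $1 \le k \le n$, $1 \le j \le n-1$.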
Similar to Proposition 3.3, we have

Proposition 4.1. For any polynomial $\phi(t)$ of degree $< 2n$, the matrix $A$ in (4.3) satisfies

$\sum_{j=1}^{n} a_{kj}\phi(\xi_j) = U(Z(x)\phi(x))\big|_{x=\eta_k}$, for $1 \le k \le n-1$,

$\sum_{j=1}^{n} a_{nj}\phi(\xi_j) = \frac{1}{\pi}\int_{-1}^{1} Z(x)\phi(x)\,dx$.

Similar to Proposition 3.4 we have

Proposition 4.2. For any polynomial $\phi(t)$ of degree $\le 2n-2$, the matrix $B$ in (4.4) satisfies

$\sum_{j=1}^{n-1} b_{kj}\phi(\eta_j) = V\!\left(\frac{\phi(y)}{Z(y)}\right)\Big|_{y=\xi_k}$, for $k = 1, 2, \ldots, n$.

Since the Gauss quadrature formulae have higher algebraic accuracy, this scheme may have a higher
rate of convergence.
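The closed-form inverse can be checked numerically in the Chebyshev case $a = 0$, $b = 1$ (so $\rho = 1$ and $Z(x) = (1-x^2)^{-1/2}$). The sketch below is an illustration only: it assumes the entries $a_{kj} = \frac{b}{\pi} W_j^*/(\xi_j - \eta_k)$, $a_{nj} = W_j^*/\pi$, $b_{kj} = \frac{b}{\pi} V_j^*/(\xi_k - \eta_j)$ and $b_{kn} = b\rho$, and uses the classical Gauss-Chebyshev nodes and weights. The function names are hypothetical.

```python
import math

def gauss_chebyshev_matrices(n):
    """Matrices A and B of the Gauss-Jacobi scheme for a = 0, b = 1 (rho = 1)."""
    # Zeros of T_n (quadrature nodes) and of U_{n-1} (collocation points).
    xi = [math.cos((2 * j - 1) * math.pi / (2 * n)) for j in range(1, n + 1)]
    eta = [math.cos(k * math.pi / n) for k in range(1, n)]
    # Gauss weights for the weights 1/sqrt(1-x^2) and sqrt(1-y^2).
    w_star = [math.pi / n] * n
    v_star = [math.pi * (1.0 - e * e) / n for e in eta]
    b, rho = 1.0, 1.0

    A = [[(b / math.pi) * w_star[j] / (xi[j] - eta[k]) for j in range(n)]
         for k in range(n - 1)]
    A.append([w_star[j] / math.pi for j in range(n)])   # row of the side condition

    B = [[(b / math.pi) * v_star[j] / (xi[k] - eta[j]) for j in range(n - 1)]
         + [b * rho]
         for k in range(n)]
    return xi, eta, A, B

def matmul(X, Y):
    """Plain matrix product of two lists of rows."""
    return [[sum(X[i][l] * Y[l][j] for l in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]
```

As a usage example, with $n = 6$ the product `matmul(A, B)` should agree with the identity up to rounding. Taking $f(x) = U_1(x) = 2x$ and $c = 0$, the vector `B` applied to `[2*e for e in eta] + [0.0]` should reproduce $T_2(\xi_k) = 2\xi_k^2 - 1$, since $U(Z\,T_2) = U_1$.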

2. Lobatto-Jacobi Scheme. Let $\{-1, -\eta_{n-1}, -\eta_{n-2}, \ldots, -\eta_1, 1\}$ be quadrature nodes and let
$\{-\xi_n, -\xi_{n-1}, \ldots, -\xi_1\}$ be collocation points. We have

(4.5)  $X(x) = (x+1)(x-1)\prod_{j=1}^{n-1}(x + \eta_j)$
       $= (-1)^n(1-x)(1+x)\prod_{j=1}^{n-1}(-x - \eta_j)$
       $= (-1)^n(1-x)(1+x)\,q_{n-1}(-x)$,

(4.6)  $Y(y) = \prod_{j=1}^{n}(y + \xi_j) = (-1)^n p_n(-y)$.

Considering our assumption of $a > 0$, $b > 0$ and $\theta > 0$ in the definition of $Z(x) = (1-x)^{-\theta/\pi}(1+x)^{-1+\theta/\pi}$,
we can represent the quadrature coefficients $W_j$ and $V_j$ by using the $W_j^*$ and $V_j^*$.

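(4.7)  $W_0 = \int_{-1}^{1}\frac{Z(x)X(x)\,dx}{(x+1)X'(-1)}$
       $= \frac{1}{2q_{n-1}(1)}\int_{-1}^{1}\frac{(1-x)^{-\theta/\pi}(1+x)^{-1+\theta/\pi}(1-x)(1+x)\,q_{n-1}(-x)}{x+1}\,dx$
       $= \frac{1}{2q_{n-1}(1)}\int_{-1}^{1}(1-x)^{-1+\theta/\pi}(1+x)^{1-\theta/\pi}\,q_{n-1}(x)\,dx$
       $= -\frac{1}{2q_{n-1}(1)}\int_{-1}^{1}\frac{q_{n-1}(x)\,dx}{(x-1)Z(x)}$
       $= -\frac{\pi}{2q_{n-1}(1)\,b}\left(\frac{a\,q_{n-1}(1)}{Z(1)} - \rho\,p_n(1)\right)$
       $= \frac{\pi\rho\,p_n(1)}{2b\,q_{n-1}(1)}$,

where we use the fact that $\frac{1}{Z(1)} = 0$ because of $\theta > 0$. For $j = 1, 2, \ldots, n-1$, we have

(4.8)  $W_j = \int_{-1}^{1}\frac{Z(x)X(x)\,dx}{(x + \eta_{n-j})X'(-\eta_{n-j})}$
       $= -\int_{-1}^{1}\frac{(1-x)^{-\theta/\pi}(1+x)^{-1+\theta/\pi}(1-x)(1+x)\,q_{n-1}(-x)\,dx}{(x + \eta_{n-j})\,q_{n-1}'(\eta_{n-j})(1 - \eta_{n-j}^2)}$
       $= \frac{1}{1 - \eta_{n-j}^2}\int_{-1}^{1}\frac{(1+x)^{1-\theta/\pi}(1-x)^{\theta/\pi}\,q_{n-1}(x)\,dx}{(x - \eta_{n-j})\,q_{n-1}'(\eta_{n-j})}$
       $= \frac{V_{n-j}^*}{1 - \eta_{n-j}^2}$

and

(4.9)  $W_n = \int_{-1}^{1}\frac{Z(x)X(x)\,dx}{(x-1)X'(1)}$
       $= -\frac{1}{2q_{n-1}(-1)}\int_{-1}^{1}\frac{(1-x)^{-\theta/\pi}(1+x)^{-1+\theta/\pi}(1-x)(1+x)\,q_{n-1}(-x)}{x-1}\,dx$
       $= \frac{1}{2q_{n-1}(-1)}\int_{-1}^{1}\frac{q_{n-1}(x)\,dx}{Z(x)(x+1)}$
       $= \frac{\pi}{2q_{n-1}(-1)\,b}\left(\frac{a\,q_{n-1}(-1)}{Z(-1)} - \rho\,p_n(-1)\right)$
       $= -\frac{\pi\rho\,p_n(-1)}{2b\,q_{n-1}(-1)}$,

where we also use $\frac{1}{Z(-1)} = 0$ because of $\theta < \pi$.

For $j = 1, 2, \ldots, n$, we have

(4.10)  $V_j = \int_{-1}^{1}\frac{Y(y)\,dy}{(y + \xi_{n-j+1})Z(y)Y'(-\xi_{n-j+1})}$
        $= -\int_{-1}^{1}\frac{p_n(-y)\,dy}{(y + \xi_{n-j+1})Z(y)\,p_n'(\xi_{n-j+1})}$
        $= \int_{-1}^{1}\frac{(1+y)^{\theta/\pi}(1-y)^{1-\theta/\pi}\,p_n(y)\,dy}{(y - \xi_{n-j+1})\,p_n'(\xi_{n-j+1})}$
        $= \int_{-1}^{1}\frac{Z(y)(1 - y^2)\,p_n(y)\,dy}{(y - \xi_{n-j+1})\,p_n'(\xi_{n-j+1})}$
        $= (1 - \xi_{n-j+1}^2)W_{n-j+1}^* - \int_{-1}^{1}\frac{Z(y)p_n(y)(y + \xi_{n-j+1})\,dy}{p_n'(\xi_{n-j+1})}$
        $= (1 - \xi_{n-j+1}^2)W_{n-j+1}^*$.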

We can construct the system of equations as follows:

Let

$\phi = (\phi(-1), \phi(-\eta_{n-1}), \ldots, \phi(-\eta_1), \phi(1))^T$,

$f = (c, f(-\xi_n), f(-\xi_{n-1}), \ldots, f(-\xi_1))^T$.

We have

(4.11)  $A\phi = f$,

where

(4.12)  $A = (a_{kj})_{(n+1) \times (n+1)}$, $0 \le k \le n$, $0 \le j \le n$,

$a_{kj} = \frac{b}{\pi}\,\frac{V_{n-j}^*}{1 - \eta_{n-j}^2}\,\frac{1}{\xi_{n-k+1} - \eta_{n-j}}$, for $1 \le k \le n$, $1 \le j \le n-1$,

$a_{k0} = -\frac{\rho}{2}\,\frac{p_n(1)}{q_{n-1}(1)}\,\frac{1}{1 - \xi_{n-k+1}}$, for $1 \le k \le n$,

$a_{kn} = -\frac{\rho}{2}\,\frac{p_n(-1)}{q_{n-1}(-1)}\,\frac{1}{1 + \xi_{n-k+1}}$, for $1 \le k \le n$,

$a_{0j} = \frac{1}{\pi}\,\frac{V_{n-j}^*}{1 - \eta_{n-j}^2}$, for $1 \le j \le n-1$,

$a_{00} = \frac{\rho}{2b}\,\frac{p_n(1)}{q_{n-1}(1)}$,

$a_{0n} = -\frac{\rho}{2b}\,\frac{p_n(-1)}{q_{n-1}(-1)}$.
Similar to Proposition 4.1, we have

Proposition 4.3. For any polynomial $\phi(t)$ of degree $\le 2n-3$, the matrix $A$ in (4.12) satisfies

$a_{k0}\phi(-1) + \sum_{j=1}^{n-1} a_{kj}\phi(-\eta_{n-j}) + a_{kn}\phi(1) = U(Z(x)\phi(x))\big|_{x=-\xi_{n-k+1}}$, for $1 \le k \le n$,

$a_{00}\phi(-1) + \sum_{j=1}^{n-1} a_{0j}\phi(-\eta_{n-j}) + a_{0n}\phi(1) = \frac{1}{\pi}\int_{-1}^{1} Z(x)\phi(x)\,dx$.

Similar to Proposition 4.2, we introduce a matrix

$B = (b_{kj})_{(n+1) \times (n+1)}$, $0 \le k \le n$, $0 \le j \le n$,

where

(4.13)  $b_{kj} = \frac{b}{\pi}\,\frac{(1 - \xi_{n-j+1}^2)W_{n-j+1}^*}{\xi_{n-j+1} - \eta_{n-k}}$, for $1 \le k \le n-1$, $1 \le j \le n$,

        $b_{0j} = \frac{b}{\pi}\,\frac{(1 - \xi_{n-j+1}^2)W_{n-j+1}^*}{\xi_{n-j+1} - 1}$, for $1 \le j \le n$,

        $b_{nj} = \frac{b}{\pi}\,\frac{(1 - \xi_{n-j+1}^2)W_{n-j+1}^*}{\xi_{n-j+1} + 1}$, for $1 \le j \le n$,

        $b_{k0} = b\rho$, for $0 \le k \le n$.

Proposition 4.4. For any polynomial $\phi(t)$ of degree $< 2n-1$, the matrix $B$ in (4.13) satisfies

$\sum_{j=1}^{n} b_{kj}\phi(-\xi_{n-j+1}) = V\!\left(\frac{\phi(y)}{Z(y)}\right)\Big|_{y=-1}$, for $k = 0$,

$\sum_{j=1}^{n} b_{kj}\phi(-\xi_{n-j+1}) = V\!\left(\frac{\phi(y)}{Z(y)}\right)\Big|_{y=-\eta_{n-k}}$, for $1 \le k \le n-1$,

$\sum_{j=1}^{n} b_{kj}\phi(-\xi_{n-j+1}) = V\!\left(\frac{\phi(y)}{Z(y)}\right)\Big|_{y=1}$, for $k = n$.

Proceeding as in the proofs of Lemma 3.1 and Lemma 3.2, we can show that the
coefficient matrix $A$ of the Gauss-Jacobi scheme in (4.3) and the coefficient matrix $A$ of the Lobatto-Jacobi
scheme in (4.12) are invertible and that their inverses are of the closed form $\frac{1}{\rho^2}B$, where $B$ is given in
(4.4) and (4.13) respectively.

These two schemes can approximate the singular integral operator with higher algebraic accuracy.
Are there other groups of quadrature nodes or collocation nodes which have similar properties? For
example, can we construct a Radau-Jacobi scheme? In the case of $a = 0$ and $b = 1$, we found four
different choices of the nodes (see [9]).

3. Coincidence of Quadrature Nodes and Collocation Points.

If we choose the $n-1$ collocation points to be the same as the first $n-1$ quadrature nodes, we can also
construct an invertible linear system of equations.

Let $y_j = x_j$, for $1 \le j \le n-1$; we have

$X(x) = \prod_{j=1}^{n}(x - x_j)$,

$Y(y) = \prod_{j=1}^{n-1}(y - x_j)$.

The discrete system of equations has the form

$A\phi = f$,

where

$\phi = (\phi(x_1), \phi(x_2), \ldots, \phi(x_n))^T$,

$f = (f(x_1), f(x_2), \ldots, f(x_{n-1}), c)^T$,

$A = (a_{kj})_{n \times n}$, $1 \le k \le n$, $1 \le j \le n$,
By Lemma 3.1, the matrix $A$ in (4.14) is nonsingular. Next, we introduce a matrix $B$,


B  —  (iy)nxn 


l  <  k  <  n,  1  <  j  <  n, 


Mi, 

nY'(xj)  Xj-xk 

(4.15)  .  a  6  f1  Y(t)dt 

kk  Z(xk)  *Y'(xk)  /.I  Z(t)(t  -  *t)2’ 

=  ip,  for  1  <  fc  <  n. 

Proposition 4.6. For any polynomial $\phi$ of degree $\le n-2$, the matrix $B$ in (4.15) satisfies

$\sum_{j=1}^{n-1} b_{kj}\phi(x_j) = V\!\left(\frac{\phi(y)}{Z(y)}\right)\Big|_{y=x_k}$, for $1 \le k \le n$.

By using Lemma 3.2, we know that the inverse of $A$ in (4.14) is $\frac{1}{\rho^2}B$.

Note: The collocation points are not necessarily the FIRST $n-1$ quadrature nodes. In fact, they
can be arbitrary $n-1$ nodes. If we take $n$ collocation points, we will obtain a system of $n+1$
equations in $n$ unknowns. The rank of the coefficient matrix is $n$. We can use the regularization
method to find its solution.

4. Minimum Norm-Least Square Scheme.

Let $\{x_1, x_2, \ldots, x_n\} = \{y_1, y_2, \ldots, y_n\} = \{\xi_1, \xi_2, \ldots, \xi_n\}$, then $X(x) = Y(x) = p_n(x)$. We can
construct a system of $n$ equations in $n$ unknowns only from the integral equation $U(Z(x)\phi(x)) = f(x)$
without considering the additional condition $\frac{1}{\pi}\int_{-1}^{1} Z(x)\phi(x)\,dx = c$. We are going to show that the
rank of the coefficient matrix is $n-1$ and to find a closed form of its generalized inverse.

Denote the discrete system as

$A\phi = f$,

where

$\phi = (\phi(\xi_1), \phi(\xi_2), \ldots, \phi(\xi_n))^T$,

$f = (f(\xi_1), f(\xi_2), \ldots, f(\xi_n))^T$,

$A = (a_{kj})_{n \times n}$,

(4.16)  $a_{kj} = \frac{b}{\pi p_n'(\xi_j)}\int_{-1}^{1}\frac{Z(t)p_n(t)\,dt}{(t - \xi_j)(t - \xi_k)}$, for $1 \le k, j \le n$, $k \ne j$,

        $a_{kk} = aZ(\xi_k) + \frac{b}{\pi p_n'(\xi_k)}\int_{-1}^{1}\frac{Z(t)p_n(t)\,dt}{(t - \xi_k)^2}$, for $1 \le k \le n$.

Similarly, we introduce a matrix $B$,

$B = (b_{kj})_{n \times n}$,

(4.17)  $b_{kj} = -\frac{b}{\pi p_n'(\xi_j)}\int_{-1}^{1}\frac{p_n(t)\,dt}{Z(t)(t - \xi_j)(t - \xi_k)}$, for $1 \le k, j \le n$, $k \ne j$,

        $b_{kk} = \frac{a}{Z(\xi_k)} - \frac{b}{\pi p_n'(\xi_k)}\int_{-1}^{1}\frac{p_n(t)\,dt}{Z(t)(t - \xi_k)^2}$, for $1 \le k \le n$.
Proposition 4.7. For any polynomial $\phi(t)$ of degree $\le n-1$, the matrix $A$ in (4.16) satisfies

$\sum_{j=1}^{n} a_{kj}\phi(\xi_j) = U(Z(x)\phi(x))\big|_{x=\xi_k}$, for $1 \le k \le n$.

Proposition 4.8. For any polynomial $\phi(t)$ of degree $\le n-1$, the matrix $B$ in (4.17) satisfies

$\sum_{j=1}^{n} b_{kj}\phi(\xi_j) = V\!\left(\frac{\phi(y)}{Z(y)}\right)\Big|_{y=\xi_k}$, for $1 \le k \le n$.

The proofs are the same as that of Proposition 4.5; however, we cannot use the general Lemma
3.1 and Lemma 3.2 anymore.

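Denote

$P = \begin{pmatrix} p_0(\xi_1) & p_1(\xi_1) & \cdots & p_{n-1}(\xi_1) \\ p_0(\xi_2) & p_1(\xi_2) & \cdots & p_{n-1}(\xi_2) \\ \vdots & \vdots & & \vdots \\ p_0(\xi_n) & p_1(\xi_n) & \cdots & p_{n-1}(\xi_n) \end{pmatrix}$

and

$Q = \begin{pmatrix} 0 & q_0(\xi_1) & \cdots & q_{n-2}(\xi_1) \\ 0 & q_0(\xi_2) & \cdots & q_{n-2}(\xi_2) \\ \vdots & \vdots & & \vdots \\ 0 & q_0(\xi_n) & \cdots & q_{n-2}(\xi_n) \end{pmatrix}$.

We have

$AP = \rho Q$.

Since $P$ is nonsingular, $\mathrm{rank}(A) = \mathrm{rank}(Q) = n-1$.
Denote

$\tilde{P} = \begin{pmatrix} p_1(\xi_1) & p_2(\xi_1) & \cdots & p_{n-1}(\xi_1) & 0 \\ p_1(\xi_2) & p_2(\xi_2) & \cdots & p_{n-1}(\xi_2) & 0 \\ \vdots & \vdots & & \vdots & \vdots \\ p_1(\xi_n) & p_2(\xi_n) & \cdots & p_{n-1}(\xi_n) & 0 \end{pmatrix}$

and

$\tilde{Q} = \begin{pmatrix} q_0(\xi_1) & q_1(\xi_1) & \cdots & q_{n-1}(\xi_1) \\ q_0(\xi_2) & q_1(\xi_2) & \cdots & q_{n-1}(\xi_2) \\ \vdots & \vdots & & \vdots \\ q_0(\xi_n) & q_1(\xi_n) & \cdots & q_{n-1}(\xi_n) \end{pmatrix}$.

We have

$B\tilde{Q} = \rho\tilde{P}$.

Since $\tilde{Q}$ is nonsingular, $\mathrm{rank}(B) = \mathrm{rank}(\tilde{P}) = n-1$.
Introduce

$\tilde{B} = \frac{1}{\rho^2}B$;

we have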
we  have 
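Proposition 4.9. $\tilde{B}$ is the $M$-$N$ generalized inverse of $A$ in (4.16), that is, $\tilde{B}$ and $A$ satisfy

(4.18.1)  $A\tilde{B}A = A$,

(4.18.2)  $\tilde{B}A\tilde{B} = \tilde{B}$,

(4.18.3)  $(A\tilde{B})^T M = M(A\tilde{B})$,

(4.18.4)  $(\tilde{B}A)^T N = N(\tilde{B}A)$,

where $M = (\tilde{Q}\tilde{Q}^T)^{-1}$ and $N = (PP^T)^{-1}$.

Proof. Let

$J = \begin{pmatrix} 0 & I_{n-1} \\ 0 & 0 \end{pmatrix}$,

where $I_{n-1}$ is the identity matrix of order $n-1$. It is obvious that $J^T$ and $J$ satisfy

(4.19.1)  $JJ^TJ = J$,

(4.19.2)  $J^TJJ^T = J^T$,

(4.19.3)  $(JJ^T)^T = JJ^T$,

(4.19.4)  $(J^TJ)^T = J^TJ$.

Since $\tilde{P} = PJ^T$ and $Q = \tilde{Q}J$, we have

$A = \rho\,\tilde{Q}JP^{-1}$,

$\tilde{B} = \frac{1}{\rho}PJ^T\tilde{Q}^{-1}$.

Therefore

$A\tilde{B}A = \rho(\tilde{Q}JP^{-1})(PJ^T\tilde{Q}^{-1})(\tilde{Q}JP^{-1}) = \rho\,\tilde{Q}JJ^TJP^{-1} = A$,

$\tilde{B}A\tilde{B} = \frac{1}{\rho}(PJ^T\tilde{Q}^{-1})(\tilde{Q}JP^{-1})(PJ^T\tilde{Q}^{-1}) = \frac{1}{\rho}PJ^TJJ^T\tilde{Q}^{-1} = \tilde{B}$,

$(A\tilde{B})^TM = (\tilde{Q}JP^{-1}PJ^T\tilde{Q}^{-1})^T(\tilde{Q}\tilde{Q}^T)^{-1} = (\tilde{Q}^{-T}JJ^T\tilde{Q}^T)(\tilde{Q}^{-T}\tilde{Q}^{-1})$
$\quad = \tilde{Q}^{-T}JJ^T\tilde{Q}^{-1} = (\tilde{Q}^{-T}\tilde{Q}^{-1})(\tilde{Q}JJ^T\tilde{Q}^{-1}) = M(A\tilde{B})$,

$(\tilde{B}A)^TN = (PJ^T\tilde{Q}^{-1}\tilde{Q}JP^{-1})^T(PP^T)^{-1} = (P^{-T}J^TJP^T)(P^{-T}P^{-1})$
$\quad = P^{-T}J^TJP^{-1} = (PP^T)^{-1}(PJ^TJP^{-1}) = N(\tilde{B}A)$.

According to C. R. Rao and S. K. Mitra [13] (p. 52), $\tilde{B}$ is the $M$-$N$ generalized inverse of $A$.
Q.E.D.

Proposition 4.10. $\hat{x} = \tilde{B}f$ is the minimum $N$-norm $M$-least square solution of the $n \times n$ system
of equations $A\phi = f$. That is, $\hat{x}$ satisfies

$\|\hat{x}\|_N = \min \|y\|_N$,

where $y$ makes

$\|Ay - f\|_M = \min_z \|Az - f\|_M$.

The $\|x\|_M$ and $\|x\|_N$ are defined as $\sqrt{x^TMx}$ and $\sqrt{x^TNx}$ respectively.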
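The algebra behind the four generalized-inverse conditions can be checked numerically. The sketch below is an illustration under stated assumptions, not the authors' computation: it takes the Chebyshev case $a = 0$, $b = 1$ ($\rho = 1$), builds $P$ and $\tilde{Q}$ from the monic Chebyshev polynomials at the zeros of $T_n$, defines $A = \rho\tilde{Q}JP^{-1}$ and $\tilde{B} = \rho^{-1}PJ^T\tilde{Q}^{-1}$ through the factorizations used in the proof (rather than through the integrals defining the entries), and measures how far (4.18.1)-(4.18.4) are from holding. All helper names are hypothetical.

```python
import math

def matmul(X, Y):
    return [[sum(X[i][l] * Y[l][j] for l in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]

def transpose(X):
    return [list(col) for col in zip(*X)]

def inverse(X):
    """Gauss-Jordan elimination with partial pivoting."""
    n = len(X)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(X)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[piv] = aug[piv], aug[c]
        d = aug[c][c]
        aug[c] = [v / d for v in aug[c]]
        for r in range(n):
            if r != c:
                fac = aug[r][c]
                aug[r] = [vr - fac * vc for vr, vc in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]

def generalized_inverse_defect(n):
    """Largest entrywise violation of the four M-N generalized inverse conditions."""
    xi = [math.cos((2 * i - 1) * math.pi / (2 * n)) for i in range(1, n + 1)]

    def p(m, x):   # monic Chebyshev polynomial of the first kind
        return 1.0 if m == 0 else math.cos(m * math.acos(x)) / 2 ** (m - 1)

    def q(m, x):   # monic Chebyshev polynomial of the second kind
        th = math.acos(x)
        return math.sin((m + 1) * th) / math.sin(th) / 2 ** m

    P = [[p(m, x) for m in range(n)] for x in xi]
    Qt = [[q(m, x) for m in range(n)] for x in xi]
    J = [[1.0 if j == i + 1 else 0.0 for j in range(n)] for i in range(n)]

    A = matmul(matmul(Qt, J), inverse(P))              # rho = 1
    Bt = matmul(matmul(P, transpose(J)), inverse(Qt))
    M = inverse(matmul(Qt, transpose(Qt)))
    N = inverse(matmul(P, transpose(P)))
    AB, BA = matmul(A, Bt), matmul(Bt, A)

    pairs = [
        (matmul(AB, A), A),                            # A B A = A
        (matmul(BA, Bt), Bt),                          # B A B = B
        (matmul(transpose(AB), M), matmul(M, AB)),     # (AB)^T M = M (AB)
        (matmul(transpose(BA), N), matmul(N, BA)),     # (BA)^T N = N (BA)
    ]
    return max(abs(x - y)
               for left, right in pairs
               for row_l, row_r in zip(left, right)
               for x, y in zip(row_l, row_r))
```

For example, `generalized_inverse_defect(4)` should return a value at rounding level, which is consistent with the proof of Proposition 4.9.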


5. Numerical example

We have considered a numerical example for the equation (1.1) with $a = 0$, $b = 1$, $k = 0$ and
$f(x) = \frac{1}{(x-c)^2 + c^2}$. As $c$ is quite small (e.g. $c = 0.01$), $f(x)$ has a peak value $10^4$ at $x = c$. The exact
solution is

$\phi(x) = \frac{(1 + 4c^4)^{1/4}}{c}\,\frac{(x-c)\cos\frac{\theta}{2} + c\sin\frac{\theta}{2}}{(x-c)^2 + c^2}$,

where

$\tan\theta = 2c^2$.

It also has a maximum and a minimum in the vicinity of 0. Because of this unusual behavior, the
values of $f(x)$ at the Chebyshev collocation points $\{s_j\}$ cannot represent the features of $f(x)$, and the
classical Gauss-Chebyshev scheme fails to provide a satisfactory approximate solution. The curve
(dotted line) in the following figure shows unacceptable errors.

Another way is to choose collocation points flexibly. For example, if we take $x = -0.01$ as a
collocation point instead of $x = s_{n/2} = 0$, we obtain a general quadrature-collocation scheme.
The curve (dashed line) in the figure shows that it gives a much better approximation to the exact
solution (points labeled by small triangles in the figure) than the Gauss-Chebyshev scheme does.

As $n$ increases, Chebyshev nodes become dense. Since the Lebesgue constants of interpolation
at Chebyshev nodes grow only logarithmically, a more satisfactory approximation may be obtained. However,
when the number $n$ is restricted, the flexible choice of collocation nodes can lead to a better
result at a lower computational cost.
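The data of this example are easy to reproduce. The sketch below is only an illustration: it assumes $f(x) = 1/((x-c)^2 + c^2)$ and the exact solution $\phi(x) = \frac{(1+4c^4)^{1/4}}{c}\,\frac{(x-c)\cos(\theta/2) + c\sin(\theta/2)}{(x-c)^2+c^2}$ with $\tan\theta = 2c^2$, and it measures the residual of the classical Gauss-Chebyshev rule $\frac{1}{n}\sum_j \phi(\xi_j)/(\xi_j - \eta_k)$ against $f(\eta_k)$ at a collocation point. The function names are hypothetical, and no claim is made about the figure of the paper.

```python
import math

def f(x, c):
    """Right-hand side with a sharp peak of height 1/c^2 at x = c."""
    return 1.0 / ((x - c) ** 2 + c ** 2)

def phi_exact(x, c):
    """Assumed exact solution of (1/pi) PV int phi(t)/(sqrt(1-t^2)(t-x)) dt = f(x)."""
    theta = math.atan(2.0 * c * c)
    amp = (1.0 + 4.0 * c ** 4) ** 0.25 / c
    num = (x - c) * math.cos(theta / 2.0) + c * math.sin(theta / 2.0)
    return amp * num / ((x - c) ** 2 + c ** 2)

def gauss_chebyshev_residual(c, n, k):
    """Residual of the n-point Gauss-Chebyshev rule at eta_k = cos(k*pi/n)."""
    eta_k = math.cos(k * math.pi / n)
    total = 0.0
    for j in range(1, n + 1):
        xi_j = math.cos((2 * j - 1) * math.pi / (2 * n))
        total += phi_exact(xi_j, c) / (xi_j - eta_k)
    return total / n - f(eta_k, c)
```

With $c = 0.01$ the peak value `f(c, c)` is $10^4$. The residual is negligible only once $n$ is large compared with $1/c$ (for instance $n = 2000$), which illustrates why a moderate number of Chebyshev nodes cannot resolve the peak.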
On a Hyperbolic Tangent Quadrature Rule
for  Solving  Singular  Integral  Equations 
with  Hadamard  Finite  Part  Integrals 

Fenggang  Zhang 

Department  of  Applied  Mathematics  and  Statistics 
SUNY  at  Stony  Brook. 

Abstract. Stenger's formula is adapted for singular integrals defined as Hadamard finite part.
Convergence of the revised quadrature rule is studied. A scheme for solving singular integral equations with
Hadamard  finite  part  integrals,  is  proposed  based  on  the  revised  hyperbolic  tangent  quadrature  rule. 
The  integral  equation  is  reduced  to  a  system  of  linear  equations,  by  taking  the  same  points  as  quadrature 
nodes  and  collocation  points.  The  coefficient  matrix  of  the  system  is  shown  to  be  nonsingular. 


1.  Introduction 

Many  physical  problems  from  diverse  areas  such  as  aerodynamics,  hydrodynamics  and  elasticity 
may  be  reduced  to  integral  equations  of  the  form 


$$\text{F.P.}\!\int_{-1}^{1}\frac{f(t)}{(t-x)^2}\,dt + \int_{-1}^{1}k(x,t)\,f(t)\,dt = F(x), \qquad -1 < x < 1. \tag{1.1}$$


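The first term denotes the Hadamard finite part integral (J. Hadamard [7]),

$$\text{F.P.}\!\int_{-1}^{1}\frac{f(t)}{(t-x)^2}\,dt := \text{F.P.}\lim_{\varepsilon\to 0}\left(\int_{-1}^{x-\varepsilon}\frac{f(t)}{(t-x)^2}\,dt + \int_{x+\varepsilon}^{1}\frac{f(t)}{(t-x)^2}\,dt\right), \tag{1.2}$$

where the initial "F.P." means extracting only the terms which remain finite as $\varepsilon$ goes to zero. The
second term on the left side of (1.1) is the Fredholm integral.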

The Hadamard finite part integral can also be regarded as a distribution defined on $C_0^\infty(-\infty,\infty)$
(A. Zemanian [8]), where $C_0^\infty(-\infty,\infty)$ is the set of infinitely differentiable functions with compact
support. Therefore, we have

$$\text{F.P.}\!\int_{-1}^{1}\frac{f(t)}{(t-x)^2}\,dt = \frac{d}{dx}\,\text{P.V.}\!\int_{-1}^{1}\frac{f(t)}{t-x}\,dt. \tag{1.3}$$

The right side of (1.3) is the distributional derivative of the Cauchy principal value integral, which
is often regular.

Numerical methods for solving (1.1) have been discussed by Kutt [1], Ioakimidis [2], Kaya
[3] and Golberg [4]. In [5] Venturino proposed a technique for solving Cauchy singular integral
equations. Venturino's scheme utilizes the quadrature formulas which were proposed by F. Stenger [6]
and are called the "hyperbolic tangent quadrature rule". Although there is a close relation between
the two types of singular equations, the supersingularity of the Hadamard finite part integral prevents
direct application of the original hyperbolic tangent quadrature rule. Stenger's formula in favorable

This research was completed under the guidance of Professor R. Srivastav. The author thanks him for his significant
advice and very helpful comments. Research supported in part by NSF Grant No.DMS-89018.v0 and US Army
Research Office Grant No. DAAL03-90-G-0019.
cases has exponential convergence. It seems to be of interest to modify it so that it can be used for
supersingular integrals.

In this paper, we first generalize the trapezoid quadrature rule to the Hadamard finite part integral
on $(-\infty,\infty)$. Then we propose the revised scheme of the hyperbolic tangent quadrature rule and
estimate the error bound of this scheme. In Section 4, we introduce the concept of the generalized
Cauchy matrix and prove that the coefficient matrix of the discrete system is invertible. Finally, we give
two numerical examples to check how well the revised scheme works.

For  the  sake  of  brevity,  we  only  discuss  the  canonical  equation  of  the  form 

<*•<> 

2. Hadamard Finite Part Integral on $(-\infty,\infty)$
and the Revised Trapezoid Quadrature Rule

According to [6], the trapezoid quadrature rule can be introduced by using the Whittaker cardinal
function.

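Let $\Omega_d$ be a strip in the complex plane:

$$\Omega_d := \{x + iy : |y| < d\}, \tag{2.1}$$

where $d > 0$.

Let $B_d$ be the family of functions $f(z)$ that are analytic in $\Omega_d$ and satisfy

$$\int_{-d}^{d}|f(x+iy)|\,dy \to 0 \quad \text{as } x \to \pm\infty \tag{2.2}$$

and

$$N(f,\Omega_d) := \lim_{y\to d^-}\int_{-\infty}^{\infty}\bigl(|f(x+iy)| + |f(x-iy)|\bigr)\,dx < \infty. \tag{2.3}$$

Denote

$$\eta(f) := \int_{-\infty}^{\infty}f(t)\,dt - h\sum_{k=-\infty}^{\infty}f(kh), \tag{2.4}$$

$$\eta_M(f) := \int_{-\infty}^{\infty}f(t)\,dt - h\sum_{k=-M}^{M}f(kh). \tag{2.5}$$

In [6], it is proved that if $f \in B_d$, then

$$|\eta(f)| \le \frac{e^{-\pi d/h}}{2\sinh(\pi d/h)}\,N(f,\Omega_d). \tag{2.6}$$

If also

$$|f(x)| \le c_1 e^{-\alpha|x|}, \tag{2.7}$$

then

$$|\eta_M(f)| \le c_2\,e^{-\sqrt{2\pi d\alpha M}} \tag{2.8}$$

by taking

$$h = \sqrt{\frac{2\pi d}{\alpha M}}, \tag{2.9}$$

where $c_1, c_2, \ldots$ are constants independent of $M$.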
Now, for the Hadamard finite part integral on $(-\infty,\infty)$,

$$I(f) := \text{F.P.}\!\int_{-\infty}^{\infty}\frac{f(t)}{(t-x)^2}\,dt,$$

there is an associated infinite series, whose partial sums may be used for numerical integration. The
error of this approximation is easy to estimate. The main result in this section is
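Theorem 2.1. If $f(x) \in B_d$, $|f(x)| \le c\,e^{-\alpha|x|}$, $\|f''\|_\infty \le c_2$ and

$$\sigma(f) := h\sum_{k=-\infty}^{\infty}\frac{f(x+kh)}{(k-0.5)(k+0.5)h^2}, \tag{2.10}$$

$$\varepsilon(f) := I(f) - \sigma(f), \tag{2.11}$$

$$\varepsilon_M(f) := I(f) - h\sum_{k=-M}^{M}\frac{f(x+kh)}{(k-0.5)(k+0.5)h^2}, \tag{2.12}$$

then

$$|\varepsilon(f)| \le \frac{c_1 e^{-\pi d/h}}{\alpha\sinh(\pi d/h)} + c_2 h$$

and

$$|\varepsilon_M(f)| \le c_3\left(e^{-\alpha Mh} + h\right)$$

by taking $M = 2\pi d/(\alpha h^2)$, where $c$, $c_2$ and $c_3$ are constants independent of $M$ and $h$,

$$c_1 = \frac{4c\,(1+\alpha+\alpha d^2)}{d^2}, \tag{2.13}$$

$$c_3 = \max\left(\frac{4c_1}{\alpha},\; c_2 + \frac{\alpha c}{\pi d}\right). \tag{2.14}$$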

We  prove  the  theorem  through  several  lemmas. 
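
As an illustration of the rule in Theorem 2.1, the following Python sketch evaluates the truncated sum in (2.12) for a test function. The function name `fp_trapezoid`, the test function $f(t) = e^{-t^2}$ and the parameter values are assumptions made for this example and are not taken from the paper; the reference value $-2\sqrt{\pi}$ follows from the regularized form $\int_{-\infty}^{\infty}(e^{-t^2}-1)/t^2\,dt$.

```python
import math


def fp_trapezoid(f, x, h, M):
    """Approximate F.P. int f(t)/(t-x)^2 dt over the real line by the
    truncated sum  h * sum_{k=-M}^{M} f(x + k h) / ((k-0.5)(k+0.5) h^2)."""
    total = 0.0
    for k in range(-M, M + 1):
        total += f(x + k * h) / ((k - 0.5) * (k + 0.5))
    return total / h


def gaussian(t):
    # Hypothetical test function, chosen because the finite part is known.
    return math.exp(-t * t)


# For f(t) = exp(-t^2) and x = 0 the finite part integral equals -2*sqrt(pi).
exact = -2.0 * math.sqrt(math.pi)
approx = fp_trapezoid(gaussian, 0.0, 0.1, 200)
print(approx, exact)
```

Note that the weights $1/((k-0.5)(k+0.5))$ never vanish, so no special treatment of the node $t = x$ is needed.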

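Lemma 2.2. If $f(t) \in B_d$ and $\|f\|_\infty \le c$, then the Hadamard finite part integral $I(f)$ can be
represented as

$$I_1 = \int_{-\infty}^{\infty} g(t)\,dt,$$

where

$$g(t) = \begin{cases}\dfrac{f(x+t)+f(x-t)-2f(x)}{2t^2}, & t \ne 0,\\[2mm] \dfrac{f''(x)}{2}, & t = 0.\end{cases}$$

Here $g(t) \in B_d$ and

$$N(g,\Omega_d) \le \frac{2}{d^2}\,N(f,\Omega_d) + 8\left(\frac{1}{d^2}+1\right)\|f\|_\infty.$$

Proof.

$$\begin{aligned} I(f) &= \text{F.P.}\!\int_{-\infty}^{x}\frac{f(t)}{(t-x)^2}\,dt + \text{F.P.}\!\int_{x}^{\infty}\frac{f(t)}{(t-x)^2}\,dt = \text{F.P.}\!\int_{0}^{\infty}\frac{f(x+t)+f(x-t)}{t^2}\,dt\\ &= \int_{0}^{\infty}\frac{f(x+t)+f(x-t)-2f(x)}{t^2}\,dt + 2f(x)\,\text{F.P.}\!\int_{0}^{\infty}\frac{dt}{t^2}\\ &= \int_{0}^{\infty}2g(t)\,dt + f(x)\,\text{F.P.}\lim_{\varepsilon\to0}\int_{|t|>\varepsilon}\frac{dt}{t^2}\\ &= I_1, \end{aligned}$$

because $f(t)$ is analytic, 0 is no longer a singular point of $g(t)$, $g(t)$ is even on $(-\infty,\infty)$ and the
second term is equal to zero. Furthermore,

$$\begin{aligned} N(g,\Omega_d) &= \lim_{y\to d^-}\int_{-\infty}^{\infty}\bigl(|g(t+iy)| + |g(t-iy)|\bigr)\,dt\\ &\le \lim_{y\to d^-}\int_{-\infty}^{\infty}\left(\frac{|f(x+t+iy)+f(x-t-iy)-2f(x)|}{|t+iy|^2} + \frac{|f(x+t-iy)+f(x-t+iy)-2f(x)|}{|t-iy|^2}\right)dt\\ &\le \lim_{y\to d^-}\left(2\int_{-\infty}^{\infty}\frac{|f(x+t+iy)|+|f(x-t-iy)|}{|t+iy|^2}\,dt + 4|f(x)|\left(\int_{|t|<1}\frac{dt}{|t+iy|^2} + \int_{|t|>1}\frac{dt}{|t+iy|^2}\right)\right)\\ &\le \frac{2}{d^2}\,N(f,\Omega_d) + 4\|f\|_\infty\left(\frac{2}{d^2}+2\right). \qquad \text{Q.E.D.} \end{aligned}$$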


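Lemma 2.3. If $f \in B_d$, $|f(x)| \le c\,e^{-\alpha|x|}$ and

$$\varepsilon(g) := I_1 - h\left(\sum_{k=-\infty,\,k\ne0}^{\infty}\frac{f(x+kh)+f(x-kh)-2f(x)}{2k^2h^2} + \frac{f''(x)}{2}\right),$$

then

$$|\varepsilon(g)| \le \frac{c_1 e^{-\pi d/h}}{\alpha\sinh(\pi d/h)},$$

where $c_1$ is defined in (2.13).

Proof. Since

$$N(f,\Omega_d) \le 2c\int_{-\infty}^{\infty}e^{-\alpha|x|}\,dx = \frac{4c}{\alpha},$$

by using (2.6) and Lemma 2.2, we obtain

$$\begin{aligned} |\varepsilon(g)| &\le \frac{e^{-\pi d/h}}{2\sinh(\pi d/h)}\,N(g,\Omega_d)\\ &\le \frac{e^{-\pi d/h}}{2\sinh(\pi d/h)}\left(\frac{2}{d^2}\,N(f,\Omega_d) + 8c\left(\frac{1}{d^2}+1\right)\right)\\ &\le \frac{4c\,e^{-\pi d/h}}{\alpha d^2\sinh(\pi d/h)}\,(1+\alpha+\alpha d^2). \qquad \text{Q.E.D.} \end{aligned}$$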
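Lemma 2.4. If $f \in B_d$, $|f''(x)| \le c_2$ and

$$\varepsilon^*(g) := I_1 - h\sum_{k=-\infty}^{\infty}\frac{f(x+kh)+f(x-kh)-2f(x)}{2\,(k-0.5)h\,(k+0.5)h},$$

then

$$|\varepsilon^*(g)| \le |\varepsilon(g)| + c_2 h.$$

Proof. Using the mean value theorem and the identity

$$\sum_{k=1}^{\infty}\frac{1}{k^2-0.25} = 2,$$

we find that

$$\begin{aligned} |\varepsilon^*(g)-\varepsilon(g)| &= \left|\,h\sum_{k=1}^{\infty}\frac{f(x+kh)+f(x-kh)-2f(x)}{h^2}\left(\frac{1}{k^2-0.25}-\frac{1}{k^2}\right) - \frac{h}{2}f''(x)\right|\\ &\le c_2h\sum_{k=1}^{\infty}k^2\left(\frac{1}{k^2-0.25}-\frac{1}{k^2}\right) + \frac{h}{2}c_2\\ &= \frac{c_2h}{4}\sum_{k=1}^{\infty}\frac{1}{k^2-0.25} + \frac{h}{2}c_2 = c_2h. \qquad \text{Q.E.D.} \end{aligned}$$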
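
For convenience, the two series identities used in this section (in the proof of Lemma 2.4 and in the proof of Theorem 2.1) can be checked by a telescoping sum. The short derivation below is added as a worked step and is not part of the source:

$$\frac{1}{k^2-\tfrac14} = \frac{1}{k-\tfrac12}-\frac{1}{k+\tfrac12} \;\Longrightarrow\; \sum_{k=1}^{\infty}\frac{1}{k^2-\tfrac14} = \frac{1}{1-\tfrac12} = 2, \qquad \sum_{k=-\infty}^{\infty}\frac{1}{(k-\tfrac12)(k+\tfrac12)} = -4 + 2\cdot 2 = 0,$$

where $-4$ is the $k = 0$ term and the terms with $k$ and $-k$ coincide.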


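The proof of Theorem 2.1:

By Lemma 2.2, Lemma 2.3, Lemma 2.4 and the identity

$$\sum_{k=-\infty}^{\infty}\frac{1}{(k-0.5)(k+0.5)} = 0,$$

we find that

$$\begin{aligned} \varepsilon(f) &= I(f) - \sigma(f) = I_1 - \sigma(f)\\ &= \left(I_1 - h\sum_{k=-\infty}^{\infty}\frac{f(x+kh)+f(x-kh)-2f(x)}{2\,(k-0.5)h\,(k+0.5)h}\right) + \left(h\sum_{k=-\infty}^{\infty}\frac{f(x+kh)+f(x-kh)-2f(x)}{2\,(k-0.5)h\,(k+0.5)h} - \sigma(f)\right)\\ &= \varepsilon^*(g) - \frac{f(x)}{h}\sum_{k=-\infty}^{\infty}\frac{1}{(k-0.5)(k+0.5)}\\ &= \varepsilon^*(g), \end{aligned}$$

so that $|\varepsilon(f)| \le |\varepsilon(g)| + c_2h$, and

$$\begin{aligned} |\varepsilon_M(f)| &\le |\varepsilon(f)| + 2\sum_{k=M+1}^{\infty}\frac{c\,h}{(k-0.5)(k+0.5)h^2} = |\varepsilon(f)| + \frac{4c}{(2M+1)h}\\ &\le \frac{c_1e^{-\pi d/h}}{\alpha\sinh(\pi d/h)} + c_2h + \frac{4c}{(2M+1)h}. \end{aligned}$$

Since $M = 2\pi d/(\alpha h^2)$, for $M$ large we have

$$\frac{e^{-\pi d/h}}{\sinh(\pi d/h)} \le 4e^{-2\pi d/h} = 4e^{-\alpha Mh}, \qquad \frac{4c}{(2M+1)h} \le \frac{\alpha c}{\pi d}\,h,$$

and therefore

$$|\varepsilon_M(f)| \le \frac{4c_1}{\alpha}e^{-\alpha Mh} + \left(c_2 + \frac{\alpha c}{\pi d}\right)h \le c_3\left(e^{-\alpha Mh} + h\right). \qquad \text{Q.E.D.}$$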
3. Hadamard Finite Part Integral on the Interval $(-1,1)$
and the Revised Hyperbolic Tangent Quadrature Rule

In [6] the hyperbolic tangent quadrature rule is introduced to approximate regular and
Cauchy principal value integrals on $(-1,1)$. Let us recall the main results of [6] before we discuss
its modification.

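Let $\Omega$ be the open unit disk in the complex plane:

$$\Omega := \{z : |z| < 1\}$$

and let $B_d$ be the family of functions $f(z)$ which are analytic in $\Omega$ and satisfy

$$N(f,\Omega) := \lim_{r\to1^-}\int_{0}^{2\pi}|f(re^{i\theta})|\,d\theta < \infty. \tag{3.1}$$

If $f(x) \in B_d$ and

$$x_k = \frac{e^{kh}-1}{e^{kh}+1}, \tag{3.2}$$

then

$$|\eta(f)| := \left|\int_{-1}^{1}f(x)\,dx - \frac{h}{2}\sum_{k=-\infty}^{\infty}(1-x_k^2)f(x_k)\right| \le \frac{e^{-\pi^2/(2h)}}{2\sinh(\pi^2/(2h))}\,N(f,\Omega); \tag{3.3}$$

also, if $|f(x)| \le c(1-x^2)^{\alpha-1}$ on $(-1,1)$ and $h = \pi/\sqrt{\alpha M}$, then

$$|\eta_M(f)| := \left|\int_{-1}^{1}f(x)\,dx - \frac{h}{2}\sum_{k=-M}^{M}(1-x_k^2)f(x_k)\right| \le c'\,e^{-\pi\sqrt{\alpha M}}. \tag{3.4}$$

Let

$$I(f) := \text{F.P.}\!\int_{-1}^{1}\frac{f(t)}{(t-x)^2}\,dt, \qquad -1 < x < 1, \tag{3.5}$$

and

$$\sigma(f) := \frac{h}{2(1-x^2)}\sum_{k=-\infty}^{\infty}\frac{f\!\left(\frac{x_k+x}{1+x_kx}\right)}{s_{k-0.5}\,s_{k+0.5}}, \tag{3.6}$$

where

$$s_k = \sinh\frac{kh}{2}. \tag{3.7}$$

Similar to Section 2, let

$$\sigma_M(f) := \frac{h}{2(1-x^2)}\sum_{k=-M}^{M}\frac{f\!\left(\frac{x_k+x}{1+x_kx}\right)}{s_{k-0.5}\,s_{k+0.5}}, \tag{3.8}$$

$$\varepsilon(f) := I(f) - \sigma(f), \qquad \varepsilon_M(f) := I(f) - \sigma_M(f). \tag{3.9}$$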

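A fractional linear transform of the complex variable defined by

$$s = \frac{t-x}{1-tx}, \qquad t = \frac{s+x}{1+sx}$$

is introduced. It is a one-to-one analytic mapping from the open unit disk onto itself and leaves
$-1$, $1$ unchanged. We obtain

$$I(f) = \frac{1}{1-x^2}\,\text{F.P.}\!\int_{-1}^{1}\frac{f\!\left(\frac{s+x}{1+sx}\right)}{s^2}\,ds. \tag{3.10}$$

Let

$$g(s,x) = \begin{cases}\dfrac{f\!\left(\frac{s+x}{1+sx}\right)+f\!\left(\frac{x-s}{1-sx}\right)-2f(x)}{2s^2}, & \text{if } s \ne 0,\\[3mm] \tfrac12(1-x^2)^2f''(x) - x(1-x^2)f'(x), & \text{if } s = 0.\end{cases} \tag{3.11}$$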

Our main result in this section is

Theorem 3.1. If $f(x) \in B_d$, $\|f\|_\infty \le c_0$ and

$$|g(s,x)(1-s^2)| \le c_2(x), \qquad \text{for } -1 < x < 1,\ -1 < s < 1, \tag{3.12}$$

then

$$|\varepsilon(f)| \le c_1(x)\left(e^{-\pi^2/h} + h^2\right)$$

and by taking $M = \pi^2/h^2$, we have

$$|\varepsilon_M(f)| \le c_3(x)\left(2e^{-Mh} + h^2\right),$$

where $c_1(x)$, $c_2(x)$ and $c_3(x)$ are functions of $x$, but independent of $s$, $h$ and $M$,

$$c_1(x) = \max\left(8\pi c_0\left(\frac{1}{d^2(x)}+\frac{1}{1-x^2}\right),\ \left(\frac18c_2(x)+\frac{1}{12}c_0\right)\frac{1}{1-x^2}\right), \tag{3.13}$$

$$c_3(x) = \max\left(c_1(x),\ \frac{4c_0}{1-x^2}\right) \tag{3.14}$$

and

$$d(x) = \min(1-x,\ 1+x).$$

We prove this theorem through several lemmas.
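
To make the rule (3.8) concrete, here is a minimal Python sketch of the truncated sum $\sigma_M(f)$, using the nodes $x_k = \tanh(kh/2)$ from (3.2) and $s_k = \sinh(kh/2)$ from (3.7). The function names, the test function $f(t) = 1 - t^2$ and the parameter values are assumptions chosen for illustration; the closed form used for comparison is obtained by differentiating the principal value integral as in (1.3).

```python
import math


def fp_tanh_rule(f, x, h, M):
    """Revised hyperbolic tangent rule for F.P. int_{-1}^{1} f(t)/(t-x)^2 dt.

    sigma_M(f) = h / (2 (1 - x^2)) * sum_{k=-M}^{M} f(t_k) / (s_{k-1/2} s_{k+1/2}),
    with t_k = (x_k + x) / (1 + x_k x), x_k = tanh(k h / 2), s_k = sinh(k h / 2).
    """
    total = 0.0
    for k in range(-M, M + 1):
        xk = math.tanh(0.5 * k * h)
        tk = (xk + x) / (1.0 + xk * x)
        weight = math.sinh(0.5 * (k - 0.5) * h) * math.sinh(0.5 * (k + 0.5) * h)
        total += f(tk) / weight
    return 0.5 * h * total / (1.0 - x * x)


def exact_parabola(x):
    # F.P. int_{-1}^{1} (1 - t^2)/(t - x)^2 dt = 2 x ln((1+x)/(1-x)) - 4
    return 2.0 * x * math.log((1.0 + x) / (1.0 - x)) - 4.0


def parabola(t):
    # Hypothetical test function for this sketch.
    return 1.0 - t * t


approx = fp_tanh_rule(parabola, 0.3, 0.1, 300)
print(approx, exact_parabola(0.3))
```

The half-integer shifts in the weights keep every denominator away from zero, so the collocation point may coincide with a quadrature node.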
Lemma 3.2. If $f(t) \in B_d$, then the Hadamard finite part integral $I(f)$ can be represented as the
sum of $I_1$ and $I_2$,

$$I_1 = \frac{1}{1-x^2}\int_{-1}^{1}g(s,x)\,ds, \tag{3.15}$$

$$I_2 = \frac{-2f(x)}{1-x^2}. \tag{3.16}$$

Here $I_1$ is a regular integral, $g(s,x) \in B_d$ and

$$N(g(s,x),\Omega) \le \frac{2(1-x^2)}{d^2(x)}\,N(f,\Omega) + 4\pi|f(x)|.$$

Proof. From [8] (page 62), the Hadamard finite part integral can be regarded as a pseudofunction,
that is, a functional defined on the test space $C_0^\infty(-\infty,\infty)$: for any function $\phi(s) \in
C_0^\infty(-\infty,\infty)$,

$$\text{F.P.}\!\int_{-1}^{1}\frac{\phi(s)}{s^2}\,ds := \int_{-1}^{1}\frac{\int_{0}^{s}\phi''(r)(s-r)\,dr}{s^2}\,ds - 2\phi(0) = \int_{-1}^{1}\frac{\phi(s)-\phi(0)-\phi'(0)s}{s^2}\,ds - 2\phi(0)$$

and

$$\text{F.P.}\!\int_{-1}^{1}\frac{\phi(s)}{s^2}\,ds = \int_{-1}^{1}\frac{\phi(s)+\phi(-s)-2\phi(0)}{2s^2}\,ds - 2\phi(0).$$

This functional can be generalized to any continuous function with finite support in $(-\infty,\infty)$.
Particularly,

$$\text{F.P.}\!\int_{-1}^{1}\frac{ds}{s^2} = -2.$$


In  fact,  let 

1,  if  |  s  |<  1  +  a, 

0,  if  |  s  |>  l  +  2a, 

.  if  -  1  -  2a  <  s  <  -1  -  a, 
—  ~  ~f> ,  if  l  +  a  <  s  <  1  +  2a. 

We  can  construct  a  sequence  {cb,(.s)},  <b,(*)  €  C^(-oc,  co)  and 


/(*)  = 


0n(«)  — '  1,  <j>, , ( * )  —  0,  <p"(a)  0  uniformly  on  [—1,  1]  as  n  —  oc, 


where 


t’rAs)  =  f  !(r)~<n(s  -  r)dr , 

J  —  OO 
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7»(0  =  foa-^^rr. 
IZo  C (nt)dt 


C(i)  = 


Therefore, 


0,  if  |t  |>  1, 
e'3-' ,  if  1 1  |<  1. 

i ■ _  f1  Ms)_ 


r  ±d,  =  lim  r  *4>„» 

j-oo  «-  n^°°y-i  s2 


=  lim  (  f 
n-°°  y_i 


/o  #!(r)(*  - 


*-2^„(0))  =  -2. 


Let  fli(s)  =  we  have 


,(f)  =  rhL, 


1  r1  /(*&) 


1  /“  /(f^)+/(Tf^)*-2/(x)  ,  „/(*) 

=  rr^/_1 - w - ds~2r^ 

=  h  +  /2- 


Since $s \mapsto (s+x)/(1+sx)$ is a fractional linear transform from the open unit disk onto itself, $f\!\left(\frac{s+x}{1+sx}\right)$ is analytic
with respect to $s$ in $|s| < 1$ and $g(s,x)$ is too. Furthermore,

f 2* 

(3.18)  N{g(s,x),Qd)  =  lim  /  |  g(re‘e ,  x)  |  d9 

r~l~  J0 

r2*  rei»  +  x  r 2*  _red«  +  x 

-  r-T-(i  ]fiTT7^)ld0  +  I  l'<T^^)l")  +  4,r|/(*,,■ 

where  s  =  re‘tf.  Let  <  =  pe’^,  we  have 


1  —  x2 

is  =  ’ 
rfs  =  — —  nf 11 — dp 

(t-x)(l-tx)  * 


lim  / 
r—  I_  Jo 


2,r  re's  -L  x  /- 

l/(r- - rj)\d0=  lim  / 

1  +  zre,#  r-i-  J0 


i  "  (p^-Turr^r;^  i 


<  “r— *(/.«,). 

rf-(x) 


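We can obtain the same estimate for $\lim_{r\to1^-}\int_{0}^{2\pi}\left|f\!\left(\frac{x-re^{i\theta}}{1-xre^{i\theta}}\right)\right|d\theta$. Therefore,

$$N(g(s,x),\Omega) \le \frac{2(1-x^2)}{d^2(x)}\,N(f,\Omega) + 4\pi|f(x)|. \qquad \text{Q.E.D.}$$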
Lemma 3.3. If $|g(s,x)(1-s^2)| \le c_2(x)$ for $-1 < x < 1$ and

$$\varepsilon(g) := \int_{-1}^{1}g(s,x)\,ds - \frac{h}{2}\sum_{k=-\infty}^{\infty}\frac{x_k^2\,g(x_k,x)}{s_{k-0.5}\,s_{k+0.5}},$$

then

$$|\varepsilon(g)| \le \frac{e^{-\pi^2/(2h)}}{2\sinh(\pi^2/(2h))}\,N(g(s,x),\Omega) + \frac18c_2(x)h^2.$$

Proof. By (3.3), we have

$$\left|\int_{-1}^{1}g(s,x)\,ds - \frac{h}{2}\sum_{k=-\infty}^{\infty}g(x_k,x)(1-x_k^2)\right| \le \frac{e^{-\pi^2/(2h)}}{2\sinh(\pi^2/(2h))}\,N(g(s,x),\Omega).$$

By using the identity

$$\sum_{k=-\infty}^{\infty}\frac{1}{s_{k-0.5}\,s_{k+0.5}} = \frac{-2}{\sinh(h/2)},$$

we obtain

$$(1-x_k^2) - \frac{x_k^2}{s_{k-0.5}\,s_{k+0.5}} = \frac{1-\cosh(h/2)}{2}\cdot\frac{1-x_k^2}{s_{k-0.5}\,s_{k+0.5}}.$$

Therefore, by (3.12),

$$\left|\frac{h}{2}\sum_{k=-\infty}^{\infty}g(x_k,x)(1-x_k^2) - \frac{h}{2}\sum_{k=-\infty}^{\infty}\frac{x_k^2\,g(x_k,x)}{s_{k-0.5}\,s_{k+0.5}}\right| \le \frac18c_2(x)h^2.$$

By using the triangle inequality, we obtain the estimate of $\varepsilon(g)$. Q.E.D.
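Proof of Theorem 3.1.

Since $|f(x)| \le c_0$, by using (3.1) and Lemma 3.2, we have

$$N(f,\Omega) \le 2\pi c_0$$

and

$$N(g,\Omega) \le \left(\frac{1-x^2}{d^2(x)}+1\right)4\pi c_0.$$

Now, writing $\sigma(f)$ as the sum of $\sigma_1$ and $\sigma_2$,

$$\sigma_1 = \frac{h}{2(1-x^2)}\sum_{k=-\infty}^{\infty}\frac{f\!\left(\frac{x_k+x}{1+x_kx}\right)+f\!\left(\frac{x-x_k}{1-x_kx}\right)-2f(x)}{2\,s_{k-0.5}\,s_{k+0.5}} = \frac{h}{2(1-x^2)}\sum_{k=-\infty}^{\infty}\frac{x_k^2\,g(x_k,x)}{s_{k-0.5}\,s_{k+0.5}},$$

$$\sigma_2 = \frac{h\,f(x)}{2(1-x^2)}\sum_{k=-\infty}^{\infty}\frac{1}{s_{k-0.5}\,s_{k+0.5}},$$

using Lemma 3.2,

$$\varepsilon(f) = (I_1-\sigma_1) + (I_2-\sigma_2) = \frac{\varepsilon(g)}{1-x^2} - \frac{f(x)}{1-x^2}\left(2 + \frac{h}{2}\sum_{k=-\infty}^{\infty}\frac{1}{s_{k-0.5}\,s_{k+0.5}}\right),$$

and using the estimate

$$\left|2 + \frac{h}{2}\sum_{k=-\infty}^{\infty}\frac{1}{s_{k-0.5}\,s_{k+0.5}}\right| = \left|2 - \frac{h}{\sinh(h/2)}\right| \le \frac{h^2}{12},$$

we have

$$\begin{aligned} |\varepsilon(f)| &\le \frac{1}{1-x^2}\,|\varepsilon(g)| + \frac{c_0}{1-x^2}\cdot\frac{h^2}{12}\\ &\le \frac{1}{1-x^2}\left(\frac{e^{-\pi^2/(2h)}}{2\sinh(\pi^2/(2h))}\left(\frac{1-x^2}{d^2(x)}+1\right)4\pi c_0 + \frac18c_2(x)h^2\right) + \frac{c_0}{1-x^2}\cdot\frac{h^2}{12}\\ &\le c_1(x)\left(e^{-\pi^2/h}+h^2\right), \end{aligned}$$

where $c_1(x)$ is defined in (3.13). Furthermore

$$\begin{aligned} |\varepsilon_M(f)| &\le |\varepsilon(f)| + \frac{c_0h}{2(1-x^2)}\sum_{|k|>M}\frac{1}{s_{k-0.5}\,s_{k+0.5}}\\ &\le |\varepsilon(f)| + \frac{c_0h}{(1-x^2)\sinh(h/2)}\left(\frac{1}{\tanh\bigl((M+0.5)h/2\bigr)} - 1\right)\\ &\le c_3(x)\left(e^{-\pi^2/h} + h^2 + e^{-Mh}\right), \end{aligned}$$

where $c_3(x)$ is defined in (3.14). By taking $M = \pi^2/h^2$,

$$|\varepsilon_M(f)| \le c_3(x)\left(2e^{-Mh} + h^2\right). \qquad \text{Q.E.D.}$$

Note that the upper bound $c_2(x)$ in (3.12) depends on the behavior of the function $f(t)$. $c_2(x)$
may become unbounded when $x$ goes to $\pm1$.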

4.  The  Solvability  of  the  Discrete  System  of  Equations 

For the finite part integral equation (1.1), if $k(x,t) = 0$, the discrete functional equation given by
the revised hyperbolic tangent quadrature rule will be

$$\frac{h}{2(1-x^2)}\sum_{k=-M}^{M}\frac{f\!\left(\frac{x_k+x}{1+x_kx}\right)}{s_{k-0.5}\,s_{k+0.5}} = F(x), \tag{4.1}$$

where $x_k = \tanh\frac{kh}{2}$.

We can choose $x = x_j$ as the collocation points. Since

$$\frac{x_k+x_j}{1+x_kx_j} = x_{k+j}, \tag{4.2}$$

the collocation points are the same as the quadrature nodes. Let $l = k + j$, we have

$$\frac{h}{2}\sum_{l=-M+j}^{M+j}\frac{f(x_l)}{s_{l-j-0.5}\,s_{l-j+0.5}} = (1-x_j^2)F(x_j), \qquad \text{for } j = -M, -M+1, \cdots, -1, 0, 1, \cdots, M. \tag{4.3}$$

There are $2M+1$ equations. In order to keep the number of the unknowns $f(x_l)$ the same as the
number of equations, we prefer to ignore those quadrature nodes which are too close to the ends $-1$
or $1$. It is reasonable to take the range of $x_l$ as from $x_{-M}$ to $x_M$ and we have

$$\frac{h}{2}\sum_{l=-M}^{M}\frac{f(x_l)}{s_{l-j-0.5}\,s_{l-j+0.5}} = (1-x_j^2)F(x_j), \qquad \text{for } j = -M, \cdots, -1, 0, 1, \cdots, M. \tag{4.4}$$

Let

$$\Phi = \frac{h}{2}\bigl(f(x_{-M}), f(x_{-M+1}), \cdots, f(x_M)\bigr)^T$$

and

$$F = \bigl((1-x_{-M}^2)F(x_{-M}),\ (1-x_{-M+1}^2)F(x_{-M+1}),\ \cdots,\ (1-x_M^2)F(x_M)\bigr)^T.$$

The discrete linear system of equations is

$$A\Phi = F, \tag{4.5}$$

where

$$A = (a_{jl})_{(2M+1)\times(2M+1)}, \qquad j = -M, \cdots, M, \quad l = -M, \cdots, M,$$

$$a_{jl} = \frac{1}{s_{l-j-0.5}\,s_{l-j+0.5}}. \tag{4.6}$$

The following steps concentrate on proving that $A$ is invertible.

Since

$$a_{jl} = \frac{4}{\left(e^{(l-j-0.5)h/2}-e^{-(l-j-0.5)h/2}\right)\left(e^{(l-j+0.5)h/2}-e^{-(l-j+0.5)h/2}\right)} = \frac{4p^{\,j+l}}{(p^{\,l-0.5}-p^{\,j})(p^{\,l+0.5}-p^{\,j})},$$

where

$$p = e^{h} > 0,$$

the matrix $A$ can be represented as the product $A = 4DBD$, with

$$D = \operatorname{diag}(p^{-M}, p^{-M+1}, \cdots, p^{M}), \qquad B = (b_{jl})_{(2M+1)\times(2M+1)},$$

$$b_{jl} = \frac{1}{(p^{\,l-0.5}-p^{\,j})(p^{\,l+0.5}-p^{\,j})}. \tag{4.7}$$
We only need to prove that $B$ is nonsingular.

If $B$ is an $n \times n$ matrix, $b_{ij} = \frac{1}{x_i-y_j}$, where all $\{x_i\}_1^n$ and $\{y_j\}_1^n$ are distinct, then

$$\det(B) \ne 0.$$

$B$ is called the Cauchy matrix [9]. In fact,

$$\det(B) = (-1)^{n(n-1)/2}\,\frac{\prod_{i<j}(x_i-x_j)(y_i-y_j)}{\prod_{i,j=1}^{n}(x_i-y_j)}. \tag{4.8}$$
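
The closed form for the determinant of a Cauchy matrix with entries $1/(x_i - y_j)$ can be checked on a small case. The Python sketch below uses exact rational arithmetic; the sample points and the helper names `det` and `cauchy_det_formula` are assumptions made for this illustration and do not come from the paper.

```python
from fractions import Fraction


def det(matrix):
    """Determinant by Gaussian elimination over exact rationals."""
    a = [row[:] for row in matrix]
    n = len(a)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result  # a row swap flips the sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return result


def cauchy_det_formula(xs, ys):
    """(-1)^(n(n-1)/2) * prod_{i<j}(x_i-x_j)(y_i-y_j) / prod_{i,j}(x_i-y_j)."""
    n = len(xs)
    num = Fraction(1)
    for i in range(n):
        for j in range(i + 1, n):
            num *= (xs[i] - xs[j]) * (ys[i] - ys[j])
    den = Fraction(1)
    for x in xs:
        for y in ys:
            den *= x - y
    sign = -1 if (n * (n - 1) // 2) % 2 else 1
    return sign * num / den


# Hypothetical distinct sample points.
xs = [Fraction(v) for v in (1, 3, 6, 10)]
ys = [Fraction(v) for v in (2, 5, 7, 12)]
cauchy = [[1 / (x - y) for y in ys] for x in xs]
print(det(cauchy), cauchy_det_formula(xs, ys))
```

Because every factor in the numerator is nonzero when the points are distinct, the determinant cannot vanish.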

The set of functions $\left\{\frac{1}{x_i-t}\right\}_1^n$ is linearly independent, that is, if there are constants $c_1, c_2, \cdots, c_n$
such that

$$\sum_{i=1}^{n}\frac{c_i}{x_i-t} = 0, \tag{4.9}$$

then $c_1 = c_2 = \cdots = c_n = 0$.

By [10], we can say that the set of functions $\left\{\frac{1}{x_i-t}\right\}_1^n$ satisfies the Haar condition. Now we
generalize these concepts.

Definition 4.1. If $\{x_i\}_0^n$ and $\{y_j\}_1^n$ are two groups of numbers, we call the matrix

$$B = (b_{ij})_{n\times n} = \left(\frac{1}{(x_{i-1}-y_j)(x_i-y_j)}\right)_{n\times n} \tag{4.10}$$

the generalized Cauchy matrix.

Lemma 4.2. If all $\{x_i\}_0^n$ and $\{y_j\}_1^n$ are distinct, then the generalized Cauchy matrix is nonsingular.
Proof. If the $B$ in (4.10) is singular, then there are $c_1, c_2, \cdots, c_n$, not all zero, such that

$$\sum_{i=1}^{n}\frac{c_i}{(x_{i-1}-y_j)(x_i-y_j)} = 0, \qquad \text{for } j = 1, 2, \cdots, n.$$

Let

$$p(t) = \prod_{i=0}^{n}(x_i-t);$$

$p(t)$ is a polynomial of degree $n+1$. Consider

$$\phi(t) = \sum_{i=1}^{n}\frac{c_i\,p(t)}{(x_{i-1}-t)(x_i-t)},$$

where $\phi(t)$ is a polynomial of degree $n-1$. Since $\phi(y_j) = 0$ for $j = 1, 2, \cdots, n$, we have

$$\phi(t) \equiv 0.$$

That is

$$\sum_{i=1}^{n}\frac{c_i}{(x_{i-1}-t)(x_i-t)} = 0$$

or

$$\sum_{i=1}^{n}\frac{c_i}{x_i-x_{i-1}}\left(\frac{1}{x_{i-1}-t}-\frac{1}{x_i-t}\right) = 0.$$

By rearrangement of the summation, we obtain

$$\frac{c_1}{x_1-x_0}\cdot\frac{1}{x_0-t} + \sum_{i=1}^{n-1}\left(\frac{c_{i+1}}{x_{i+1}-x_i}-\frac{c_i}{x_i-x_{i-1}}\right)\frac{1}{x_i-t} - \frac{c_n}{x_n-x_{n-1}}\cdot\frac{1}{x_n-t} = 0. \tag{4.11}$$

Since the set of functions $\left\{\frac{1}{x_i-t}\right\}_0^n$ is linearly independent (it satisfies the Haar condition), all

$$\frac{c_1}{x_1-x_0}, \qquad \left\{\frac{c_{i+1}}{x_{i+1}-x_i}-\frac{c_i}{x_i-x_{i-1}}\right\}_{i=1}^{i=n-1}, \qquad \frac{c_n}{x_n-x_{n-1}}$$

must be zero, that is

$$c_1 = c_2 = \cdots = c_n = 0. \qquad \text{Q.E.D.}$$

Since the matrix $B$ in (4.7) is a generalized Cauchy matrix (with $x_l = p^{\,l+0.5}$ and $y_j = p^{\,j}$), and the $\{p^{\,l\pm0.5}\}$ and the $\{p^{\,j}\}$ are two
groups of distinct points, by using this lemma $B$ must be nonsingular. Hence the following theorem
is obvious.

Theorem  4.3.  The  matrix  A  in  (4.6)  is  invertible. 

5.  Numerical  Results 

We  give  here  two  applications  of  the  method  presented  in  the  preceding  sections.  Although  the 
matrix  A  is  a  symmetric  matrix,  we  use  the  general  Gaussian  elimination  algorithm  with  single 
precision. 

Example 1. Consider the following Hadamard finite part integral equation,

$$\frac{1}{\pi}\,\text{F.P.}\!\int_{-1}^{1}\frac{f(t)}{(t-x)^2}\,dt = F(x),$$

where $F(x) = x^2 + 4x - 0.25$. The exact analytic solution is $f(t) = -\frac13\sqrt{1-t^2}\,(t^2 + 6t - 0.25)$. We
select $M = 16$ and $h = 0.8$. The results are shown in Table 1.

Example 2. Consider the same equation with the right hand side $F(x) = \frac{2}{\pi}\left(x\ln\frac{1+x}{1-x} - 2\right)$. The
exact analytic solution is $f(t) = 1 - t^2$. We choose $M = 13$, $h = 1.0$ and $M = 16$, $h = 0.8$.
The results are shown in Table 2 and Table 3. Since the solution is an even function, we only show
half of the data.
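
The discrete system (4.4)-(4.6) for Example 2 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the factor $1/\pi$ of the equation is moved to the right-hand side, the relation $\ln\frac{1+x_j}{1-x_j} = jh$ (a consequence of $x_j = \tanh(jh/2)$) is used to evaluate $F(x_j)$, and double precision with partial pivoting replaces the single precision elimination mentioned in the text.

```python
import math


def solve_linear_system(a, b):
    """Gaussian elimination with partial pivoting (pure Python)."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = aug[r][n] - sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = s / aug[r][r]
    return x


def solve_example_2(M, h):
    """Collocation at x_j = tanh(j h / 2), j = -M..M, for f(t) = 1 - t^2."""
    idx = range(-M, M + 1)
    nodes = [math.tanh(0.5 * j * h) for j in idx]

    def s(k):
        return math.sinh(0.5 * k * h)

    # a_{jl} = 1 / (s_{l-j-1/2} s_{l-j+1/2}), as in (4.6)
    a = [[1.0 / (s(l - j - 0.5) * s(l - j + 0.5)) for l in idx] for j in idx]
    # (1 - x_j^2) * pi * F(x_j), with pi * F(x) = 2 x ln((1+x)/(1-x)) - 4
    # and ln((1 + x_j)/(1 - x_j)) = j h.
    rhs = [(1.0 - x * x) * (2.0 * x * j * h - 4.0) for j, x in zip(idx, nodes)]
    phi = solve_linear_system(a, rhs)  # phi_l = (h/2) f(x_l)
    return nodes, [2.0 * p / h for p in phi]


nodes, f = solve_example_2(13, 1.0)
print(nodes[13], f[13])  # centre node x = 0
```

For comparison, Table 2 lists the value 1.065347 at the node $x = 0$ for $h = 1.0$, against the exact value 1.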
Table 1.

Nodes        Exact solution   Numerical solution   Absolute error   Relative error
-0.999995     0.005336         0.004071            -0.001265        23.71%
-0.999988     0.008523         0.007710            -0.000814         9.55%
-0.999973     0.012860         0.012462            -0.000397         3.09%
-0.999940     0.019163         0.019203             0.000040         0.21%
-0.999865     0.028797         0.029051             0.000253         0.88%
-0.999699     0.042951         0.043598             0.000647         1.51%
-0.999330     0.064019         0.065190             0.001171         1.83%
-0.998509     0.095427         0.097277             0.001849         1.94%
-0.996683     0.142066         0.144885             0.002819         1.98%
-0.992632     0.210851         0.215146             0.004295         2.04%
-0.983675     0.310984         0.317533             0.006549         2.11%
-0.964028     0.452289         0.462418             0.010129         2.24%
-0.921669     0.637646         0.653708             0.016062         2.52%
-0.833655     0.838912         0.864813             0.025901         3.09%
-0.664037     0.945410         0.984484             0.039074         4.13%
-0.379949     0.735484         0.777139             0.041656         5.66%
-0.000000     0.083333         0.091056             0.007723         9.27%
 0.379949    -0.670339        -0.708051            -0.037712         5.63%
 0.664037    -1.040588        -1.087179            -0.046591         4.48%
 0.833655    -1.002749        -1.036427            -0.033677         3.36%
 0.921668    -0.792702        -0.813875            -0.021173         2.67%
 0.964028    -0.572672        -0.585929            -0.013257         2.31%
 0.983675    -0.397081        -0.405581            -0.008500         2.14%
 0.992631    -0.270259        -0.275803            -0.005544         2.05%
 0.996682    -0.182408        -0.180038            -0.003630         1.99%
 0.998508    -0.122649        -0.125002            -0.002353         1.92%
 0.999529    -0.082330        -0.083799            -0.001469         1.78%
 0.999699    -0.055227        -0.056053            -0.000826         1.50%
 0.999865    -0.037031        -0.037354            -0.000322         0.87%
 0.999939    -0.024833        -0.024696             0.000137         0.55%
 0.999973    -0.016643        -0.016028             0.000615         3.69%
 0.999988    -0.011177        -0.009917             0.001260        11.27%
 0.999994    -0.007492        -0.005239             0.002253        30.07%
Table 2.

Nodes       Exact solution   Numerical solution   Absolute error   Relative error
-0.99991     0.000180         0.000158            -0.000022        12.36%
-0.99975     0.000493         0.000482            -0.000011         2.17%
-0.99933     0.001340         0.001336            -0.000004         0.28%
-0.99818     0.003641         0.003638            -0.000002         0.06%
-0.99505     0.009866         0.009870             0.000004         0.04%
-0.98661     0.026592         0.026631             0.000039         0.14%
-0.96403     0.070650         0.070934             0.000284         0.40%
-0.90515     0.180706         0.182594             0.001888         1.04%
-0.76159     0.419974         0.430501             0.010527         2.50%
-0.46212     0.786448         0.825407             0.038960         4.95%
 0.00000     1.000000         1.065347             0.065347         6.53%
 0.46212     0.786448         0.825408             0.038960         4.95%
 0.76159     0.419974         0.430501             0.010527         2.50%
 0.90515     0.180707         0.182594             0.001888         1.04%
 0.96403     0.070651         0.070934             0.000283         0.40%
 0.98661     0.026592         0.026630             0.000038         0.14%
 0.99505     0.009866         0.009868             0.000002         0.02%
 0.99818     0.003641         0.003636            -0.000005         0.14%
 0.99933     0.001341         0.001332            -0.000009         0.69%
 0.99975     0.000494         0.000478            -0.000015         3.11%
 0.99991     0.000182         0.000154            -0.000027        15.06%


Table 3.

Nodes        Exact solution   Numerical solution   Absolute error
0.0000000    1.0000000        1.0411291             0.0411291
0.3799490    0.8556388        0.8853009             0.0296621
0.6640366    0.5590555        0.5713608             0.0123053
0.8336545    0.3050202        0.3086018             0.0035816
0.9216684    0.1505274        0.1513890             0.0008616
0.9640275    0.0706509        0.0708401             0.0001892
0.9836748    0.0323839        0.0324238             0.0000400
0.9926315    0.0146828        0.0146910             0.0000082
0.9966823    0.0066243        0.0066258             0.0000015
0.9985079    0.0029820        0.0029820             0.0000000
0.9993293    0.0013410        0.0013406            -0.0000005
0.9996985    0.0006029        0.0006021            -0.0000008
0.9998645    0.0002710        0.0002701            -0.0000009
0.9999391    0.0001218        0.0001207            -0.0000012
0.9999726    0.0000547        0.0000535            -0.0000012
0.9999877    0.0000247        0.0000228            -0.0000018
0.9999945    0.0000111        0.0000086            -0.0000025

We give some final comments on the numerical results. First, since both the collocation points
and the quadrature nodes cluster around the endpoints of the interval, the absolute error at the
middle points is somewhat larger. By increasing the number of nodes, the accuracy can be improved.
Second, although the nodes are very dense around the end points, the relative error at the nodes
in the most extreme positions is large. The reason is that the outside quadrature nodes have been
neglected when we construct the system of equations. However, the points in the second and the
third nodal positions are so close to the points in the extreme positions that discarding the
values computed at the most extreme positions does not affect the overall picture.
DOMAIN DECOMPOSITION METHODS FOR NONSELFADJOINT

OPERATORS*

ZBIGNIEW LEYK†

Abstract. We consider the problem of solving the system of equations arising from the discretization
of nonselfadjoint elliptic boundary value problems via the finite element method. It can be solved
with the aid of exact or approximate solvers for the same equations restricted to subregions. The
interactions between the subregions, which enforce appropriate continuity requirements, are handled by
an iterative method. If we want to use the conjugate gradient method, we must transform the primary
system of equations so as to get a symmetric system of equations. We show how this can be done so as to
get a small condition number of the transformed system of equations. We also discuss computational
aspects of the problem.

Key  Words.  Domain  decomposition,  additive  Schwarz  method,  nonsymmetric  linear  equations, 
finite  elements,  elliptic  equations 

AMS(MOS)  subject  classifications:  65N30,  65F10. 


1.  Introduction.  Domain  decomposition  methods  have  recently  become  one  of 
the  most  powerful  methods  for  solving  partial  differential  equations.  Most  of  them  are 
different  variants  of  the  additive  Schwarz  method,  see  [8],  which  in  turn  is  very  simple 
and  easy  to  implement  on  computers  (including  parallel  computers).  The  additive 
Schwarz  method  was  introduced  in  [6]  and  [9]  and  subsequently  studied  in  [5,  7,  8].  In 
those  papers  only  symmetric  equations  were  considered,  whereas  in  [3]  nonsymmetric 
equations  have  been  analyzed. 

The main property of the additive Schwarz method (ASM) is that it divides the given
problem into a number of smaller equivalent problems. The interaction between these
smaller problems is handled by an iterative method. We choose a division of the primary
problem such that the condition number of the transformed (preconditioned) problem
is independent of the number of the smaller problems and of some other parameters
like $H$ (size of a coarse mesh) and $h$ (size of a fine mesh). If the primary problem is
symmetric, then the conjugate gradient method is used. But if it is nonsymmetric, it is
rather difficult to decide on the best iterative method. In [3], where nonsymmetric
positive definite differential problems are considered, the GMRES method is used.
Moreover, for symmetric problems of the form $Aw = f$, the inner product $(w,v)_A =
(Aw,v)$ has been used ($(\cdot,\cdot)$ is the usual $L^2$ inner product). This inner product cannot
be applied when $A$ is nonsymmetric and we must use another one. In [3] the $(w,v)_{A_S} =$

* This work was supported by the U.S. Army Research Office through the Mathematical Sciences
Institute of Cornell University.

† MSI, 409 College Avenue, Cornell University, Ithaca, N.Y. 14853; on leave from University of
Warsaw, Institute of Computer Science, 00-901 Warsaw, PKiN, p. 850, Poland.




$(A_Sw,v)$ inner product has been used, where $A_S$ is the symmetric part of $A$. We
show in this paper that for ASM there is another convenient inner product of the form
$(w,v)_{R^{-1}} = (R^{-1}w,v)$, where $R$ is an operator related to the decomposition of the
primary problem.

The  outline  of  this  paper  is  as  follows.  We  present  in  Sect.  3  an  iterative  method  to 
solve  nonsymmetric  preconditioned  problems.  It  is  the  CG  method  with  a  special  inner 
product  applied  to  a  symmetrized  preconditioned  problem.  The  way  of  transforming 
nonsymmetric  equations  into  symmetric  ones  (symmetrization)  is  described  in  [2].  We 
prove  that  the  iterative  method  is  convergent  and  the  rate  of  convergence  is  given.  In 
Sect. 4 we show that for the system of linear equations resulting from the finite element
method applied to a Dirichlet boundary value problem (the primary problem) the rate
of convergence of this method is independent of the number of smaller problems (or
subregions)  and  of  H  and  h.  Note  that  we  do  not  impose  the  condition  that  the  coarse 
mesh  should  be  fine  enough  to  get  the  convergence  of  the  iterative  methods  as  in  [3] 
(we  have  no  upper  bound  on  H).  In  Sect.  5  we  consider  some  numerical  examples. 

2. Preliminaries. Consider a linear equation of the form

$$Aw = f, \tag{2.1}$$

where $A : W \to W$ is a linear operator, $w, f \in W$ and $W$ is a finite-dimensional Hilbert
space with an inner product $(\cdot,\cdot)$ and a norm $\|\cdot\|$. Finite element or finite difference
discretizations of an elliptic boundary value problem lead to problems of this type.

Let $D : W \to W$ be a linear, symmetric and positive definite operator. Define the
inner product $(w,v)_D = (Dw,v)$ and the norm $\|v\|_D = \sqrt{(Dv,v)}$.

We assume that the operator $A$ satisfies the following conditions:

$$m\|w\|_D^2 \le (Aw,w) \le M\|w\|_D^2, \qquad \forall w \in W, \tag{2.2}$$

$$(D^{-1}Aw,Aw) \le K\,(Aw,w), \qquad \forall w \in W, \tag{2.3}$$

where $0 < m \le M$, $K > 0$. Note that $A$ need not be symmetric. We introduce the
norm $\|v\|_{D^{-1}}$ defined as

$$\|v\|_{D^{-1}} = \sup_{w\in W}\frac{|(v,w)|}{\|w\|_D} \tag{2.4}$$

and hence for any $v, w \in W$

$$|(v,w)| \le \|v\|_{D^{-1}}\|w\|_D. \tag{2.5}$$

We have the equality $\|v\|_{D^{-1}} = \sqrt{(D^{-1}v,v)}$. Note that (2.3) can be rewritten as
$\|Aw\|_{D^{-1}}^2 \le K\,(Aw,w)$.
Remark 2.1. Condition (2.3) is equivalent to

$$(D^{-1}w,w) \le K\,(A^{-1}w,w), \qquad \forall w \in W.$$

If $A$ is symmetric then $K = M$ (it follows from (2.2)). □

Remark 2.2.  Let $A^T$ be the transpose of $A$ with respect to $(\cdot,\cdot)$, i.e., $(A^T u, v) =
(u, Av)$ for any $u, v \in W$. It is evident that $A = A_S + A_N$, where $A_S = \frac12(A + A^T)$ (the
symmetric part of $A$) and $A_N = \frac12(A - A^T)$ (the skew-symmetric part of $A$). Assume
that

(2.6)  $(D^{-1}A_N w, A_N w) \le K_1 (Aw, w)$,  $\forall w \in W$,

where $K_1 > 0$. We show that $K$ depends on $K_1$ and $M$.

First we notice that $(A_S w, w) = (Aw, w)$ for any $w \in W$. We have

(2.7)  $(D^{-1}Aw, Aw) = (D^{-1}A_S w, A_S w) + 2(D^{-1}A_S w, A_N w) + (D^{-1}A_N w, A_N w)$.

It is easy to show that $(A_S v, v) \le M(Dv, v)$ is equivalent to $(D^{-1}v, v) \le M(A_S^{-1}v, v)$
for any $v \in W$. Hence

(2.8)  $(D^{-1}A_S w, A_S w) \le M(A_S^{-1}A_S w, A_S w) = M(Aw, w)$.

Further

(2.9)  $(D^{-1}A_S w, A_N w) \le (D^{-1}A_S w, A_S w)^{1/2}\,(D^{-1}A_N w, A_N w)^{1/2}$.

Combining (2.7) with (2.6), (2.8) and (2.9) we get

$(D^{-1}Aw, Aw) \le M(Aw,w) + 2\sqrt{M K_1}\,(Aw,w) + K_1(Aw,w) = (\sqrt{M} + \sqrt{K_1})^2 (Aw,w)$.

From the above it follows that we can set $K = (\sqrt{M} + \sqrt{K_1})^2$.  □
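The bound of Remark 2.2 can be checked numerically. The following is only a sketch under stated assumptions: real matrices with the Euclidean inner product, the choice $D = A_S$ (the choice made later in Sect. 4.3), and the helper names `sup_ratio` and `remark_2_2_constants`, which are hypothetical and not taken from the paper. With $D = A_S$ the cross term in (2.7) vanishes, so the sharp constant is $K = 1 + K_1$, which is below the general bound $(\sqrt{M} + \sqrt{K_1})^2$.

```python
import numpy as np

def sup_ratio(B, A_s):
    """sup_w (Bw, w) / (A_s w, w) for symmetric B and SPD A_s."""
    L = np.linalg.cholesky(A_s)            # A_s = L L^T
    L_inv = np.linalg.inv(L)
    # Largest eigenvalue of L^{-1} B L^{-T} is the generalized Rayleigh supremum.
    return np.linalg.eigvalsh(L_inv @ B @ L_inv.T).max()

def remark_2_2_constants(A):
    """Return (M, K1, K) of (2.2), (2.6), (2.3) for D = A_S (assumed SPD)."""
    A_s = 0.5 * (A + A.T)                  # symmetric part
    A_n = 0.5 * (A - A.T)                  # skew-symmetric part
    D_inv = np.linalg.inv(A_s)             # D = A_S
    M = sup_ratio(A_s, A_s)                # (Aw, w) = (A_S w, w), so M = 1 here
    K1 = sup_ratio(A_n.T @ D_inv @ A_n, A_s)
    K = sup_ratio(A.T @ D_inv @ A, A_s)
    return M, K1, K

# Illustrative test matrix: SPD part plus a skew-symmetric perturbation.
rng = np.random.default_rng(0)
G = rng.standard_normal((6, 6))
H = rng.standard_normal((6, 6))
A_demo = G @ G.T + 6.0 * np.eye(6) + (H - H.T)
M_demo, K1_demo, K_demo = remark_2_2_constants(A_demo)
```

Comparing `K_demo` with `(np.sqrt(M_demo) + np.sqrt(K1_demo))**2` shows how much room the estimate of the remark leaves for this particular matrix.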

Remark 2.3.  If we have the following estimates

(2.10)  $m_1\|w\|_D^2 \le (Aw,w)$,  $\forall w \in W$,

and

(2.11)  $|(Aw,v)| \le M_1\|w\|_D\,\|v\|_D$,  $\forall w, v \in W$,

then we can take $m = m_1$, $M = M_1$ and $K = m_1^{-1}M_1^2$ in (2.2) and (2.3). We show that
the inequalities (2.10) and (2.11) imply (2.3). Taking $v = D^{-1}Aw$ in (2.11) we get

$|(Aw, D^{-1}Aw)| \le M_1\|w\|_D\,\|D^{-1}Aw\|_D$.

But $\|D^{-1}Aw\|_D = \|Aw\|_{D^{-1}}$ and the left-hand side equals $\|Aw\|_{D^{-1}}^2$, so
$\|Aw\|_{D^{-1}} \le M_1\|w\|_D$ and hence

$|(Aw, D^{-1}Aw)| \le M_1^2\|w\|_D^2$.

Since $\|w\|_D^2 \le m_1^{-1}(Aw,w)$, then

$|(Aw, D^{-1}Aw)| \le m_1^{-1}M_1^2\,(Aw,w)$.

Usually the constant $K$ is smaller than $m_1^{-1}M_1^2$.

It is also possible to show that (2.2) and (2.3) imply (2.11). We have

$(Au,v) \le \|Au\|_{D^{-1}}\|v\|_D \le \sqrt{KM}\,\|u\|_D\,\|v\|_D$,

since $(D^{-1}Au, Au) \le K(Au,u) \le KM\|u\|_D^2$. We can take $M_1 = \sqrt{KM}$ and $m_1 =
m$. From the above considerations it follows that the conditions (2.2) and (2.3) are
equivalent to (2.10) and (2.11). The condition (2.3) is important only for nonsymmetric
$A$. If $A$ is symmetric it follows from (2.2).  □

In order to decompose the space $W$ we choose linear subspaces $W_i \subset W$, $i =
0,\ldots,k$, such that

(2.12)  $W = W_0 + \cdots + W_k$,

i.e., any $w \in W$ can be represented (usually non-uniquely) as $w = w_0 + \cdots + w_k$, where
$w_i \in W_i$. We assume that there exists a constant $\delta > 0$ such that for any $v \in W$ there
exists a decomposition $v = \sum_{i=0}^{k} v_i$, $v_i \in W_i$, $i = 0,\ldots,k$, such that

(2.13)  $\sum_{i=0}^{k} \|v_i\|_D^2 \le \delta\,\|v\|_D^2$.

Note that $\delta$ may depend on $k$ and $\dim W_i$, $i = 0,\ldots,k$.

Let $S_i$, $i = 0,\ldots,k$, be the orthogonal projectors onto $W_i$ with respect to $(\cdot,\cdot)_D$,
that is, for any $w \in W$

(2.14)  $(S_i w, v_i)_D = (w, v_i)_D$,  $\forall v_i \in W_i$.

We assume that there exists a constant $\gamma$ such that for any $v \in W$

(2.15)  $\sum_{i=0}^{k} \|S_i v\|_D^2 \le \gamma\,\|v\|_D^2$.

We take $\gamma$ to be the smallest constant satisfying (2.15) for any $v \in W$.

Remark 2.4.  Since $\|S_i v\|_D \le \|v\|_D$, we have $\sum_{i=0}^{k} \|S_i v\|_D^2 \le (k+1)\|v\|_D^2$ and we can
always take $\gamma = k + 1$. But for some decompositions of $W$ we find that $\gamma$ does not
depend on $k$.  □

We now introduce the local decomposition operators $R_i : W \to W_i$, $i = 0,\ldots,k$,
defined for any $w \in W$ as follows

(2.16)  $(R_i w, v_i)_D = (w, v_i)$,  $\forall v_i \in W_i$.

The global decomposition operator is the sum of the $R_i$, $i = 0,\ldots,k$, that is

$R = \sum_{i=0}^{k} R_i$.

Note that $R : W \to W$, since $W$ is a linear space. Denote by $Q_i : W \to W_i$, $i = 0,\ldots,k$,
the orthogonal projectors onto $W_i$ with respect to $(\cdot,\cdot)$, i.e., for any $w \in W$

$(Q_i w, v_i) = (w, v_i)$,  $\forall v_i \in W_i$.

From (2.16) we get that for any $w, v \in W$

$(D R_i w, Q_i v) = (w, Q_i v)$

and, since $Q_i$ is symmetric and $R_i w = Q_i R_i w$, we find that for any $v \in W$

$(Q_i D Q_i R_i w, v) = (Q_i w, v)$.

This implies that

$D_i R_i w = Q_i w$,

where $D_i = Q_i D Q_i$. Hence $R_i w = D_i^{-1} Q_i w$ (with $D_i$ inverted on $W_i$) and

(2.17)  $R = \sum_{i=0}^{k} D_i^{-1} Q_i$.

Note that $R_i$, $i = 0,\ldots,k$, is symmetric with respect to $(\cdot,\cdot)$, but it is not a projector
($R_i^2 \ne R_i$).

We now show that $R$ is equivalent to $D^{-1}$, i.e.,

$\delta^{-1}(D^{-1}w, w) \le (Rw, w) \le \gamma\,(D^{-1}w, w)$,

where $\delta$ and $\gamma$ are defined in (2.13) and (2.15) respectively. From the above it follows
that $1 \le \delta\gamma$.

Lemma 2.1.  If (2.13) is satisfied, then for any $w \in W$

(2.18)  $\delta\,(Rw, w) \ge \|w\|_{D^{-1}}^2$.

Proof.  For any $v \in W$ we have a decomposition $v = \sum_{i=0}^{k} v_i$, $v_i \in W_i$, satisfying
(2.13). Note that $(w, v_i) = (R_i w, v_i)_D$, see (2.16). Hence

(2.19)  $|(w,v)| = \Bigl|\sum_{i=0}^{k} (w, v_i)\Bigr| \le \sum_{i=0}^{k} |(R_i w, v_i)_D|$.

Further,

(2.20)  $\sum_{i=0}^{k} |(R_i w, v_i)_D| \le \sum_{i=0}^{k} \|R_i w\|_D\,\|v_i\|_D$.

Since $(R_i w, R_i w)_D = (w, R_i w)$, see (2.16), then from (2.19), (2.20), the Cauchy-Schwarz
inequality for sums and (2.13) we obtain

$|(w,v)| \le (w, Rw)^{1/2}\,\delta^{1/2}\,\|v\|_D$.

We divide the above by $\|v\|_D$ and take the supremum over $v \in W$. Using (2.4) we
obtain

(2.21)  $\|w\|_{D^{-1}} \le \delta^{1/2}(Rw, w)^{1/2}$,

which completes the proof.  □

Since $D^{-1}$ is positive definite, Lemma 2.1 implies
Corollary 2.1.  $R$ is invertible.

We now obtain the estimate of $(Rw, w)$ from above.

Lemma 2.2.  Let (2.15) be satisfied. Then for any $w \in W$

$(Rw, w) \le \gamma\,\|w\|_{D^{-1}}^2$.

Proof.  From (2.16) and (2.14) we find that for any $v_i \in W_i$, $i = 0,\ldots,k$,

$(R_i D w, v_i)_D = (Dw, v_i) = (S_i w, v_i)_D$.

Hence for any $w \in W$

(2.22)  $R_i D w = S_i w$.

Since $S_i = S_i^2$,

$(RDv, Dv) = \sum_{i=0}^{k} (R_i D v, v)_D = \sum_{i=0}^{k} (S_i v, v)_D = \sum_{i=0}^{k} \|S_i v\|_D^2 \le \gamma\,\|v\|_D^2$.

Taking $Dv = w$ we obtain $(Rw, w) \le \gamma\,\|w\|_{D^{-1}}^2$.  □
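Formula (2.17) and the two lemmas can be illustrated with matrices. The sketch below rests on assumptions that are not in the paper: $W = \mathbb{R}^n$ with the Euclidean inner product, each $W_i$ spanned by the coordinate vectors in an index set `blocks[i]`, and the hypothetical function name `schwarz_operator`. In this setting $D_i^{-1}Q_i$ is the inverse of a principal submatrix of $D$ embedded back into an $n \times n$ matrix, $S_i = R_i D$ by (2.22), and the eigenvalues of $RD$ lie in $[\delta^{-1}, \gamma]$ with $\gamma$ at most the number of subspaces (Remark 2.4).

```python
import numpy as np

def schwarz_operator(D, blocks):
    """R = sum_i D_i^{-1} Q_i of (2.17) for coordinate subspaces W_i."""
    R = np.zeros_like(D)
    for idx in blocks:
        idx = np.array(list(idx))
        D_i = D[np.ix_(idx, idx)]                    # D restricted to W_i
        R[np.ix_(idx, idx)] += np.linalg.inv(D_i)    # embed D_i^{-1} into W
    return R

n = 12
# Model SPD operator (1D Laplacian stencil), chosen only for illustration.
D = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
blocks = [range(0, 5), range(3, 9), range(7, 12)]    # overlapping, cover 0..n-1
R = schwarz_operator(D, blocks)

# RD is similar to L^T R L with D = L L^T, so its spectrum is real.
L = np.linalg.cholesky(D)
eigs = np.linalg.eigvalsh(L.T @ R @ L)
delta_inv, gamma = eigs.min(), eigs.max()            # sharp constants in the equivalence
```

Here `gamma` is the smallest constant in (2.15) for this decomposition, and `1.0 / delta_inv` is the smallest $\delta$ for which (2.18) holds.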

3.  Symmetrization.  It is evident that a nonsymmetric equation can be transformed
to an equivalent equation with a symmetric operator. See [2], where two kinds
of symmetrization of nonsymmetric equations are described. We shall use one of them.
Consider the following equation

(3.1)  $Tw = z$,

where $T = R A^T R A$, $z = R A^T R f$ and $A^T$ is the transpose of $A$ with respect to $(\cdot,\cdot)$,
i.e., $(A^T u, v) = (u, Av)$ for any $u, v \in W$. Notice that $T$ is symmetric with respect to
$(\cdot,\cdot)_{R^{-1}}$, since $(Tw, v)_{R^{-1}} = (RAw, Av)$ and $R$ is symmetric. We can use the conjugate
gradient method (CG) to solve (3.1), see [2]. It is rather easy to find $Rw$ for any $w \in W$,
but it is hard to compute $R^{-1}w$. Thus we introduce some changes in the CG algorithm.
We use the version of CG studied in [10], see also [1].

We apply the following algorithm to solve (3.1). We assume that we are given an
initial approximation $w_0$ to the solution $w$ of (3.1).

Algorithm S

       $r_0 := A^T R(f - Aw_0)$;  $p_0 := R r_0$;  $i := 0$;
       while error > tolerance do
       begin
          $\alpha_i := \dfrac{(R r_i, r_i)}{(p_i, A^T R A p_i)}$;
(3.2)     $w_{i+1} := w_i + \alpha_i p_i$;
          $r_{i+1} := r_i - \alpha_i A^T R A p_i$;
          $\beta_i := \dfrac{(R r_{i+1}, r_{i+1})}{(R r_i, r_i)}$;
          $p_{i+1} := R r_{i+1} + \beta_i p_i$;
          $i := i + 1$
       end.

Here we have introduced $R r_i$ instead of $r_i$ used in [10] and we have used the $(\cdot,\cdot)_{R^{-1}}$
inner product instead of the standard $(\cdot,\cdot)$, since $T$ is symmetric with respect to the
$(\cdot,\cdot)_{R^{-1}}$ inner product and our estimates are given in the $\|\cdot\|_{R^{-1}}$ norm. Note that $R$ can
be considered as a preconditioner for the equation $A^T R A w = A^T R f$. We get the same
algorithm and the same rate of convergence. On each iterative step we use the operator
$R$ twice: to compute $A^T R A p_i$ and $R r_{i+1}$.

We now study the rate of convergence of CG applied to (3.1). We have
Theorem 3.1.

$\|w - w_n\|_{R^{-1}} \le 2\left(1 - \dfrac{2m}{\delta\gamma\sqrt{KM} + m}\right)^{n} \|w - w_0\|_{R^{-1}}$,

where $w_n$ is an approximation of $w$ produced by the CG method after $n$ iterative steps.
Proof.  We show that

(3.3)  $\delta^{-2}m^2\,\|w\|_{R^{-1}}^2 \le (Tw, w)_{R^{-1}} \le \gamma^2 K M\,\|w\|_{R^{-1}}^2$  for all $w \in W$.

We have

$(Tw, w)_{R^{-1}} = (RAw, Aw)$.

By Lemma 2.2

$\|w\|_D^2 = (RDw, w)_{R^{-1}} \le (RDw, Dw)^{1/2}\,\|w\|_{R^{-1}}
\le \gamma^{1/2}\|Dw\|_{D^{-1}}\,\|w\|_{R^{-1}} = \gamma^{1/2}\|w\|_D\,\|w\|_{R^{-1}}$.

Hence

(3.4)  $\|w\|_D^2 \le \gamma\,\|w\|_{R^{-1}}^2$.

Using Lemma 2.2, (2.3), (2.2) and (3.4) we get

$(RAw, Aw) \le \gamma\,\|Aw\|_{D^{-1}}^2 \le \gamma K (Aw, w) \le \gamma K M\,\|w\|_D^2 \le \gamma^2 K M\,\|w\|_{R^{-1}}^2$.

Thus we have proved the second inequality in (3.3). We now prove the first inequality.
By Lemma 2.1

(3.5)  $(RAw, Aw) \ge \delta^{-1}\,\|Aw\|_{D^{-1}}^2$.

We now show that $\|v\|_D^2 \ge \delta^{-1}\|v\|_{R^{-1}}^2$. We have

$(R^{-1}v, v) = (R^{-1}v, Dv)_{D^{-1}} \le (R^{-1}v, R^{-1}v)_{D^{-1}}^{1/2}\,(Dv, v)^{1/2}$.

Using Lemma 2.1 (with $w = R^{-1}v$) we find that

$(R^{-1}v, R^{-1}v)_{D^{-1}}^{1/2} \le \delta^{1/2}(R^{-1}v, v)^{1/2}$.

Hence

(3.6)  $(R^{-1}v, v) \le \delta\,(Dv, v)$.

From (2.4) and (2.2) we get

(3.7)  $\|Aw\|_{D^{-1}} = \sup_{v \in W} \dfrac{|(Aw, v)|}{\|v\|_D} \ge \dfrac{|(Aw, w)|}{\|w\|_D} \ge m\,\|w\|_D$.

Using (3.7) and (3.6) we obtain

$\|Aw\|_{D^{-1}} \ge m\,\delta^{-1/2}\,\|w\|_{R^{-1}}$.

Combining the above and (3.5) we finally have

$(RAw, Aw) \ge \delta^{-2}m^2\,\|w\|_{R^{-1}}^2$.

We conclude that the condition number $\kappa$ of $T$ is bounded by $\delta^2\gamma^2 K M / m^2$. It is
well known that for the CG method, see [2, 11],

$\|w - w_n\|_{R^{-1}} \le 2\left(\dfrac{\sqrt{\kappa} - 1}{\sqrt{\kappa} + 1}\right)^{n} \|w - w_0\|_{R^{-1}}$.

Since $\kappa \le \delta^2\gamma^2 K M / m^2$, then

$\dfrac{\sqrt{\kappa} - 1}{\sqrt{\kappa} + 1} = 1 - \dfrac{2}{\sqrt{\kappa} + 1} \le 1 - \dfrac{2m}{\delta\gamma\sqrt{KM} + m}$.

□
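Algorithm S translates almost line by line into code. The following is a minimal sketch, not the author's program: it assumes dense NumPy matrices, the Euclidean inner product, and coordinate subspaces for $R$; the names `build_R` and `algorithm_s`, the test matrices and the tolerance are all illustrative choices. The stopping test uses $(R r_i, r_i)^{1/2}$, which equals the $\|\cdot\|_{R^{-1}}$ norm of the residual $z - Tw_i$ of (3.1), so $R^{-1}$ is never applied.

```python
import numpy as np

def build_R(D, blocks):
    """Matrix form of (2.17) for coordinate subspaces (hypothetical helper)."""
    R = np.zeros_like(D)
    for idx in blocks:
        idx = np.array(list(idx))
        R[np.ix_(idx, idx)] += np.linalg.inv(D[np.ix_(idx, idx)])
    return R

def algorithm_s(A, apply_R, f, w0, tol=1e-10, max_iter=500):
    """CG for A^T R A w = A^T R f preconditioned by R; apply_R(v) returns R v."""
    w = w0.copy()
    r = A.T @ apply_R(f - A @ w)        # r_0 = A^T R (f - A w_0)
    z = apply_R(r)                      # R r_0
    p = z.copy()                        # p_0 = R r_0
    rz = r @ z                          # (R r_i, r_i)
    rz0 = rz
    for i in range(max_iter):
        if rz <= tol**2 * rz0:          # relative residual test in the R^{-1} norm
            return w, i
        q = A.T @ apply_R(A @ p)        # A^T R A p_i  (first use of R)
        alpha = rz / (p @ q)
        w = w + alpha * p
        r = r - alpha * q
        z = apply_R(r)                  # R r_{i+1}    (second use of R)
        rz_new = r @ z
        beta = rz_new / rz
        p = z + beta * p
        rz = rz_new
    return w, max_iter

# Small nonsymmetric model problem whose symmetric part is D.
n = 12
D = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
A = D + 0.3 * (np.eye(n, k=1) - np.eye(n, k=-1))
R = build_R(D, [range(0, 5), range(3, 9), range(7, 12)])
w_exact = np.arange(1.0, n + 1)
f = A @ w_exact
w, its = algorithm_s(A, lambda v: R @ v, f, np.zeros(n))
```

Passing `apply_R` as a callable mirrors the remark above that $Rw$ is cheap to evaluate while $R^{-1}w$ is not: in a subdomain code it would perform one local solve per subspace.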
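4.  Application.  In this section we apply the theory given in the previous sections
to the Dirichlet boundary value problem.

4.1.  Differential problem.  Let $\Omega$ denote an open bounded polygon in $R^2$ with
a boundary $\partial\Omega$. Consider the following Dirichlet boundary value problem

(4.1)  $Lu = f$ in $\Omega$,  $u = 0$ on $\partial\Omega$,

where $L$ is the elliptic operator of the form

$Lu(x) = -\sum_{i,j=1}^{2} \dfrac{\partial}{\partial x_i}\Bigl(a_{ij}(x)\dfrac{\partial u}{\partial x_j}\Bigr) + \sum_{i=1}^{2} b_i(x)\dfrac{\partial u}{\partial x_i} + c(x)u(x)$,

and $a_{ij}(x) = a_{ji}(x)$ for $i, j = 1, 2$ and any $x \in \Omega$. We suppose that $L$ is strongly elliptic,
that is, there exists a constant $c_0 > 0$ such that

$\sum_{i,j=1}^{2} a_{ij}(x)\,\xi_i\xi_j \ge c_0 \sum_{i=1}^{2} \xi_i^2$,  $\forall x \in \Omega$, $\forall \xi \in R^2$.

We will consider the equation (4.1) in the weak form: Find $u \in H_0^1(\Omega)$ such that

(4.2)  $(Au, v) = (f, v)$,  $\forall v \in H_0^1(\Omega)$,

where

$(Au, v) = \int_\Omega \sum_{i,j=1}^{2} a_{ij}\dfrac{\partial u}{\partial x_j}\dfrac{\partial v}{\partial x_i}\,dx + \int_\Omega \sum_{i=1}^{2} b_i\dfrac{\partial u}{\partial x_i}\,v\,dx + \int_\Omega c\,u\,v\,dx$

and

$(f, v) = \int_\Omega f v\,dx$.

Here $(\cdot,\cdot)$ means the $L^2(\Omega)$ inner product.

We assume that $f \in L^2(\Omega)$ and the coefficients $a_{ij}$, $b_i$ and $c$ are such that the
operator $A$ is well defined. Moreover, we suppose that the equation (4.2) has a
unique solution $u \in H_0^1(\Omega)$ for any fixed $f$.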


4.2.  The finite element method.  We approximate the solution of the equation
(4.2) by means of the finite element method with piecewise linear triangular elements.
We first introduce a two level triangulation of $\Omega \subset R^2$ and the finite element spaces.

For a given polygonal region $\Omega$, we define $\{\Omega_i\}_{i=1}^{k}$ to be a regular finite element
triangulation of $\Omega$, that is, $\{\Omega_i\}$ is a set of non-overlapping triangles such that

$\bar\Omega = \bigcup_{i=1}^{k} \bar\Omega_i$,

and moreover, no vertex of a triangle lies on the edge of another triangle.

Let $H_i$ be the diameter of $\Omega_i$ and $\hat H_i$ be the diameter of the largest inscribed ball in
$\Omega_i$. We assume that the ratio $H_i/\hat H_i$ is uniformly bounded from above, that is, we have
a regular triangulation. Let $H = \max\{H_1,\ldots,H_k\}$. We call $\{\Omega_i\}$ the coarse mesh of $\Omega$.
Note that usually $k$ depends on $H$ ($k = cH^{-2}$), but this fact will not be used further.

Next we divide each $\Omega_i$ into smaller triangles, denoted by $\tau_i^j$, $j = 1,\ldots,n_i$. We
assume that $\bar\Omega_i = \bigcup_{j=1}^{n_i} \bar\tau_i^j$ and the triangulation $\{\tau_i^j\}$ is regular. We also assume that
no vertex of a smaller triangle lies on the edge of any other smaller triangle. Let $h_i^j$ be
the diameter of $\tau_i^j$ and

$h = \max_{i,j} h_i^j$.

We call $\{\tau_i^j\}$ the fine mesh of $\Omega$. We now introduce the piecewise linear finite element
spaces. Let

$V^H = \{v^H \mid$ continuous on $\Omega$, $v^H|_{\Omega_i}$ is linear on $\Omega_i$, $v^H = 0$ on $\partial\Omega\,\}$

and

$V^h = \{v^h \mid$ continuous on $\Omega$, $v^h|_{\tau_i^j}$ is linear on $\tau_i^j$, $v^h = 0$ on $\partial\Omega\,\}$.

Note that $V^H \subset V^h$.

The finite element method is defined as follows: Find $u_h \in V^h$ such that

(4.3)  $(Au_h, v_h) = (f, v_h)$,  $\forall v_h \in V^h$.

We assume that the coefficients $a_{ij}$, $b_i$ and $c$ are such that the conditions (2.2)
and (2.3) are satisfied with constants $0 < m \le M$, $K > 0$. From the Lax-Milgram
Theorem, see e.g. [4], it follows that the equation (4.3) has a unique solution $u_h \in V^h$
for any fixed $f$.

4.3.  The additive Schwarz method.  Let $\Pi_h$ be the orthogonal projector from
$L^2(\Omega)$ onto $V^h \subset H_0^1(\Omega)$ with respect to $(\cdot,\cdot)$. We define $A = \Pi_h A \Pi_h$ and $f = \Pi_h f$.
Usually we choose $D = A_S$, where $A_S$ is the symmetric part of $A$, i.e., $A_S = \frac12(A + A^T)$.
Then we have $m = M = 1$. Note that the operator $A : V^h \to V^h$ is a finite
dimensional operator and satisfies the conditions (2.2) and (2.3). We have the equation
(2.1). We now define the space $W$ and its decomposition.

We extend each $\Omega_i$ to a larger region $\Omega_i'$ such that $\Omega_i \subset \Omega_i'$. We assume that there
exists a constant $\alpha > 0$ such that

dist$(\partial\Omega_i', \partial\Omega_i) \ge \alpha H_i$,  $\forall i = 1,\ldots,k$.

Moreover, we suppose that the boundary $\partial\Omega_i'$ does not cut through any $h$-level elements.
We cut off the parts of $\Omega_i'$ that are outside of $\Omega$. We define for $i = 1,\ldots,k$

$W_i = V^h \cap H_0^1(\Omega_i')$,

where we extend each function of $W_i$ by zero to the complement of $\Omega_i'$. It is known that
this extension is continuous. Thus $W_i$, $i = 1,\ldots,k$, are subspaces of $W$. We also
define

$W_0 = V^H$.

Finally the space $W$ is defined as follows:

$W = W_0 + W_1 + \cdots + W_k$.

It is clear that $W = V^h$.

We check now that the conditions (2.13) and (2.15) are satisfied. It can be shown
that $\gamma$ is independent of $k$. Also the constant $\delta$ is independent of $k$, $h$ and $H$. The
proof of this is given in [5]. As a conclusion we get

THEOREM 4.1.  The rate of convergence of the method (3.2) is independent of $k$,
$h$ and $H$.
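The decomposition $W = W_0 + W_1 + \cdots + W_k$ is easiest to see in one space dimension. The sketch below is an analogue built on assumptions, not the paper's two-dimensional setting: $\Omega = (0,1)$, $D$ is the piecewise linear stiffness matrix of $-u''$, the coarse mesh has `k` cells of `r` fine cells each, and every coarse cell is extended by `ov` fine cells per side. The function names `two_level_R` and `rd_spectrum` are hypothetical. The coarse space $W_0 = V^H$ enters through the interpolation matrix `P` of coarse hat functions.

```python
import numpy as np

def two_level_R(k, r, ov):
    """Return (D, R) for a 1D two-level decomposition with k >= 2 subregions."""
    n = k * r - 1                                   # interior fine nodes
    h = 1.0 / (n + 1)
    D = (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / h
    j = np.arange(1, n + 1)                         # fine node numbers
    c = np.arange(1, k)                             # coarse interior node numbers
    # Column c of P holds the coarse hat function centred at fine node r*c.
    P = np.maximum(0.0, 1.0 - np.abs(j[:, None] - r * c[None, :]) / r)
    R = P @ np.linalg.solve(P.T @ D @ P, P.T)       # R_0 for W_0 = V^H
    for i in range(k):
        lo = max(0, i * r - ov)                     # extended subregion, clipped to Omega
        hi = min(n + 1, (i + 1) * r + ov)
        idx = np.arange(lo + 1, hi) - 1             # nodes strictly inside, 0-based
        R[np.ix_(idx, idx)] += np.linalg.inv(D[np.ix_(idx, idx)])
    return D, R

def rd_spectrum(D, R):
    """Eigenvalues of RD, computed from the similar symmetric matrix L^T R L."""
    L = np.linalg.cholesky(D)
    return np.linalg.eigvalsh(L.T @ R @ L)

D_demo, R_demo = two_level_R(k=4, r=6, ov=2)
eigs_demo = rd_spectrum(D_demo, R_demo)
```

Calling `rd_spectrum(*two_level_R(k, r, max(1, r // 3)))` for several `k` and `r` gives a quick way to watch how the extreme eigenvalues, and hence $\delta$ and $\gamma$, behave as $h$ and $H$ change while the overlap ratio is held roughly fixed.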


5.  Numerical results.  Consider the following problem defined on $\Omega = [0,1] \times
[0,1] \subset R^2$

(5.1)  $Lu = -\dfrac{\partial}{\partial x}\Bigl(a_1\dfrac{\partial u}{\partial x}\Bigr) - \dfrac{\partial}{\partial y}\Bigl(a_2\dfrac{\partial u}{\partial y}\Bigr) + b_1\dfrac{\partial u}{\partial x} + b_2\dfrac{\partial u}{\partial y} + cu = f$,

with $u = 0$ on $\partial\Omega$. The function $f$ is defined so that the solution of (5.1) in $H_0^1(\Omega)$ is
$u = x e^{xy} \sin(\pi x)\sin(\pi y)$. The coefficients are as follows:

(5.2)  $a_1 = 1 + x^2 + y^2$,  $a_2 = e^{xy}$,  $b_1 = x + y$,  $b_2 = 1/(1 + x + y)$  and  $c = 0.5$

or

(5.3)  $a_1 = \sigma$,  $a_2 = \sigma$,  $b_1 = 1$,  $b_2 = 1$  and  $c = 1$;

$\sigma$ will be specified later. This test problem is similar to that in [3]. The stopping
criterion for iterative methods is $\|r_i\|_{R^{-1}} \le 10^{-3}\|r_0\|_{R^{-1}}$. The operator $D = A_S$, where
$A_S$ is the symmetric part of $A$. The program was run in double precision on a Sun
SPARCstation.

Fig. 5.1.  Coarse mesh of $\Omega$.
Fig. 5.2.  Subregion $\Omega_i'$ with fine mesh inside.

Let triangles $\{\Omega_i\}$ be the division of $\Omega$ (coarse mesh), see Fig. 5.1. We extend
each $\Omega_i \cup \Omega_{i+1}$, $i = 1, 3, \ldots, 2k - 1$, to a larger rectangle $\Omega_i'$, $i = 1,\ldots,k$, such that
each side of $\Omega_i'$ is parallel to its corresponding axis, see Fig. 5.2. We divide each $\Omega_i'$ into
smaller triangles (fine mesh) such that $\partial\Omega_i'$ does not cut through any $h$-level triangles.
The width of the strip $\Omega_i' \setminus (\Omega_i \cup \Omega_{i+1})$ in $h$-units will be called the size of overlap. We
do not extend $\Omega_i'$ outside $\Omega$.

Test 1.

We examine the test problem (5.1). For different $h$ and $H$ we will study the effectiveness
of the algorithm and test whether it depends on the mesh sizes $h$ and $H$. The
overlapping factor $\alpha$ is fixed. We will try to keep $\alpha \approx 1/3$.

Let ite denote the number of iterative steps required by the method (3.1), and let max
err and $L^2$ denote respectively the maximum error and the error in the $L^2$-norm between the
numerical solution given by (3.1) and the exact solution. The mark * in the tables
means that the overlapping factor $\alpha$ is less than 1/3.


   h       H     ite      H     ite     max err         L2
  1/15    1/3    15*     1/5    18     3.8 x 10^-3    1.4 x 10^-3
  1/30    1/3    16*     1/5    18     9.2 x 10^-4    (illegible)
  1/45    1/3    16      1/5    18     5.0 x 10^-4    1.8 x 10^-4
  1/60    1/3    16      1/5    18     4.0 x 10^-4    1.3 x 10^-4

Table 5.1.  Problem (5.1), (5.2)

   h       H     ite      H     ite     max err         L2
  1/15    1/3    15*     1/5    15     9.0 x 10^-3    4.2 x 10^-3
  1/30    1/3    16*     1/5    16     2.3 x 10^-3    1.1 x 10^-3
  1/45    1/3    17      1/5    16     1.1 x 10^-3    4.9 x 10^-4
  1/60    1/3    17      1/5    16     7.5 x 10^-4    3.0 x 10^-4

Table 5.2.  Problem (5.1), (5.3) with $\sigma = \sqrt{2}/30$


In Tab. 5.1 and Tab. 5.2, respectively, the problem (5.1) with coefficients (5.2) and
the problem (5.1), (5.3) with $\sigma = \sqrt{2}/30$ are tested. From these tables it is seen that
the method does not degenerate with $h$ when $H$ is fixed.

Test 2.

We test how the method (3.1) depends on $H$. We keep $h = 1/30$ and vary the parameter
$H$. In Tab. 5.3 results are given for the problem (5.1) with coefficients (5.2). In Tab. 5.4
and Tab. 5.5 we give results for the problem with coefficients (5.3) and with $\sigma = \sqrt{2}/30$
and $\sigma = \sqrt{2}/100$ respectively. It can be seen that ite is in agreement with the theory,
where an upper bound on the number of iterative steps is independent of $h$ and $H$. A
particular number of iterative steps may vary in some range.


   H      1/3    1/5    1/6    1/10
  ite     16*    18     18*    17

Table 5.3.  Problem (5.1), (5.2)


   H      1/3    1/5    1/6    1/10
  ite     16*    16     15*    14

Table 5.4.  Problem (5.1), (5.3) with $\sigma = \sqrt{2}/30$


   H      1/3    1/5    1/6    1/10
  ite     22*    22     17*    19

Table 5.5.  Problem (5.1), (5.3) with $\sigma = \sqrt{2}/100$


Acknowledgments.  The author thanks Professor James H. Bramble for his stimulating
discussion and valuable remarks.

REFERENCES 


[1]  O. Axelsson and V. A. Barker.  Finite element solution of boundary value problems.  Academic
Press, 1984.

[2]  J. H. Bramble and J. E. Pasciak.  Preconditioned iterative methods for nonselfadjoint or indefinite
elliptic boundary value problems.  In H. Kardestuncer, editor, Unification of Finite Element
Methods.  Elsevier Science Publishers B. V., 1984.

[3]  X.-C. Cai.  An additive Schwarz algorithm for nonselfadjoint elliptic equations.  In T. F. Chan
et al., editors, Third International Symposium on Domain Decomposition Methods for Partial
Differential Equations, pages 232-244.  SIAM, Philadelphia, 1990.

[4]  P. G. Ciarlet.  The finite element method for elliptic problems.  North-Holland, Amsterdam, 1978.

[5]  M. Dryja.  An additive Schwarz algorithm for two- and three-dimensional finite element elliptic
problems.  In T. F. Chan et al., editors, Domain decomposition methods, pages 168-172.
SIAM, Philadelphia, 1989.

[6]  M. Dryja and O. B. Widlund.  An additive variant of the Schwarz alternating method for the case
of many subregions.  Technical Report 339, Dept. of Comp. Sci., Courant Institute, 1987.

[7]  M. Dryja and O. B. Widlund.  Some domain decomposition algorithms for elliptic problems.
In L. Hayes and D. Kincaid, editors, Iterative Methods for Large Linear Systems.  Academic
Press, 1990.

[8]  M. Dryja and O. B. Widlund.  Towards a unified theory of domain decomposition algorithms for
elliptic problems.  In T. F. Chan et al., editors, Third International Symposium on Domain
Decomposition Methods for Partial Differential Equations, pages 3-21.  SIAM, Philadelphia,
1990.

[9]  A. M. Matsokin and S. V. Nepomnyaschikh.  A Schwarz alternating method in a subspace.  Soviet
Mathematics, 29:78-84, 1985.

[10]  J. K. Reid.  On the method of conjugate gradients for the solution of large sparse systems of
equations.  In J. K. Reid, editor, Large Sparse Sets of Linear Equations, pages 231-254.
Academic Press, New York, 1971.

[11]  A. A. Samarskii and E. S. Nikolaev.  Numerical Methods for Grid Equations.  Birkhäuser Verlag,
Basel, 1989.


INELASTIC  MICROSTRUCTURE  IN 
RAPID  GRANULAR  FLOWS  OF  SMOOTH  DISKS 
Hanover, New Hampshire 03755

and 


Michel Y. Louge

Department  of  Mechanical  Engineering 
Cornell  University 
Ithaca,  New  York  14853 


ABSTRACT 

Computer  simulations  of  two-dimensional  rapid  granular  flows  of  uniform  smooth 
inelastic  disks  under  simple  shear  reveal  a  dynamic  microstructure  characterized  by  the 
local,  spatially  anisotropic  agglomeration  of  disks.  A  spectral  analysis  of  the  concentration 
field  suggests  that  the  formation  of  this  inelastic  microstructure  is  correlated  with  the 
magnitude  of  the  total  stresses  in  the  flow.  The  simulations  confirm  the  theoretical  results 
of Jenkins and Richman (J. of Fluid Mech. 192, pp. 313-328, 1988) for the kinetic
stresses in the dilute limit and for the collisional stresses in the dense limit, when the size of
the periodic domain used in the simulations is a small multiple of the disk diameter.
However, the kinetic and, to a lesser extent, collisional stresses both increase significantly
with  the  size  of  the  periodic  domain,  thus  departing  from  the  predictions  of  the  theory  that 
assumes  spatial  homogeneity  and  isotropy.  The  corresponding  paper  discussing  the 
simulation  technique  and  the  formation  of  inelastic  microstructure  is  presently  under  review 
by  Phys.  Fluids  A  (1990). 
CURVE DESIGN AND ANALYSIS USING SPLINES AND WAVELETS *

Charles  K.  Chui* 

Department  of  Mathematics 
Texas  A&M  University 
College  Station,  TX  77843 

ABSTRACT.  While  it  is  well  known  that  spline  functions  provide  a  very  useful  tool 
for  designing  and  representing  curves,  this  paper  initiates  an  application  of  wavelets  for 
analyzing  their  shapes.  An  iterative  design  and  analysis  procedure  is  recommended  for 
yielding  satisfactory  results.  The  local  optimal-order  interpolatory  scheme  introduced 
in  our  earlier  work  provides  a  moving  average  algorithm  for  the  design  purpose,  while 
the  decomposition  formula  in  wavelets  yields  a  decomposition  algorithm.  After  necessary 
modifications  are  made,  a  reconstruction  algorithm  is  used  to  restore  a  more  desirable 
curve.  This  algorithm  is  a  consequence  of  the  two-scale  relation  of  B-splines  and  that  of 
the compactly supported B-wavelets we recently constructed.

1.  INTRODUCTION.  Many  curve  design  schemes  are  available  in  the  literature.  In 
particular,  when  spline  functions  are  used,  these  schemes  are  especially  efficient  and  the 
resulting  spline  representations  of  curves  are  usually  very  satisfactory.  However,  much  less 
has  been  studied  on  the  analysis  of  the  “shapes”  of  these  spline  curves.  This  paper  initiates 
an  application  of  spline  wavelets  for  such  an  analysis.  The  decomposition  sequences  of 
splines into splines with fewer knots and the orthogonal spline wavelet complements are
used  to  yield  a  wavelet  decomposition  algorithm,  while  the  pair  of  two-scale  relations  give 
rise  to  a  reconstruction  algorithm. 

Let us first recall that for each positive integer $m$, the $m$th order B-spline $N_m$, defined
recursively by

(1)  $N_m(x) = (N_{m-1} * N_1)(x) = \int_0^1 N_{m-1}(x - t)\,dt$,

with $N_1 = \chi_{[0,1)}$, generates a multiresolution analysis of $L^2 = L^2(-\infty,\infty)$ (cf. M [10,11]).
In particular, if $V_k$ is the $L^2$-closure of the linear span of

(2)  $N_{m;k,j}(x) := N_m(2^k x - j)$,  $j \in Z$,

then $\{V_k\}$ is a nested sequence of closed subspaces of $L^2$, the closure of whose union is all of $L^2$, and
whose intersection is the zero function. For each $k \in Z$, let $W_k$ denote the orthogonal
complementary subspace of $V_{k+1}$ relative to $V_k$; that is,

$W_k \perp V_k$  and  $V_{k+1} = V_k + W_k$,

and we use the standard notation

(3)  $V_{k+1} = V_k \oplus W_k$.

* Supported by SDIO/IST managed by the U.S. Army Research Office under Contract
Numbers DAAL 03-87-K-0025 and DAAL 03-90-G-0091

It is also well known (cf. M [10,11]) that there exists some function $\psi_m$ in $W_0$ that generates
all the spaces $W_k$; that is, by setting

(4)  $\psi_{m;k,j}(x) := \psi_m(2^k x - j)$,

then each $W_k$ is the closure of the linear span of $\psi_{m;k,j}$, $j \in Z$. Here, $\psi_m$ is called a wavelet
corresponding to the B-spline $N_m$. In M [10,11], however, since an orthonormalization
procedure is used to give $\psi_m$, this wavelet has to have infinite support. By introducing
a new approach to the construction of wavelets in CW [6,7], we have now a compactly
supported spline wavelet

(5)  $\psi_m(x) = \dfrac{1}{2^{m-1}} \sum_{j=0}^{2m-2} (-1)^j N_{2m}(j+1)\,N_{2m}^{(m)}(2x - j)$.

Since this wavelet has minimum support (cf. CW [8]), we may call it a B-wavelet of order
$m$ associated with the B-spline $N_m$ (cf. Fig. 1 and Fig. 2). The $m$th order spline/wavelet
pair

(6)  $(N_m, \psi_m)$,

given in (1) and (5), will be used for the design/analysis of curves.
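Formulas (1) and (5) are straightforward to evaluate. The sketch below is illustrative and makes assumptions beyond the text: instead of the convolution in (1) it uses the standard equivalent recurrence $N_m(x) = \bigl(x N_{m-1}(x) + (m - x) N_{m-1}(x - 1)\bigr)/(m-1)$, it obtains $N_{2m}^{(m)}$ from the difference rule $N_m'(x) = N_{m-1}(x) - N_{m-1}(x-1)$, and the function names are hypothetical. For $m = 1$ the result is the Haar wavelet; for $m = 2$ it is the piecewise linear B-wavelet.

```python
import numpy as np
from math import comb

def bspline(m, x):
    """Cardinal B-spline N_m of order m (degree m - 1), supported on [0, m]."""
    x = np.asarray(x, dtype=float)
    if m == 1:
        return ((x >= 0) & (x < 1)).astype(float)        # N_1 = indicator of [0, 1)
    return (x * bspline(m - 1, x) + (m - x) * bspline(m - 1, x - 1)) / (m - 1)

def bspline_derivative(m, order, x):
    """order-th derivative of N_m as an alternating sum of shifts of N_{m-order}."""
    x = np.asarray(x, dtype=float)
    return sum((-1) ** i * comb(order, i) * bspline(m - order, x - i)
               for i in range(order + 1))

def bwavelet(m, x):
    """B-wavelet psi_m evaluated from formula (5)."""
    x = np.asarray(x, dtype=float)
    total = sum((-1) ** j * bspline(2 * m, j + 1)
                * bspline_derivative(2 * m, m, 2 * x - j)
                for j in range(2 * m - 1))                # j = 0, ..., 2m - 2
    return total / 2 ** (m - 1)

# Sample the linear B-wavelet on its support [0, 3] for plotting or inspection.
x_plot = np.linspace(0.0, 3.0, 301)
psi2_values = bwavelet(2, x_plot)
```

Expanding (5) for $m = 2$ by hand gives $\psi_2(x) = \frac{1}{12}\bigl[N_2(2x) - 6N_2(2x-1) + 10N_2(2x-2) - 6N_2(2x-3) + N_2(2x-4)\bigr]$, which is a convenient cross-check on the code.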

2.  DESIGN, DECOMPOSITION, AND RECONSTRUCTION ALGORITHMS.  The design/analysis
scheme is an iterative procedure that requires three “moving-average” algorithms.
The first one is an interpolation/approximation moving average algorithm derived
from the optimal order real-time interpolation scheme discussed in CD [5] (see also C [2]
and CD [4]). The second algorithm is a so-called decomposition algorithm. It is derived
from the decomposition formula

(7)  $N_m(2x - \ell) = \sum_n a_{\ell-2n} N_m(x - n) + \sum_n b_{\ell-2n} \psi_m(x - n)$.

The validity of (7) follows from the decomposition of $V_1$ as an orthogonal sum of $V_0$ and
$W_0$ as given by (3). The third algorithm comes from the pair of two-scale relations:

(8)  $N_m(x) = \sum_n p_n N_m(2x - n)$

and

(9)  $\psi_m(x) = \sum_n q_n N_m(2x - n)$.

Here, (8) and (9) are consequences of $V_0 \subset V_1$ and $W_0 \subset V_1$, respectively. The four
sequences $\{a_n\}$, $\{b_n\}$, $\{p_n\}$, and $\{q_n\}$ are easily computed by using the results in CW
[7]. Note that since both $N_m$ and $\psi_m$ have compact supports, $\{p_n\}$ and $\{q_n\}$ are finite
sequences. The sequences $\{a_n\}$ and $\{b_n\}$ have exponential decay, and by a duality principle
established in CW [7], the two pairs $(\{a_n\}, \{b_n\})$ and $(\{p_n\}, \{q_n\})$ are interchangeable.
Details and generalities can be found in CW [6,7,8].

2.1.  DESIGN ALGORITHM.  We employ the real-time interpolatory scheme discussed
in CD [5] (see also C [2], and CD [4]). However, since the B-splines must be used for
decomposition purposes, we must change the real-time table-look-up scheme in CD [5] to
a moving average scheme. We emphasize that this interpolation scheme has the optimal
order of approximation. The construction introduced in CD [4] can be summarized as follows.
Quasi-interpolation formulae are constructed using finite blocks of data information.
Next, a completely local interpolation scheme is used, and finally a blending procedure
is employed to yield the desired interpolatory scheme that utilizes only finite blocks of
data information but gives the optimal order of approximation. That is, we have a finite
sequence $\{w_p^{(m)}\}$ such that

(10)  (S«/)(x)  =  £  (jF *«)  |  iVm(2"x  -  j) 

satisfies:

(11)  $(S_M f - f)(j\ell_m/2^M) = 0$,  $j \in Z$,

and

(12)  $\|S_M f - f\|_\infty = O(2^{-mM})$,  $M \to \infty$,

for all $f \in C^m$. Here, $w_p^{(m)} := 0$ if $|p| > k_m$; $\ell_m$ is a positive integer depending on the order
$m$ of the splines (e.g. $\ell_2 = 1$ and $\ell_4 = 2$, cf. C [2,3]); and $M \in Z$ with $\ell_m/2^M$ denoting
the sampling rate. Note that $S_M f$ is an $m$th order spline with knots at $2^{-M}Z$. Also, the
length of the “weight” sequence $\{w_p^{(m)}\}$ depends on $m$. It is trivial that $k_2 = 0$
and $w_0^{(2)} = 1$ for linear splines. For cubic splines, we have $k_4 = 4$ (i.e. $w_p^{(4)} = 0$ for $|p| > 4$)
and


w 


(4)  _  (4) 


-4 


W, 


(4)  (4) 

W_3  =  w3 


(4)  (4) 

W_  2  =  U’2 


W 


^  ~  I0j4) 

29 
24 


-l 


(4) 

Wn  = 


48 

_i_ 

_  12 
1 

~ 8 
1_ 

12 


(cf.  CD  [4]). 

2.2.  DECOMPOSITION ALGORITHM.  From the decomposition formula (7) and the
scale-change notation in (2) and (4), we have

(13)  $N_{m;k+1,j} = \sum_n a_{j-2n} N_{m;k,n} + \sum_n b_{j-2n} \psi_{m;k,n}$.

Suppose that the samples

(14)  $\{f(j\ell_m/2^M)\}$,  $j \in Z$,

are used to design the interpolatory $m$th order spline curve with knots at $2^{-M}Z$. Here, $M$
is a sufficiently large positive integer. Set

$f_M(x) = (S_M f)(x) = \sum_j c_j^M N_{m;M,j}(x)$,  $c^M = \{c_j^M\}$, $j \in Z$.

Recall from (11) and (12) that $f_M$ interpolates $f$ at $\{j\ell_m/2^M\}$, $j \in Z$, and yields the $m$th
order of approximation to $f$. Hence, we have


1 


where we recall that $w_p^{(m)} = 0$ for $|p| > k_m$, and $m$ is even since we only use even order
splines for interpolation. For $k < M$, we also set

(16)  $f_k(x) = \sum_j c_j^k N_{m;k,j}(x)$,  $c^k = \{c_j^k\}$, $j \in Z$,

where $f_k$ is the projection of $f_{k+1}$ from $V_{k+1}$ to $V_k$ as indicated by (3). Let $g_k \in W_k$ be
its orthogonal complement; that is, $g_k \perp f_k$ and $f_{k+1} = f_k + g_k$. Write

(17)  $g_k(x) = \sum_j d_j^k \psi_{m;k,j}(x)$,  $d^k = \{d_j^k\}$, $j \in Z$.

Then $\{c^k\}$ and $\{d^k\}$, $k = M - 1, \ldots, M - L$ (where $L$ is arbitrary) can be computed by
applying the following recursive formula:

(18)  $c_j^k = \sum_n a_{n-2j}\,c_n^{k+1}$,  $d_j^k = \sum_n b_{n-2j}\,c_n^{k+1}$.

Hence, starting from $c^M$ in (15), we can easily compute $c^{M-1}, d^{M-1}$; $c^{M-2}, d^{M-2}$; $\ldots$;
$c^{M-L}, d^{M-L}$ by applying (18). That is, by using (16) and (17), we have a (mutually)
orthogonal decomposition:

(19)  $f_M(x) = g_{M-1}(x) + \cdots + g_{M-L}(x) + f_{M-L}(x)$,

where $f_M(x)$ interpolates $f(x)$ at $\{j\ell_m/2^M\}$, $j \in Z$, and approximates $f(x)$ with the $m$th
order of approximation, and where

(20)  $\|f_{M-L}\|_\infty \to 0$,  $L \to \infty$.

It will be clear that the orthogonal decomposition (19) has significant information on the
curve $y = f_M(x)$. To verify (18), we need the following result.

LEMMA 1. Let $\xi \in L^1 \cap L^2$ on $(-\infty, \infty)$ be non-trivial. Then $\{\xi(\cdot - j) : j \in \mathbb{Z}\}$ is $\ell^2$-linearly
independent, in the sense that

(21)  \sum_{j \in \mathbb{Z}} a_j \xi(\cdot - j) = 0  a.e.,

where $\{a_j\} \in \ell^2$, implies that $a_j = 0$ for all $j$.

The proof of this lemma follows easily by a Fourier transform argument. Indeed,
the Fourier transform of the left-hand side of (21) is the product of the symbol (or Fourier
series) of $\{a_j\}$ and the Fourier transform $\hat{\xi}$ of $\xi$. Since $\xi$ is nontrivial, (21) holds if
and only if the symbol of $\{a_j\}$ vanishes identically, or $a_j = 0$ for all $j$.

Now, we are ready to verify (18). From (13) and the definitions in (16) and (17), we
observe that $f_{k+1}(x) = f_k(x) + g_k(x)$ if and only if

    \sum_n \sum_j c^{k+1}_n a_{n-2j} N_{m;k,j}(x) + \sum_n \sum_j c^{k+1}_n b_{n-2j} \psi_{m;k,j}(x)
        = \sum_j c^k_j N_{m;k,j}(x) + \sum_j d^k_j \psi_{m;k,j}(x).

Now, by Lemma 1, both sets $\{N_{m;k,j} : j \in \mathbb{Z}\}$ and $\{\psi_{m;k,j} : j \in \mathbb{Z}\}$ are $\ell^2$-linearly
independent. Hence, since these two sets are orthogonal to each other, we have verified
(18).

2.3. RECONSTRUCTION ALGORITHM. Once we have obtained the orthogonal decomposition
(19), we can judge which portion of the curve $y = f_M(x)$ needs adjustment. Note
that $g_k(x)$ contains the information on the portion of $f_M(x)$ with slope proportional to $2^k$,
since

    \frac{d}{dx} \psi_{m;k,j}(x) = 2^k \psi_m'(2^k x - j).

Furthermore, this information is localized, in view of

    supp(\psi_{m;k,j}') = supp(\psi_{m;k,j}) = [2^{-k} j, 2^{-k}(j + 2m - 1)].

Of course, the magnitude of this slope is further multiplied by the coefficients $d^k$ in addition
to the $2^k$ factor. The orthogonality of this decomposition implies that the localized
information is meaningful.

After certain portions of the curve $y = f_M(x)$ have been adjusted, say by replacing $d^k$
with $\tilde{d}^k$ and $c^{M-L}$ with $\tilde{c}^{M-L}$, we may reconstruct $\tilde{c}^M = \{\tilde{c}^M_j\}$, which gives the modified
spline curve

    \tilde{f}_M(x) = \sum_j \tilde{c}^M_j N_{m;M,j}(x),

by applying the recursive formula:

(22)  \tilde{c}^{k+1}_j = \sum_n ( p_{j-2n} \tilde{c}^k_n + q_{j-2n} \tilde{d}^k_n ),   k = M-L, \ldots, M-1.

To prove (22), we note from the pair of two-scale relations (8) and (9) that

(23)  N_{m;k,j}(x) = \sum_n p_n N_{m;k+1,2j+n}(x) = \sum_n p_{n-2j} N_{m;k+1,n}(x),
      \psi_{m;k,j}(x) = \sum_n q_n N_{m;k+1,2j+n}(x) = \sum_n q_{n-2j} N_{m;k+1,n}(x).

Hence, $f_{k+1}(x) = f_k(x) + g_k(x)$ if and only if

    \sum_j c^{k+1}_j N_{m;k+1,j}(x) = \sum_j ( c^k_j N_{m;k,j}(x) + d^k_j \psi_{m;k,j}(x) )
        = \sum_j \sum_n ( c^k_j p_{n-2j} + d^k_j q_{n-2j} ) N_{m;k+1,n}(x)
        = \sum_j \{ \sum_n ( c^k_n p_{j-2n} + d^k_n q_{j-2n} ) \} N_{m;k+1,j}(x).

So, (22) follows by an application of Lemma 1.
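
The two recursions can be sketched in C as a single analysis step, $c^k_j = \sum_n a_{n-2j} c^{k+1}_n$ and $d^k_j = \sum_n b_{n-2j} c^{k+1}_n$, and a single synthesis step as in (22). This is only an illustration under stated assumptions: the function names are hypothetical, every sequence is stored as a finite array of taps indexed from 0, and coefficients outside the array are treated as zero. For spline orders m >= 2 the decomposition sequences are infinite, so in practice they would have to be truncated. The simplest case m = 1 needs no truncation: p = {1, 1} and q = {1, -1} follow from (24) and (25), and one can check directly that a = {1/2, 1/2}, b = {1/2, -1/2}.

```c
#include <stddef.h>

/* One decomposition step:
 *   c_out[j] = sum_n a[n-2j] * c_in[n],   d_out[j] = sum_n b[n-2j] * c_in[n].
 * a[] and b[] hold the taps for indices 0..ntaps-1 (assumed zero elsewhere);
 * c_in[] has 2*half entries, c_out[] and d_out[] have half entries each. */
void decompose_step(const double *c_in, size_t half,
                    const double *a, const double *b, size_t ntaps,
                    double *c_out, double *d_out)
{
    size_t j, t;
    for (j = 0; j < half; j++) {
        double cs = 0.0, ds = 0.0;
        for (t = 0; t < ntaps; t++) {
            size_t n = 2 * j + t;          /* t = n - 2j */
            if (n >= 2 * half)
                break;                     /* samples past the end count as zero */
            cs += a[t] * c_in[n];
            ds += b[t] * c_in[n];
        }
        c_out[j] = cs;
        d_out[j] = ds;
    }
}

/* One reconstruction step:
 *   c_out[j] = sum_n ( p[j-2n] * c_in[n] + q[j-2n] * d_in[n] ).
 * c_in[] and d_in[] have half entries; c_out[] has 2*half entries. */
void reconstruct_step(const double *c_in, const double *d_in, size_t half,
                      const double *p, const double *q, size_t ntaps,
                      double *c_out)
{
    size_t j, n, t;
    for (j = 0; j < 2 * half; j++)
        c_out[j] = 0.0;
    for (n = 0; n < half; n++) {
        for (t = 0; t < ntaps; t++) {
            j = 2 * n + t;                 /* t = j - 2n */
            if (j >= 2 * half)
                break;
            c_out[j] += p[t] * c_in[n] + q[t] * d_in[n];
        }
    }
}
```

With the m = 1 sequences, decomposing {4, 2, 5, 7} gives the coarse coefficients {3, 6} and the wavelet coefficients {1, -1}, and one reconstruction step returns the original four values.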

3. THE RECONSTRUCTION AND DECOMPOSITION SEQUENCES. From (8), (9),
and the definition of the spline wavelet $\psi_m(x)$ in (5), it is easy to derive the reconstruction
sequences $\{p_n\}$ and $\{q_n\}$ for each fixed positive integer $m$. In fact, we have:

(24)  p_n = 2^{-m+1} \binom{m}{n}  for 0 \le n \le m;   p_n = 0 otherwise,

and

(25)  q_n = \frac{(-1)^n}{2^{m-1}} \sum_{j=0}^{m} \binom{m}{j} N_{2m}(n - j + 1)  for 0 \le n \le 3m - 2;   q_n = 0 otherwise.
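
As a quick check of (24) and (25), the linear case m = 2 can be worked out by hand. The only input not stated in the text is the standard set of integer values of the cubic B-spline, $N_4(1) = N_4(3) = 1/6$ and $N_4(2) = 2/3$, with $N_4$ vanishing at all other integers.

```latex
\begin{align*}
p_0,\ p_1,\ p_2 &= \tfrac12,\ 1,\ \tfrac12, \\
q_0 &= \tfrac12\, N_4(1) = \tfrac{1}{12}, \\
q_1 &= -\tfrac12\,\bigl( N_4(2) + 2N_4(1) \bigr) = -\tfrac12, \\
q_2 &= \tfrac12\,\bigl( N_4(3) + 2N_4(2) + N_4(1) \bigr) = \tfrac56, \\
q_3 &= -\tfrac12\,\bigl( 2N_4(3) + N_4(2) \bigr) = -\tfrac12, \\
q_4 &= \tfrac12\, N_4(3) = \tfrac{1}{12}.
\end{align*}
```

So for m = 2 the reconstruction sequences are $\{p_n\} = \tfrac12\{1, 2, 1\}$ and $\{q_n\} = \tfrac{1}{12}\{1, -6, 10, -6, 1\}$, both finite.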

To verify (25), one simply applies the B-spline identity

    N_{2m}^{(m)}(x) = \sum_{j=0}^{m} (-1)^j \binom{m}{j} N_m(x - j)

(cf. [1, chap. 1]). To describe the decomposition sequences $\{a_n\}$ and $\{b_n\}$, we need the
Euler-Frobenius polynomials

    \Pi_{2m-1}(z) = (2m-1)! \sum_{n=0}^{2m-2} N_{2m}(n+1) z^n

(which are polynomials of degree $2m - 2$). It is well known (cf. S [12]) that $\Pi_{2m-1}(z)$ has
$2m - 2$ distinct (real) negative roots which are in reciprocal pairs; that is, arranging $\{r_j\}$
in decreasing order, we have

    r_{2m-2} < r_{2m-3} < \cdots < r_m < -1 < r_{m-1} < \cdots < r_1 < 0

with $r_1 r_{2m-2} = \cdots = r_{m-1} r_m = 1$. In particular, since $\Pi_{2m-1}(z) \ne 0$ for $|z| = 1$, its
reciprocal has an absolutely convergent Fourier series expansion:


(26)  \frac{1}{\Pi_{2m-1}(z)} = \frac{1}{(2m-1)!} \sum_{n \in \mathbb{Z}} \alpha_n^{(m)} z^{n-m+1},   |z| = 1,

where

(27)  |\alpha_n^{(m)}| = O(|r_m|^{-|n|}),   |n| \to \infty.

The decomposition sequences $\{a_n\}$ and $\{b_n\}$ are given by


(28) 


and 

(29) 


_ ]_  f  (1  +  C)m  n2m_j(c)  n-2  J 

~  2zn  J  2m  n2m_,(z2)*  f" 
1*1=1 

(2m  -  1)!  (I-;)™  ,w-2i. 

2m  n2m_,(--2r 


Hence, in view of (27), we have

(30)  |a_n|, |b_n| = O(|r_m|^{-|n|/2}),   |n| \to \infty.

Here, the larger the order $m$ of splines we use, the smaller is the exponent $|r_m|$ (cf. S [11]).

It is possible to interchange the pair $(\{a_n\}, \{b_n\})$ of decomposition sequences and
the pair $(\{p_n\}, \{q_n\})$ of reconstruction sequences. This is the so-called duality principle
introduced in CW [7]. To do so, we simply replace the B-splines $N_m(x - j)$ and wavelets
$\psi_m(x - j)$ by (an appropriate shift of) their dual bases. Details are given in CW [7].

4. FINAL REMARKS. Although we have obtained the same wavelet decomposition (19)
as M [10,11], we have used different bases $\{N_{m;k,j}\}$ and $\{\psi_{m;k,j}\}$. In M [10,11], in order to
use Fourier methods, the B-splines $\{N_{m;k,j}\}$ are orthonormalized, and thus orthonormal
wavelets are also obtained. For curve design, however, it is much more efficient to use
the B-spline series in order to keep the local structure. As a bonus, we have two finite
sequences $\{p_n\}$ and $\{q_n\}$ for reconstruction. If the decomposition sequences $\{a_n\}$ and
$\{b_n\}$ are also required to be finite, then it is proved in CW [8] that we no longer have
spline curves. In fact, it is proved in CW [8] that this requirement is achieved only by
the compactly supported orthonormal wavelets of Daubechies (cf. D [9]), which have no
explicit formulations and are not as smooth as the spline wavelets in this paper.
Fig. 1 (Linear B-wavelet and its magnitude spectrum)

Fig. 2 (Cubic B-wavelet and its magnitude spectrum)

Fig. 3 (Cubic spline/wavelet decomposition of curves)
ABSTRACT

As a new product is being designed, there are generally two or more different
kinds of engineering analyses that need to be performed. It is still often the case
that in the course of developing a new product design, several geometric models
may have to be constructed, because different engineering analysis software packages
generally require different forms of input. Irrespective of the specific
representation used for a three-dimensional geometry/material database, rarely
does an application code read that database directly. Rather, a specific interrogation
method is used to pass particular geometric/material attributes to the application
code.

In this paper, the strategy used in the BRL-CAD Package is presented. A variety
of procedural interfaces have been provided so that diverse analysis codes can be
driven from a single, central geometric model. These interfaces include the ability
to produce a wireframe representation of the model, intersect rays with the model,
tessellate the model into a 3-D surface mesh, and perform topological feature
extraction. In addition, extensions to the software will be discussed, including the
ability to approximate the model as a 3-D finite-element volume mesh, and to convert
the model to a homogeneous trimmed B-Spline representation.
Multiple Families of Engineering Analyses
Interrogating a Single Geometric Model

Michael John Muuss

U. S. Army Ballistic Research Laboratory
Aberdeen Proving Ground
Maryland 21005-5066 USA

1.  Introduction 

As a new product is being designed, there are generally two or more different kinds of
engineering analyses that need to be performed: for example, vulnerability analysis, structural
analysis, thermal analysis, and computational fluid dynamics (CFD), as well as predictive radar,
infra-red (IR), and X-ray signatures. At any stage of the design, almost any kind of CAD system
can be used to make engineering drawings. However, drawings are suitable only for interpretation
by human beings, not for automatic computerized analysis. Therefore, computerized drawings
cannot be used to provide the geometric input required for these analysis codes.


Figure  1  —  The  Design  Loop 


If the product is designed, not on a computer drafting system, but instead on a full 3-D solid
modeling system, then in addition to producing drawings, the model can be subjected to numerous
engineering analyses, allowing the effects of varying many parameters to be studied in a controlled
and automatic way. Thus, the real payoff from building a solid geometric model comes when it is
time to analyze it. This capability is so powerful that it ordinarily justifies any extra time or
equipment investments needed to support the construction of the 3-D solid model.

2.  Solid  Modeling  and  the  Design  Process 

For more than twenty years, the Ballistic Research Laboratory (BRL) has been utilizing
solid geometric modeling methods to support vulnerability/lethality and neutron transport studies
of military targets.1,2 In such item-level studies, target geometry and material information are
passed to various application codes to derive certain measures-of-performance. Building on the
general paradigm, the BRL and many others have extended the general techniques to support
many predictive signature models3 including optical, millimeter wave (MMW), infra-red (IR),
magnetic and X-ray.

It is important to note that this type of analysis must generally be supported by a solid
geometric model. A solid model4 is a computer description of closed, solid, three-dimensional
shapes represented by an analytical framework within which the three-dimensional material can
be completely and unambiguously defined. Properly utilized, the solid model becomes the central
element in the iterative process of taking a design from idea to prototype design to working design
to optimized design. This iterative process is termed the "design loop". The early form of the
design loop is illustrated in Figure 1.

In  a  full  scale  solid  modeling  system,  there  is  no  need  for  initial  drawings;  the  designer 
expresses  the  initial  structures  directly  into  the  modeling  system’s  editor,  just  as  a  modern  author 
creates  his  “rough  draft”  directly  into  a  word  processor.  At  the  completion  of  each  version  of  the 
design,  the  model  is  subjected  to  a  battery  of  analyses  appropriate  to  the  function  of  the  object 
being designed. Strength, volume, weight, level of protection, and other similar evaluations can be
reported, along with the production of a variety of images and/or drawings. These automated
analyses help identify weaknesses or deficiencies in a design early in the design process. By
detecting flaws early, the designer has the opportunity to correct his plans before having invested too
much  time  in  a  bad  design,  or  the  designer  can  switch  to  an  entirely  different  approach  which  may 
seem  more  promising  than  the  original  one.  In  this  way,  the  solid  modeling  system  allows  the 
designer  to  concentrate  on  the  important,  creative  aspects  of  the  design  process.  Freeing  the 
designer  of  routine  analysis  permits  designs  to  be  finished  in  less  time  than  previously  required,  or 
allows  much  more  rigorously  optimized  designs  to  be  delivered  in  comparable  timeframes  and  at 
the  same  cost  as  unoptimized  designs  created  using  older  techniques.5  Furthermore,  the  modeling 
system  allows  sweeping  design  changes  to  be  made  quickly  and  cheaply,  allowing  great  flexibility 
in  the  face  of  ever  changing  requirements  and  markets.  The  time  needed  to  create  a  new  product 
can  be  further  decreased  by  re-utilizing  elements  of  earlier  models  and  then  modifying  them  as 
appropriate.  If  an  existing  component  already  in  inventory  is  entirely  suitable  for  use  in  a  new 
design,  significant  manufacturing  and  inventory  savings  will  be  realized. 

Two major families of solid model representations exist, each with several unique advantages.
The first is the Combinatorial Solid Geometry Representation (CSG-Rep). Solid models
of this type are expressed as boolean combinations of primitive solids, which are geometric entities
described by some set of parameters and occupying a fixed volume in space. The second is the
Boundary Representation (B-Rep), of which there are two sub-types: explicit, where each solid is
described by an explicit enumeration of the extent of the surface of the solid, and implicit, where
the surface of the solid is described by an analytic function such as a Coons patch, Bezier patch,
B-spline, etc. Hybrid systems such as the BRL-CAD Package also exist.

The objective of a given application will to a large degree determine the most "natural"
form in which the model might be presented. For example, extracting just the edges of the objects
in a model would be suitable for a program attempting to construct a wire-frame display of the
model. Another family of applications exists which needs to be able to find the intersection of
small object paths (e.g., photons) with the model. Generally, these alternatives are motivated by
the representation of a physical process being simulated, and each alternative is useful for a whole
family of applications.

Unfortunately, it is not often the case that building only one 3-D model of the product is
enough. Each of the different engineering analysis software packages needed to perform the
analyses usually requires a different form of input. As a result, more than one kind of geometric
model may have to be constructed. However, rarely do these application codes read the
three-dimensional geometry/material database directly. Rather, each application has a specific
interrogation method that is invoked to obtain geometric and material attributes from a source or
reference file. The physical simulation techniques used in the application software are therefore
constrained by the available techniques for extracting geometric information from the model. As a
result, each analysis package often requires a unique form of input. Without a central geometry
database that can drive all the analysis packages, the designer can be forced into having to create
many different representations of each design, one for each distinctly different type of analysis
code. This can be very costly and time consuming; the time needed to create a single model ranges
between 1 week and several man-years, depending on the complexity of the design. Having to
spend the effort to manually create the same design in different formats to drive several analysis
codes is an unfortunate and expensive necessity.

The philosophy adopted at BRL has been to develop a broad set of analyses which are
supported from the same geometry database.3 These analyses cover the spectrum from engineering
decision aids, to design validators, to signature prediction codes, to the generation of wireframe
drawings, to high-resolution image generation for management comprehension and sales
advantage. Key analysis capabilities have been developed to assess the strength, weight, protection, and
performance levels offered by the structures represented by a solid model. Using this analysis
information and additional domain-specific applications tools makes it possible to produce highly
detailed designs constructed with a philosophy of system optimization right from the start.8 This
facilitates the rapid development of products with the desired levels of performance at the best
attainable price.

To  accomplish  all  these  goals,  the  BRL-CAD  Package1  provides  a  variety  of  procedural 
interfaces  so  that  the  diverse  collection  of  analysis  codes  can  be  driven  from  a  single,  central 
geometric  model.  These  procedural  interfaces  follow  the  natural  object-oriented  programming 
interface.  An  application  program  retrieves  one  or  more  objects  from  the  model  database,  and 
then requests those objects to either interrogate themselves in the desired way, or convert
themselves into the desired representation. This applications interface is depicted in Figure 2.

3.  Wireframe  Representation 

The interactive model editor mged primarily employs 3-D wireframe outlines of
the various solid objects, in order to maintain the highest possible speed of user interaction. The
conversion of database objects into wireframe drawings is the simplest of the application interfaces
provided by librt.

After the user specifies which objects from the model database should be displayed, mged
retrieves the necessary database records and invokes the ft_plot() interface. ft_plot() passes the
database object to the appropriate object-specific wireframe converter, which generates a
wireframe outline of that object.

The wireframe consists of a collection of 3-D virtual pen-plotter move and draw operations,
returned to the application as a linked list of vlist structures attached to the
application-provided vlhead structure. Each vlist structure has three elements: vl_pnt, the XYZ coordinates
of a point in space; vl_draw, a flag which indicates whether the virtual pen should be moved
invisibly from the current position to vl_pnt (vl_draw=VL_CMD_LINE_MOVE) or moved visibly,
drawing a line from the current position to vl_pnt (vl_draw=VL_CMD_LINE_DRAW); and
vl_forw, a pointer that provides the linkage to the next vlist structure in the list.
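
To make the list layout concrete, the following is a minimal, self-contained sketch of such a move/draw list. It is not the librt definition: the field names vl_pnt, vl_draw and vl_forw and the two command names come from the text above, while the numeric values of the commands, the vlhead field names (vh_first, vh_last) and the three helper functions are assumptions made purely for illustration.

```c
#include <stdlib.h>

#define VL_CMD_LINE_MOVE 0   /* pen up: reposition only (value assumed) */
#define VL_CMD_LINE_DRAW 1   /* pen down: draw a line (value assumed) */

struct vlist {
    double        vl_pnt[3];  /* XYZ coordinates of the target point */
    int           vl_draw;    /* VL_CMD_LINE_MOVE or VL_CMD_LINE_DRAW */
    struct vlist *vl_forw;    /* next element, NULL at the end of the list */
};

struct vlhead {               /* hypothetical layout of the list head */
    struct vlist *vh_first;
    struct vlist *vh_last;
};

/* Append one pen operation; returns 0 on success, -1 if malloc fails. */
int vl_append(struct vlhead *vh, double x, double y, double z, int draw)
{
    struct vlist *vp = malloc(sizeof *vp);
    if (vp == NULL)
        return -1;
    vp->vl_pnt[0] = x;
    vp->vl_pnt[1] = y;
    vp->vl_pnt[2] = z;
    vp->vl_draw = draw;
    vp->vl_forw = NULL;
    if (vh->vh_last != NULL)
        vh->vh_last->vl_forw = vp;
    else
        vh->vh_first = vp;
    vh->vh_last = vp;
    return 0;
}

/* Count the visible line segments, i.e. the draw commands in the list. */
int vl_count_segments(const struct vlhead *vh)
{
    const struct vlist *vp;
    int n = 0;
    for (vp = vh->vh_first; vp != NULL; vp = vp->vl_forw)
        if (vp->vl_draw == VL_CMD_LINE_DRAW)
            n++;
    return n;
}

/* Release every element and reset the head to the empty state. */
void vl_free(struct vlhead *vh)
{
    struct vlist *vp = vh->vh_first, *next;
    while (vp != NULL) {
        next = vp->vl_forw;
        free(vp);
        vp = next;
    }
    vh->vh_first = vh->vh_last = NULL;
}
```

A display routine would walk the list in the same way as vl_count_segments(), updating its current pen position on every element and emitting a line only for the draw commands.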

Figure  2  —  The  Applications  Interface 

4.  Ray  Tracing 

Many  phenomena  that  are  ordinarily  difficult  to  model  can  be  handled  simply  and  elegantly 
with  ray-tracing.  For  example,  an  illumination  model  based  on  ray-tracing  merely  needs  to  fire  a 
ray  at  each  light  source  to  determine  the  total  light  energy  at  each  point.  Ray-tracing  also  makes 
it  easy  to  deal  with  objects  that  are  partly  or  entirely  reflective,  and  with  transparent  objects  that 
have  varying  refractive  indices.  Furthermore,  by  applying  the  proper  sorts  of  dither,9  motion- 
blur,  shadow  penumbra,  depth-of-field,  translucency,  and  other  effects  are  easily  achieved. 

The power of the lighting model code can be further extended by making a provision to record
the paths of all the rays followed when computing the light intensity for each pixel in an auxiliary
file. This capability allows one to follow the path of the light rays passing through lenses and
reflecting from mirrors while performing image rendering, with no additional computation. Studying
the paths of light rays as they are repeatedly bent by passing from air to glass and back again has
traditionally been a painstaking manual procedure for lens designers. By modeling, it becomes
possible to predict lens behavior, including making a determination of the exact focal length,
finding the precise influence of spherical distortions and edge effects, determining the amount of image
distortion due to internal reflection and scattering, and finding the level of reflections from the
lens mounting hardware. Furthermore, experiments can be conducted to determine the effects of
adding or removing baffles, irises, special lens coatings, etc.

Rays begin at a point P, and proceed infinitely in a given direction D. Any point X on a
ray may be expressed as a linear combination of P and D:

    X = P + t * D

where valid solutions for t are in the range [0, oo). (Dx, Dy, Dz) are the direction cosines for the
ray, being the cosine of the angle between the ray and the appropriate axis. While not necessary,
in this analysis it is assumed that D is of unit length, i.e. |D| = 1.
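
The parametric form is simple enough to state directly in code. The sketch below is an illustration only: the structure is a stand-in with two fields named after the r_pt and r_dir elements mentioned later in the paper, and both helper functions are hypothetical.

```c
/* Stand-in ray type: r_pt is the start point P, r_dir the direction D. */
struct ray {
    double r_pt[3];
    double r_dir[3];
};

/* Evaluate X = P + t * D.
 * Returns 0 on success, or -1 (leaving x untouched) when t < 0,
 * since valid points on the ray have t in [0, infinity). */
int ray_point(const struct ray *r, double t, double x[3])
{
    int i;
    if (t < 0.0)
        return -1;
    for (i = 0; i < 3; i++)
        x[i] = r->r_pt[i] + t * r->r_dir[i];
    return 0;
}

/* Squared length of D; equals 1 when D is of unit length as assumed. */
double ray_dir_len_sq(const struct ray *r)
{
    return r->r_dir[0] * r->r_dir[0]
         + r->r_dir[1] * r->r_dir[1]
         + r->r_dir[2] * r->r_dir[2];
}
```

Because D has unit length, the parameter t is the distance from P measured in model units.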

The  most  traditional  approach  to  ray-tracing  has  been  batch-oriented,  with  the  user  defining 
a  set  of  “viewing  angles”,  turning  loose  a  big  batch  job  to  compute  all  the  ray  intersections,  and 
then  post-processing  all  the  ray  data  into  some  meaningful  form.  However,  the  major  drawback  of 
this  approach  is  that  the  application  has  no  immediate  control  over  ray  paths,  making  another 
batch  run  necessary  for  each  level  of  reflection,  etc. 

In  order  to  be  successful,  applications  need:  (1)  interactive  control  of  ray  paths,  to  naturally 
implement  reflection,  refraction,  and  fragmenting  into  multiple  subsidiary  rays,  and  (2)  the  ability 
to  fire  rays  in  arbitrary  directions  from  arbitrary  points.  Nearly  all  non-batch  implementations 
have  closely  coupled  a  specific  application  (typically  a  model  of  illumination)  with  the  ray-tracing 
code,  allowing  efficient  and  effective  control  of  the  ray  paths.  The  most  flexible  approach  of  all  is 
to provide the ray-tracing capability through a general-purpose library, and make the
functionality available as needed to any application.

4.1.  RT  Library  Interface 

In order to give all applications interactive control over the ray paths, and to allow the rays
to be fired in arbitrary directions from arbitrary points, BRL has implemented its third generation
ray-tracing capability as a set of library routines. The RT library exists to allow application
programs to intersect rays with model geometry. There are two parts to the interface: preparation
routines and the actual ray-tracing routine. Three "preparation" routines exist. The first routine
which must be called is rt_dirbuild(), which opens the database file and builds the in-core
database table of contents. The second routine to be called is rt_gettree(), which adds a database
sub-tree to the active model space. rt_gettree() can be called multiple times to load different
parts of the database into the active model space. The third routine is rt_prep(), which
computes the space partitioning data structures, and does other initialization chores, prior to actual
ray-tracing. Calling this routine is optional, as it will be called by rt_shootray() if needed.
rt_prep() is provided as a separate routine to facilitate more accurate timing of the preparation
and ray-tracing phases of applications.

To compute the intersection of a ray with the geometry in the active model space, the
application must call rt_shootray() once for each ray. Ray behaviors such as perspective, reflection,
refraction, etc., are entirely determined by the applications program logic, and not by the
ray-tracing library. The ray-path specification determined by the applications program is passed as a
parameter to rt_shootray() in the application structure, which contains five major elements:
the vector a_ray.r_pt (P), which is the starting point of the ray to be fired; the vector a_ray.r_dir
(D), which is the unit-length direction vector of the ray; the pointer *a_hit(), which is the address
of an application-provided routine to call on those rays where some geometry is hit by the ray; the
pointer *a_miss(), which is the address of an application-provided routine to call on those rays
where the ray does not hit any geometry; and the flag a_onehit; plus various locations for applications
to store state (recursion level, colors, etc.).

When the a_onehit flag is set to zero, the ray exhibits classical behavior, and is traced
through the entire model. Applications such as lighting models may often only be interested in
the first object hit; in this case, a_onehit may be set to the value one to stop ray-tracing as soon
as the ray has intersected at least one piece of geometry. Similarly, if only the first three hits are
required (such as in the routine that refracts light through glass), then a_onehit may be given the
value of three. Then, at most three hit points will be returned: an in-hit, an out-hit, and a
subsequent in-hit. When only a limited number of intersections are required, the use of this flag can
provide a significant savings in run-time.

The return from the application-provided a_hit()/a_miss() routine is the formal return of the
function rt_shootray(). The rt_shootray() function is prepared for full recursion so that the
application-provided a_hit()/a_miss() routines can themselves fire additional rays by calling
rt_shootray() recursively before deciding their own return value. In addition, the function
rt_shootray() is fully prepared to be operating in parallel with other instances of itself in the
same address space, allowing the application to take advantage of parallel hardware capabilities
where such exist.
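
The callback and return-value conventions can be tried out without librt. The toy below is not the library: shootray() is a hypothetical stand-in that intersects a ray with one hard-wired slab (the region 0 <= x <= 10) and treats rays parallel to the slab as misses. The application structure here is cut down to the fields the sketch needs, with names chosen only to resemble those in the text. What it does show is the pattern described above: the value returned by the hit or miss routine becomes the return value of the shooting routine, and a hit routine may fire a further ray before deciding its own return value.

```c
struct application {
    double a_pt[3];            /* ray start point P */
    double a_dir[3];           /* unit-length ray direction D */
    int  (*a_hit)(struct application *ap, double t_in, double t_out);
    int  (*a_miss)(struct application *ap);
    int    a_level;            /* application state: recursion depth */
};

#define SLAB_LO  0.0           /* toy geometry: the slab 0 <= x <= 10 */
#define SLAB_HI 10.0

/* Toy stand-in for the ray shooter: calls a_hit or a_miss and returns
 * whatever that callback returns. */
int shootray(struct application *ap)
{
    double dx = ap->a_dir[0];
    double t0, t1, tmp;

    if (dx == 0.0)
        return ap->a_miss(ap);         /* parallel rays: treated as misses */
    t0 = (SLAB_LO - ap->a_pt[0]) / dx;
    t1 = (SLAB_HI - ap->a_pt[0]) / dx;
    if (t0 > t1) {
        tmp = t0; t0 = t1; t1 = tmp;   /* order so that t0 is the entry */
    }
    if (t1 <= 0.0)
        return ap->a_miss(ap);         /* slab lies behind the start point */
    if (t0 < 0.0)
        t0 = 0.0;                      /* ray starts inside the slab */
    return ap->a_hit(ap, t0, t1);
}

/* Hit routine: counts this hit and, at level 0 only, fires one more ray
 * back toward the slab from one unit beyond the exit point. */
int count_hit(struct application *ap, double t_in, double t_out)
{
    struct application bounce;
    int i;

    (void)t_in;
    if (ap->a_level > 0)
        return 1;
    bounce = *ap;
    bounce.a_level = ap->a_level + 1;
    for (i = 0; i < 3; i++) {
        bounce.a_pt[i]  = ap->a_pt[i] + (t_out + 1.0) * ap->a_dir[i];
        bounce.a_dir[i] = -ap->a_dir[i];
    }
    return 1 + shootray(&bounce);      /* recursive call before returning */
}

/* Miss routine: nothing was hit. */
int count_miss(struct application *ap)
{
    (void)ap;
    return 0;
}
```

Firing a ray from (100, 0, 0) in direction (-1, 0, 0) makes shootray() return 2: one hit from the primary ray plus one from the ray fired inside count_hit().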


4.2.  Sample  RT  Application 

A  simple  application  program  that  fires  one  ray  at  a  model  and  prints  the  result  is  included 
below,  to  demonstrate  the  simplicity  of  the  interface  to  librt. 

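    #include <stdio.h>
    /* plus the librt headers that declare struct application, VSET(), rt_*() */

    struct application ap;
    struct rt_i *rtip;
    int hit_geom(), miss_geom();    /* callbacks, defined below */

    main()
    {
        rtip = rt_dirbuild("model.g");   /* open database, build table of contents */
        rt_gettree(rtip, "car");         /* add the "car" sub-tree to the active model */
        rt_prep(rtip);                   /* optional: build space partitioning */

        VSET( ap.a_ray.r_pt, 100, 0, 0 );    /* ray starting point */
        VSET( ap.a_ray.r_dir, -1, 0, 0 );    /* unit-length ray direction */
        ap.a_hit = hit_geom;
        ap.a_miss = miss_geom;
        ap.a_rt_i = rtip;
        rt_shootray( &ap );
    }

    hit_geom(app, part)
    struct application *app;
    struct partition *part;          /* head of the partition list */
    {
        printf("Hit %s\n", part->pt_forw->pt_regionp->reg_name);
        return 1;
    }

    miss_geom()
    {
        printf("Missed\n");
        return 0;
    }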

4.3.  Ray  Intersection  Data 

If a given ray hits some model geometry, the application-provided routine indicated in the
a_hit() pointer is called. A pointer to the head of a doubly-linked list of partition structures is
provided. Each partition structure contains information about a line segment along the ray; the
partition has both an "in" and an "out" hit point. Each hit point is characterized by the hit
distance hit_dist, which is the distance t from the starting point r_pt along the ray to the hit point.
The linked list of partition structures is sorted by ascending values of hit_dist.

As a result of this definition, the "line-of-sight" distance between any two hit points can be
determined simply by subtracting the two hit_dist values. This will give the distance between
the hit points, in millimeters.
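
A short sketch shows how such distances fall out of the partition list. The structures below are simplified stand-ins, not the librt definitions: the hit points are embedded directly in each partition, and the list is assumed to be circular with the head acting as a sentinel, so that the first real partition is head->pt_forw. Both function names are hypothetical.

```c
/* Simplified stand-ins: the real librt structures carry many more fields. */
struct hit {
    double hit_dist;              /* distance t along the ray, in mm */
};

struct partition {
    struct hit        pt_inhit;   /* where the ray enters this segment */
    struct hit        pt_outhit;  /* where the ray leaves this segment */
    struct partition *pt_forw;    /* next partition (circular list, assumed) */
    struct partition *pt_back;    /* previous partition */
};

/* Total in-to-out path length over every partition after the head. */
double los_thickness(const struct partition *head)
{
    const struct partition *pp;
    double total = 0.0;

    for (pp = head->pt_forw; pp != head; pp = pp->pt_forw)
        total += pp->pt_outhit.hit_dist - pp->pt_inhit.hit_dist;
    return total;
}

/* Length of the empty gap between partition pp and the one after it;
 * returns -1.0 when pp is the last partition in the list. */
double gap_after(const struct partition *head, const struct partition *pp)
{
    if (pp->pt_forw == head)
        return -1.0;
    return pp->pt_forw->pt_inhit.hit_dist - pp->pt_outhit.hit_dist;
}
```

For two partitions spanning 10 to 15 mm and 20 to 50 mm along the ray, the material thickness is 35 mm and the gap between them is 5 mm.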

If the flag a_onehit was set non-zero, then only the first a_onehit hit points along the
partition list are guaranteed to be correct; any additional hit points provided should be ignored. This
is usually important only when a_onehit was set to an odd number; the value of pt_outhit in the
last partition structure may not be accurate, and should be ignored.

If the actual 3-space coordinates of the hit point are required, they can be computed into the
hit_point element with the macro:

    struct xray *rayp;
    struct hit *hitp;

    VJOIN1( hitp->hit_point, rayp->r_pt, hitp->hit_dist, rayp->r_dir );

which is the C-language version of

    X = P + t * D

4.4. Surface Normals

As an efficiency measure, only the hit distances are computed when a ray is intersected with
the model geometry. For any hit point, the surface normal at that point can be easily acquired by
executing the C macro:

    struct hit *hitp;
    struct soltab *stp;
    struct xray *rayp;

    RT_HIT_NORM( hitp, stp, rayp );

In addition to providing the unit-length outward-pointing surface normal in hitp->hit_normal,
this macro also automatically computes the 3-space coordinates of the hit point in hitp->hit_point.

4.5.  Gaussian  Curvature 

For any hit point, after the surface normal has been computed, the Gaussian surface curvature
at that hit point can be acquired by executing the C macro:

    struct curvature *curvp;
    struct hit *hitp;
    struct soltab *stp;

    RT_CURVE( curvp, hitp, stp );

A curvature structure has three elements: crv_pdir, the principal direction; crv_c1, the curvature
in the principal direction; and crv_c2, the curvature in the other direction. The principal
direction vector crv_pdir has unit length. |c1| <= |c2|, i.e. c1 is the most nearly flat principal
curvature. A positive curvature indicates that the surface bends toward the (outward pointing)
normal vector at that point. c1 and c2 are the inverse radii of curvature. The other principal
direction is implied, and can be found by taking the cross product of the normal with crv_pdir,
i.e., pdir2 = normal x pdir1.
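
The cross-product rule and the relation between the two principal curvatures and the Gaussian curvature (their product) can be written down directly. This is a sketch under assumptions: the structure mirrors the three elements named above but is not the librt definition, and the three function names are hypothetical.

```c
/* Stand-in for the curvature structure described in the text. */
struct curvature {
    double crv_pdir[3];   /* unit-length principal direction */
    double crv_c1;        /* curvature in the principal direction */
    double crv_c2;        /* curvature in the other direction */
};

/* out = a x b */
void cross3(double out[3], const double a[3], const double b[3])
{
    out[0] = a[1] * b[2] - a[2] * b[1];
    out[1] = a[2] * b[0] - a[0] * b[2];
    out[2] = a[0] * b[1] - a[1] * b[0];
}

/* The implied second principal direction: pdir2 = normal x pdir1. */
void second_principal_dir(double pdir2[3], const double normal[3],
                          const struct curvature *cv)
{
    cross3(pdir2, normal, cv->crv_pdir);
}

/* Gaussian curvature is the product of the two principal curvatures. */
double gaussian_curvature(const struct curvature *cv)
{
    return cv->crv_c1 * cv->crv_c2;
}
```

On a sphere of radius 2 the surface bends away from the outward normal in every direction, so both principal curvatures are -0.5 and the Gaussian curvature is 0.25.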

4.6. U-V Mapping

Both  the  U  and  V  coordinates  range  from  0.0  to  1.0  inclusive.  A  given  (U,V)  coordinate  may 
appear  at  more  than  one  place  on  the  surface  of  the  object.  The  (U,V)  coordinate  of  the  hit  point 
is  returned  in  uvcoord  structure  elements  uv_u  and  uv_v. 

In addition, the approximate “beam coverage” of the ray, in U-V space, is returned in the
structure elements uv_du and uv_dv. These approximate values are based upon the ray’s initial
beam radius (a_rbeam) and beam divergence per millimeter (a_diverge) as specified in the application
structure. These delta-U and delta-V values can be helpful for anti-aliasing or filtering areas
of the original texture map to produce an “area sample” value for the hit point.

For any hit point, after the value of hit_point has been computed, the U-V coordinates of
that  point  can  be  acquired  by  executing  the  C  macro: 

struct  application  *ap; 

struct  hit  *hitp; 

struct  soltab  *stp; 
struct  uvcoord  *uvp; 

RT_HIT_UVCOORD(  ap,  stp,  hitp,  uvp  ); 

For some simple lighting-model applications, it is sometimes desirable to create a mapping
from the coordinate system on the surface of an object to the coordinate system of a square,
the so-called “U-V” coordinates. This is generally used to drive simple 2-D texture mapping algorithms.
The most common application is to extract a “paint” color from a rectangular RGB
image file at coordinates (U,V), and apply this color to the surface of an object. These parameters
can also be used to simulate the effect of minor surface roughness using the bump mapping technique.
Here, the U and V coordinates index into a rectangular file of perturbation angles: the surface
normal returned by RT_HIT_NORM() is then modified by up to plus or minus 90 degrees
in each of the U and V directions, according to the stored perturbation.
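
The “paint” lookup can be illustrated with a small sketch. Everything here is an assumption made for the example: the hyp_texture layout (row-major, three bytes per pixel), the function names, and the choice of point sampling rather than the area sampling that uv_du and uv_dv would allow.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical texture: width x height pixels, 3 bytes (R,G,B)
 * per pixel, stored row by row. */
struct hyp_texture {
    size_t width, height;
    const unsigned char *rgb;
};

/* Map a coordinate t in [0,1] to a pixel index in [0, n-1].
 * The inclusive upper bound 1.0 maps to the last pixel. */
static size_t hyp_uv_to_index(double t, size_t n)
{
    size_t i;
    assert(t >= 0.0 && t <= 1.0 && n > 0);
    i = (size_t)(t * (double)n);
    return (i >= n) ? n - 1 : i;
}

/* Return a pointer to the R,G,B triple found at (u,v). */
static const unsigned char *hyp_paint_color(const struct hyp_texture *tx,
                                            double u, double v)
{
    size_t x = hyp_uv_to_index(u, tx->width);
    size_t y = hyp_uv_to_index(v, tx->height);
    return tx->rgb + 3 * (y * tx->width + x);
}
```

The clamp in hyp_uv_to_index matters because both coordinates range from 0.0 to 1.0 inclusive, so a value of exactly 1.0 would otherwise index one pixel past the end of a row or column.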

5.  3-D  Surface  Mesh 

Combinatorial Solid Geometry (CSG) models are formed by the boolean combination of
“primitive”  solids.4  For  example,  a  plate  with  a  hole  is  most  easily  modeled  as  a  plate  primitive 
minus  a  cylinder  primitive.  It  is  important  to  note  that  in  CSG  models,  there  is  no  explicit 
representation  of  the  surfaces  of  the  solids  stored:  indeed,  for  complex  boolean  combinations  of 
complex  primitives,  some  of  the  resultant  shapes  may  have  very  convoluted  topology  and  surfaces 
that  may  be  at  best  high  degree  polynomials. 
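
The plate-with-a-hole example can be sketched as a point-membership test, which shows why no explicit surface needs to be stored: the boolean expression is simply evaluated per query. The dimensions, the axis-aligned plate, the Z-parallel cylinder, and all function names below are assumptions chosen for illustration.

```c
#include <assert.h>

/* Is point p inside an axis-aligned plate (box) with corners lo, hi? */
static int hyp_in_plate(const double p[3], const double lo[3],
                        const double hi[3])
{
    int i;
    for (i = 0; i < 3; i++)
        if (p[i] < lo[i] || p[i] > hi[i])
            return 0;
    return 1;
}

/* Is point p inside an infinite cylinder of radius r whose axis is
 * parallel to Z and passes through (cx, cy)? */
static int hyp_in_cylinder(const double p[3], double cx, double cy, double r)
{
    double dx = p[0] - cx, dy = p[1] - cy;
    assert(r > 0.0);
    return dx * dx + dy * dy < r * r;
}

/* Plate minus cylinder: inside the plate AND NOT inside the cylinder. */
static int hyp_in_plate_with_hole(const double p[3])
{
    static const double lo[3] = {0.0, 0.0, 0.0};
    static const double hi[3] = {10.0, 10.0, 1.0};
    return hyp_in_plate(p, lo, hi) && !hyp_in_cylinder(p, 5.0, 5.0, 2.0);
}
```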

There  are  many  applications  that  would  benefit  from  being  able  to  express  an  approximation 
of  these  complex  shapes  created  using  CSG  modeling  as  a  collection  of  planar  N-gons  which 
together enclose roughly the same volume of space as the original CSG solid. The most obvious
such application is to drive polygon-based rendering routines (lighting modules) for predictive optical
signatures. On many modern workstations there is direct hardware/firmware support for
high-speed rendering of polygons. In addition, there are whole collections of 2.5-D infra-red
predictive signature programs and 3-D polygon radar codes. The very best predictive radar signatures
can be calculated using the Method of Moments, which requires having a 3-D surface tessellation
to sub-wavelength resolution of the entire model.

A  sensible  strategy  for  converting  a  CSG  model  to  the  equivalent  approximate  3-D  surface 
mesh  is  to  tackle  the  problem  in  two  parts.  First,  a  routine  has  to  be  written  to  convert  each  of 
the primitive solids into tessellated form. Second, a routine has to be written to take two tessellated
objects and combine them according to a boolean operation (union, intersection, or subtraction)
back into a consistent set of solid tessellated objects. Until very recently, it has been this
second step that has proven extremely difficult.10 The topology of solid tessellated objects has
traditionally been represented using the “winged-edge” data structure. The major breakthrough
is due to Kevin Weiler,11,12 who noted that the “winged-edge” data structure was unable to
represent  non-3-Manifold  conditions  that  often  occur  when  performing  boolean  operations.  Weiler 
proposed  changing  from  the  “winged-edge”  data  structure  to  the  “radial-edge”  data  structure, 
suitable  for  representing  all  the  Non-Manifold  Geometric  (NMG)  and  topological  configurations 
that  boolean  operations  might  produce.  Thus,  NMG  objects  are  closed  under  boolean  operations. 

Employing the NMG representation for faceted solid objects gives rise to the rich set of possibilities
diagrammed in Figure 3. From this diagram it should be clear that the final evaluated
NMG  solid  object  can  be  employed  in  a  variety  of  ways.  The  primary  use  will  be  for  input  to 
analysis  codes  that  need  an  approximate  3-D  surface  mesh  of  the  solid  model.  In  this  case,  the 
NMG  objects  are  sent  across  the  interface,  either  directly  into  an  application,  or  via  a  triangulator 
that  turns  the  planar  N-gon  faces  of  the  NMG  objects  into  simple  triangle  lists,  and  thence  to  the 
application.  However,  a  very  powerful  second  use  will  be  to  create  new  faceted  shapes  which  are 
then  stored  back  in  the  database  as  new  geometric  objects,  suitable  for  future  editing  or  analysis. 
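
One simple way a triangulator could turn a planar N-gon into a triangle list is a fan from the first vertex. This is only a sketch under a strong assumption: it is valid for convex faces without holes, and the triangulator referred to above may well use a different method to handle concave faces or faces with interior loops. The function name is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Fan-triangulate a convex N-gon whose vertices are numbered
 * 0..n-1 in order around the face.  Writes 3 * (n - 2) vertex
 * indices into tri_out and returns the number of triangles. */
static size_t hyp_fan_triangulate(size_t n, size_t *tri_out)
{
    size_t i, t = 0;
    assert(n >= 3 && tri_out != NULL);
    for (i = 1; i + 1 < n; i++) {
        tri_out[3 * t + 0] = 0;      /* shared apex of the fan */
        tri_out[3 * t + 1] = i;
        tri_out[3 * t + 2] = i + 1;
        t++;
    }
    return t;
}
```

Each triangle keeps the vertex order of the original face, so the outward-pointing normal of the face carries over to every triangle in the list.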

While  a  detailed  description  of  the  NMG  data  structures  is  beyond  the  scope  of  this  paper, 
there  are  several  advantageous  properties  of  the  NMGs  that  are  worth  mentioning.  The  NMG 
representation  maintains  full  topology  information,  so  that  the  relationships  between  vertices, 
edges,  loops,  faces,  and  shells  are  continuously  available.  The  geometry  information  associated 
with  a  planar  face  is  the  plane  equation  (which  includes  the  outward-pointing  surface  normal):  the 
plane  equation  does  not  have  to  be  re-derived  from  the  vertices.  For  applications  that  would 
prefer  visual  realism  rather  than  geometric  fidelity,  there  is  room  in  the  vertex  geometry  structure 
to carry around a “phony” normal for each vertex, suitable for use in Gouraud shading13 algorithms.

One of the most exciting current research projects at BRL is the extension of the NMG
framework to permit faces to be either planar N-gons or trimmed non-uniform rational B-splines
(“trimmed NURBS”). This will permit many of the tessellation operations to be implemented
exactly, rather than as approximations. This will also permit solids to enjoy the economy of having
most faces be represented as planar N-gons, which are very compact and efficient to process,
while those few faces that require sculptured surface shape control can be represented as trimmed
NURBS. This combination provides both efficiency and full shape control in the rich non-Manifold
topological framework, a combination that does not exist in any current commercial
CAD system.

Figure 3 - NMG Wiring Diagram

6. Topological Representation

Some predictive radar signature codes, such as the TRACK code of GTRI,14 do not operate
directly on a geometric representation of an object. Instead, they rely on the fact that large radar
returns occur primarily due to the existence of dihedral and trihedral structures in the object. The
original model is analyzed to locate all instances of the topological features of interest, for example,
planar face elements, edges where two locally planar elements join to make a dihedral, edges
where  three  locally  planar  elements  join  to  make  a  trihedral,  etc.  Then  this  list  of  topological 
features  is  used  as  input  to  the  feature-based  analysis  code. 

Due to the rather broad set of possible interpretations of the term “feature”, each kind of
topological feature extraction is itself considered an application program, rather than being implemented
as a standard part of the interrogation library. The process of topological feature extraction
is currently programmed using the interrogation features described above.

7.  Extensions 

To date, most BRL-CAD applications programs have been implemented using the ray-tracing
paradigm, because ray-tracing has had a 20-year head start. By choosing the ray sampling
density within the Nyquist limit for a given spatial resolution, many applications based on
ray-tracing are well satisfied by extracting ray/geometry intersection information. However, a
mathematical ray has as its cross section a point, while physical objects have significant cross-sectional
area. This lack of cross-sectional area will always lead to some sampling inaccuracies.
Applications which simulate particles or small rocks approaching the model might benefit from
having a direct cylinder/geometry intersection capability, and applications which shine beams of
light on the model, such as spotlights or even highly collimated light such as laser light, might benefit
from cone/geometry intersection capabilities.15,16 Applications which are attempting to simulate
wave effects might be well expressed in terms of plane/geometry intersection curves, and
structural analysis routines would probably prefer to obtain the geometry as a collection of connected
hyperpatches.

While very recent research has begun to explore techniques for intersecting cylinders, cones,
and planes with geometry,17 ray-tracing and polygon-based techniques are by far the most well
developed approaches. However, there are several additional types of interface to the model
geometry that are likely to be of general applicability. These interfaces are candidates for implementation
in a future release of the BRL-CAD Package.

7.1.  3-D  Finite-Element  Volume  Mesh 

Many  forms  of  energy  flow  analysis,  such  as  heat  flow,  vibrational  analysis  (acoustic  energy 
flow),  and  stress  analysis  require  the  use  of  3-D  Finite-Element  Mesh  (FEM)  techniques.  While 
there has been some work on using the ray-tracing paradigm to construct finite element and finite
difference meshes,18 it has been difficult to deal with high spatial frequency (fine detail) portions of
the  model.  In  particular,  meshing  small  diameter  pipes  is  problematic;  undersampling  can  cause 
the  pipe  to  incorrectly  be  separated  into  multiple  pieces. 

In  order  to  improve  on  the  current  state  of  affairs,  it  seems  necessary  to  provide  support  for 
the generation of volume meshes directly as part of the application interface. This would give
the meshing algorithm unrestricted access to the underlying geometry, the space partitioning
tree, and other internal data in order to perform a better job.

Even  more  promising  still  would  be  a  strategy  that  takes  advantage  of  the  NMG  support.  A 
first  pass  might  tessellate  the  model  and  evaluate  the  booleans  to  produce  a  surface  mesh.  The 
second  pass  would  then  take  the  surface  mesh  and  fill  the  interior  (or  exterior)  volumes  with 
appropriately  chosen  volume  elements.  A  very  good  fit  could  probably  be  achieved  using  only 
parallelepiped  (“brick”)  elements  and  20-node  “superelements”.  The  brick  elements  would  be 
used  to  fill  interior  volume  that  does  not  border  on  a  face,  and  the  superelements  would  be  used 
for  volume  that  contacts  a  face.  Recourse  could  be  made  back  to  the  underlying  geometry 
(perhaps  via  firing  a  few  well  chosen  rays)  to  get  the  curvature  of  the  superelement  faces  to  match 
the curvature of the underlying primitive, rather than having to rely strictly on the NMG
planar-face  approximation. 

7.2.  3-D  Volume  Elements  (Voxels) 

A  representation  which  is  similar  to  the  finite-element  mesh  is  based  on  Volume  Elements 
(VOXELS).  There  are  two  distinct  kinds  of  voxels.  The  first  kind  of  voxels  can  be  considered  a 
special  case  of  volume  meshing  discussed  previously,  in  which  the  model  is  “diced”  into  a  large 
collection of homogeneous parallelepiped (“brick”) elements. As one example, ERIM has a utility
program which uses ray-tracing to convert BRL CSG-format geometry to this kind of voxel
representation to feed a first-principles IR model.19

A distinctly different form of voxel representation is based upon the use of 8-way binary
space subdivision stored using an “oct-tree” data structure. In this technique, the model is
enclosed in a bounding box. The bounding box is evenly split along the X, Y, and Z axes to form
eight smaller boxes. This algorithm is applied recursively so that all boxes which are neither
entirely full nor entirely empty are repeatedly split, until the size of the voxels satisfies some termination
condition. In this way, small voxels that lie along the surface of objects can fit arbitrarily
tightly to the surface, while the interior of an isomorphic region will be contained primarily
in a single large voxel. The oct-tree representation provides the application program with a homogeneous
geometric representation based entirely on cubes of varying size. Having such a homogeneous
representation can often greatly ease the task of algorithm development. On the other
hand, achieving a good approximation of curved objects using cubes requires a huge number of
voxels to be used, resulting in very large voxel datasets, and a rapid increase (order N**6)
in the number of element-to-element equations to be solved. The oct-tree approach to IR signature
generation is employed in the BRL-CAD program, lgt.20
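
The recursive subdivision described above can be sketched in a few lines. To keep the example self-contained, the “model” is assumed to be a single sphere centered at the origin, for which the full/empty/mixed test is exact; a real application would classify each box by interrogating the model geometry instead. All names are hypothetical, and the termination condition is assumed to be a maximum depth.

```c
#include <assert.h>

/* Classification of a cube against the solid. */
enum hyp_fill { HYP_EMPTY, HYP_FULL, HYP_MIXED };

/* Example solid: a sphere of radius r centered at the origin.
 * The cube has minimum corner lo[] and edge length size. */
static enum hyp_fill hyp_classify(const double lo[3], double size, double r)
{
    double near2 = 0.0, far2 = 0.0;
    int i;
    for (i = 0; i < 3; i++) {
        double a = lo[i], b = lo[i] + size;
        /* Nearest and farthest distance from the origin along axis i. */
        double n = (a > 0.0) ? a : (b < 0.0) ? -b : 0.0;
        double f = (-a > b) ? -a : b;
        near2 += n * n;
        far2 += f * f;
    }
    if (far2 <= r * r) return HYP_FULL;    /* farthest corner is inside */
    if (near2 >= r * r) return HYP_EMPTY;  /* nearest point is outside */
    return HYP_MIXED;
}

/* Split every mixed cube into eight children until max_depth is
 * reached; return the number of leaf voxels produced. */
static long hyp_octree_leaves(const double lo[3], double size, double r,
                              int depth, int max_depth)
{
    long count = 0;
    double half = size / 2.0;
    int k;
    assert(size > 0.0 && depth >= 0);
    if (depth >= max_depth || hyp_classify(lo, size, r) != HYP_MIXED)
        return 1;
    for (k = 0; k < 8; k++) {
        double c[3];
        c[0] = lo[0] + ((k & 1) ? half : 0.0);
        c[1] = lo[1] + ((k & 2) ? half : 0.0);
        c[2] = lo[2] + ((k & 4) ? half : 0.0);
        count += hyp_octree_leaves(c, half, r, depth + 1, max_depth);
    }
    return count;
}
```

Only mixed cubes are split, so the leaf count grows mainly with the surface area of the solid rather than with its volume, which is the tight-fit behavior the text describes.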

7.3. Homogeneous Trimmed B-Splines

When support for trimmed NURBS faces has been added to the NMG capability, it will be
possible to represent all existing primitives either with exact rational B-spline versions, or with
very good rational B-spline approximations. This could be done even for faces that were completely
planar.

This  offers  the  hope  that  it  might  be  possible  (albeit  memory  intensive)  to  convert  an  entire 
CSG  solid  model  into  a  homogeneous  collection  of  non-uniform  rational  B-spline  faces  organized  in 
a  non-manifold  topological  data  structure.  In  addition  to  the  conceptual  simplicity  afforded  by 
having  a  uniform  representation  for  shape,  this  affords  the  opportunity  to  create  new  analysis 
codes  that  can  process  curved  surfaces,  yet  at  least  initially  only  have  to  deal  with  one  kind  of 
shape. This would also provide a very direct and natural interface to spline-based21 and Bezier-patch22
based modeling systems.

7.4.  Analytic  Analysis 

Given a homogeneous geometric representation, such as the Trimmed B-Splines just discussed,
which also has an analytic representation, a further processing capability arises. Rather than interrogating
the data base by means of sampling or subdivision techniques, the direct mathematical
manipulation of the source geometry through its parametric representation becomes possible. Calculations
of physical properties requiring integration over a surface can often be evaluated with
greater accuracy using an explicit analytic calculation than would be provided by numerical
evaluation. While this may be difficult in general due to the complexity of the parametric expression,
some classes of surface representations are good candidates. Splines, for example, are piecewise-polynomial
functions which have relatively simple Fourier transform representations. Since 2-D
spatial Fourier transforms arise frequently in far-field electromagnetic scattering calculations,
exploitation of the parametric spline representation is of interest in predictive scattering calculations.
Direct use of spline parameters in a Physical Optics scattering model is part of the methodology
used at the Aircraft Division, Northrop Corporation.
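
As a worked illustration of why spline transforms are simple, consider the easiest case: a uniform, non-rational, centered B-spline in one variable. This is an assumption chosen for clarity and is not a description of the method used in any particular scattering code. The degree-0 spline b_0 is the unit box on [-1/2, 1/2], and the degree-n spline b_n is the (n+1)-fold convolution of b_0 with itself, so its transform follows from the convolution theorem:

```latex
\hat{b}_0(\omega) = \int_{-1/2}^{1/2} e^{-i\omega x}\,dx
                  = \frac{\sin(\omega/2)}{\omega/2},
\qquad
b_n = \underbrace{b_0 * b_0 * \cdots * b_0}_{n+1\ \text{factors}}
\;\Longrightarrow\;
\hat{b}_n(\omega) = \left( \frac{\sin(\omega/2)}{\omega/2} \right)^{n+1}.
```

The transform is available in closed form for every degree, with no numerical quadrature; non-uniform knots and rational weights complicate the expression but do not change the piecewise-polynomial character that makes the integral tractable.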

With the rapidly developing potential of symbolic calculation, treatment of seemingly impossible
formulas resulting from the geometry/physics interaction may become tenable. This could
help to reduce the trend towards employing numerical methods at the outset of a problem and
avoid some of the accompanying instabilities and errors.

8.  Summary 

Much  of  the  power  and  flexibility  of  the  BRL-CAD  Package  comes  from  the  diversity  of 
shapes  that  can  be  represented,  and  the  variety  of  analysis  interfaces  that  are  available.  Having  a 
diversity  of  interface  possibilities  has  permitted  a  wide  variety  of  analysis  codes  to  be  driven  from 
a  single,  central  geometric  model. 

Abstract

In this report, we present and analyze the results of applying parallel algorithms
to a two dimensional gas dynamics code using front tracking. We
also discuss ideas for generalizing the algorithm to three dimensions. The
main purpose of this paper is to demonstrate that parallel computations can
be applied to complex algorithms. We take front tracking as an example.

A number of computations for two dimensional fluid flow have been successful
in simulating chaotic mixing, shock interactions and oil reservoirs.
In most of these cases, further progress depends upon three dimensional computations.
Performing systematic studies of chaotic mixing in three dimensions
will require the extensive computer resources which parallel computations can
offer.

In order to develop a reliable parallel programming paradigm for three dimensional
studies, we first parallelize the relatively simpler serial two dimensional
gas dynamics code on two representative distributed-memory MIMD
parallel supercomputers, the iPSC/860 and the NCUBE/2.

The main algorithmic issues for front tracking in three dimensions are: (1)
the construction of surface grids, (2) the construction of volume grids which
are adapted to (i.e. which respect, or do not overlap with) specified surfaces,
(3) the efficient computation of interface topology, (4) the resolution of self
intersections in a tangled interface, and (5) parallel computation. Most of
these issues are important for numerous methods of computation. We present
methods for addressing these issues which are appropriate to the front tracking
context.

*Supported by the Applied Mathematics Subprogram of the U.S. Department of Energy, DE-FG02-90ER25084,
the National Science Foundation, grant DMS-89018844, and the AFOSR, grant
90-0075.

1 Introduction

Significant  effort  has  been  devoted  to  developing  parallel  algorithms  for  scientific 
problems  with  natural  load  distribution.  We  concentrate  on  exploring  parallelization 
for  problems,  where  such  uniform  load  distributions  do  not  exist,  such  as  the  front 
tracking  algorithms  for  fluid  interfaces. 

Front tracking [3, 7, 9, 10] is an algorithm which preserves, and explicitly recognizes,
discontinuity surfaces as computational degrees of freedom. Marker points are
introduced to define the location of an interface between computational domains.

This  method  has  given  very  high  quality  solutions;  the  price  to  pay  is  probably 
the  complexity  of  its  implementation.  Indeed,  the  value  of  this  method  has  been 
demonstrated  by  a  series  of  computations  for  two  dimensional  fluid  flow.  However,  is 
it  possible  to  generalize  such  a  method  to  three  dimensions?  Is  it  feasible  to  introduce 
parallel  computations  to  this  method? 

In order to answer these questions, we have performed a series of timing studies in
three typical environments for three dimensional front tracking algorithms
[4]. The algorithms studied were interface topology algorithms which arise as a subproblem
of gas dynamics simulations. The main point of this paper is to address the
feasibility of developing parallel computations; we report initial results
for the decomposition of a two dimensional fluid interface into sub-interfaces located
on computational subdomains.

Techniques of generating a surface grid and an unstructured interior volume grid
for  three  dimensional  computational  domains  are  also  discussed.  Several  on-going 
aspects  of  this  project  will  also  be  mentioned  briefly. 

It  is  typical  that  physical  systems  involve  a  number  of  materials  that  are  separated 
by  interfaces.  Decomposing  such  a  system  into  disjoint  components  separated  by 
lower-dimensional  physical  objects  (curves  in  two  dimensions  and  surfaces  in  three 
dimensions),  specifying,  modifying,  and  moving  dynamically  such  interfaces  is  the 
basis  of  front  tracking  or  interface  methods.  The  application  of  these  algorithms  to 
two  dimensional  hyperbolic  systems  was  described  by  Glimm  and  McBryan  [11]  in 
1985. The code has since then been used in a number of physical problems [2, 5, 6,
8].

2 The Parallelization Method: Domain Decomposition

Parallelizing this code is of immediate interest to prove the applicability of parallel
computations in front tracking. Decomposing the self-contained interface data structures
into sub-domains and then mapping these sub-domains onto the participating
processors is our first step towards parallelization. Several requirements are imposed
on the decomposition: at initialization, individual processors are activated to generate
correctly positioned sub-interfaces, while at re-start from an existing full interface,
individual processors are to pick up the right portion of the interface. The distribution
of computational load is rarely uniform; a heavier load appears mostly where the
interface occurs. Moreover, a uniform distribution of the interface is not common.
Therefore conventional uniform domain decomposition is not adequate for a good
speedup. To decrease load imbalance, we consider non-uniform subdomains. For a
number of problems in scientific applications, we are considering interfaces that
frequently cluster in a horizontal (vertical) direction, which leads us to decompose
the interfaces into vertical (horizontal) strip-wise sub-interfaces.
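
One way non-uniform strip boundaries might be chosen is sketched below. This is an illustration only, not the procedure used in the code described here: it assumes a per-column work estimate is available (for example, the number of interface points in the column plus a constant for the regular grid work) and places boundaries greedily so that each strip receives roughly an equal share. The function name and the load model are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Split ncols mesh columns into nstrips contiguous strips of roughly
 * equal load.  load[c] is a work estimate for column c.  On return,
 * start[s] is the first column of strip s and start[nstrips] == ncols,
 * so start must have room for nstrips + 1 entries. */
static void hyp_balance_strips(const double *load, size_t ncols,
                               size_t nstrips, size_t *start)
{
    double total = 0.0, acc = 0.0;
    size_t c, s = 1;
    assert(nstrips >= 1 && nstrips <= ncols);
    for (c = 0; c < ncols; c++)
        total += load[c];
    start[0] = 0;
    for (c = 0; c < ncols && s < nstrips; c++) {
        acc += load[c];
        /* Close the current strip once it holds its share of the
         * load, or when the columns left are only just enough to
         * give every remaining strip one column. */
        if (acc >= total * (double)s / (double)nstrips ||
            ncols - (c + 1) == nstrips - s)
            start[s++] = c + 1;
    }
    start[nstrips] = ncols;
}
```

With a load concentrated in a few columns, the strips covering those columns come out narrower than the others, which is the intent of a non-uniform decomposition.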

Clipping a sub-interface from a full interface is conceptually straightforward. First,
we determine the border curves that bound the sub-interface and then find all curves
that intersect these border curves. Then, we split these intersected curves into two
pieces, one of which lies within the border and is inserted into the sub-interface.
Finally, we find the intersection points and insert them as boundary nodes into the
sub-interface. The locally “interior” elements (nodes and curves) are inserted unchanged.
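
The splitting step can be sketched for the simplest case: a single straight segment of a curve clipped against the two vertical borders of a strip. The types and names are hypothetical, and the sketch handles one segment only; a full implementation would walk every segment of every curve and also treat the horizontal borders.

```c
#include <assert.h>

struct hyp_point { double x, y; };

/* Clip the segment a-b to the vertical strip xlo <= x <= xhi.
 * Returns 1 and writes the clipped endpoints to *ca and *cb (these
 * are the new boundary nodes wherever the segment crosses a border),
 * or returns 0 if the segment lies entirely outside the strip. */
static int hyp_clip_segment(struct hyp_point a, struct hyp_point b,
                            double xlo, double xhi,
                            struct hyp_point *ca, struct hyp_point *cb)
{
    double t0 = 0.0, t1 = 1.0, dx = b.x - a.x;
    assert(xlo < xhi);
    if (dx == 0.0) {
        /* Vertical segment: either wholly inside or wholly outside. */
        if (a.x < xlo || a.x > xhi)
            return 0;
    } else {
        /* Parameter values at which the segment meets each border. */
        double ta = (xlo - a.x) / dx, tb = (xhi - a.x) / dx;
        double tmin = (ta < tb) ? ta : tb;
        double tmax = (ta < tb) ? tb : ta;
        if (tmin > t0) t0 = tmin;
        if (tmax < t1) t1 = tmax;
        if (t0 > t1)
            return 0;
    }
    ca->x = a.x + t0 * dx;
    ca->y = a.y + t0 * (b.y - a.y);
    cb->x = a.x + t1 * dx;
    cb->y = a.y + t1 * (b.y - a.y);
    return 1;
}
```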

The following four figures are included to show two typical decompositions we
performed on an interface obtained from a study of bubble growth for Rayleigh-Taylor
unstable interfaces. This interface (Figure 1) is generated in a domain with
0 < x < 10 and 0 < y < 20 on a 120 x 240 mesh at the 750th time step. Starting
from this interface, we first use 16 processors to decompose it into 15 sub-interfaces
(one processor idle), each on a 40 x 48 mesh. In Figure 2, we draw the
sub-interface clipped by processor 10; this processor was chosen arbitrarily. In
Figure 3, we “glue” all 15 sub-interfaces obtained by the 15 processors into one frame
with size comparable to the original interface for comparison. In Figure 4, we show a
vertically strip-wise decomposition, with each sub-interface lying in a 20 x 240 mesh.
As we can see from Figures 3 and 4, gathering the scattered sub-interfaces reproduces
the original interface (Figure 1). A quantitative comparison,
though fairly tedious, also proves exact consistency.

We must realize that an unconventional domain decomposition, which may help
reduce the load imbalance overhead, can cause a significant communication penalty. We
are currently studying the possibility of an optimal case. The answer, being machine-dependent,
is not meaningful in general. We need to tune for possible optimizations
for a particular machine.

In a message-passing computing environment, communication per se is important
and interesting. We will report results on communication in a separate paper shortly.
Here, we just outline the main ideas for reducing communication costs: (1) exchange
minimum boundary information, which is possible due to the locality of the interface
algorithm: we transfer data extending two mesh blocks beyond a subdomain, which is
typically about 40 mesh blocks wide in each dimension (as in our example case above);
(2) map, in the hypercube communication logic, the nearest-neighbor subdomains
onto nearest-neighbor processors where possible, to reduce unnecessary costs due to
crossing processors.
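
A standard way to place a chain of neighboring subdomains on directly connected hypercube nodes is the binary-reflected Gray code. The text does not say which mapping was actually used, so the sketch below is an assumption: it applies to a one-dimensional chain of strips, and the function names are hypothetical.

```c
#include <assert.h>

/* Binary-reflected Gray code: consecutive values differ in one bit. */
static unsigned hyp_gray(unsigned i)
{
    return i ^ (i >> 1);
}

/* Hypercube node assigned to strip number `strip` (0-based) on a
 * hypercube of dimension dim, i.e. with 2^dim nodes. */
static unsigned hyp_strip_to_node(unsigned strip, unsigned dim)
{
    assert(dim < 32 && strip < (1u << dim));
    return hyp_gray(strip);
}

/* Number of differing bits between two node IDs, which equals the
 * number of hypercube links a message between them must cross. */
static unsigned hyp_hops(unsigned a, unsigned b)
{
    unsigned x = a ^ b, n = 0;
    while (x) {
        n += x & 1u;
        x >>= 1;
    }
    return n;
}
```

With this assignment, strips k and k+1 always sit on nodes one link apart, so the boundary exchange between adjacent strips never has to be routed through an intermediate processor.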

Another issue was software support for portability and ease of programming. We
have tried to build the code on two representative distributed-memory MIMD parallel
supercomputers, the iPSC/860 and the NCUBE/2, as well as a cluster of workstations, with
a home-made, self-contained “interface” library. It appears to be sufficient.

Figure 2: A representative material interface decomposed from processor 10.

Figure 3: A material interface is decomposed into 15 subdomains (box-wise). The
numbers in the sub-interfaces denote the processor ID’s.

Figure 4: A material interface is decomposed into 6 subdomains (strip-wise). The
numbers in the sub-interfaces denote the processor ID’s.

Several  deeper  computer  science  issues  have  also  been  addressed.  For  example, 
a  fairly  new  idea  of  dynamically  managing  the  local  physical  memories  with  chunks 
of  objects  is  implemented  in  this  code  to  broadcast  interfaces  among  processors.  It 
would  have  been  a  simple  matter  if  this  were  done  on  conventional  computers  or  even 
shared-memory machines with “flat” memory space. The major operation involved
in  communicating  objects  among  distributed  memory  processors  is  the  packing,  in 
the  source  memory,  and  the  unpacking,  at  the  destination  processor,  of  the  objects. 
On  the  destination  processors,  we  restore  the  copied  objects  by  computing  their  new 
pointers. 
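
The packing and unpacking of pointer-linked objects can be sketched for a simple linked list of points. The structures and function names are hypothetical; the idea illustrated is only that pointers, which are meaningless in another processor's memory, are replaced by indices before sending and recomputed from those indices on arrival.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical in-memory object: a point linked to its successor. */
struct hyp_pt {
    double x, y;
    struct hyp_pt *next;
};

/* Wire format: the pointer becomes an index into the packed array
 * (-1 stands for a NULL pointer). */
struct hyp_packed_pt {
    double x, y;
    long next_index;
};

/* Source side: pack the list starting at head into buf, in traversal
 * order.  Returns the number of points packed. */
static size_t hyp_pack(const struct hyp_pt *head,
                       struct hyp_packed_pt *buf, size_t cap)
{
    const struct hyp_pt *p;
    size_t n = 0;
    for (p = head; p != NULL; p = p->next) {
        assert(n < cap);
        buf[n].x = p->x;
        buf[n].y = p->y;
        buf[n].next_index = (p->next != NULL) ? (long)(n + 1) : -1L;
        n++;
    }
    return n;
}

/* Destination side: allocate one chunk for all points and recompute
 * the pointers from the indices.  The caller releases the chunk with
 * free().  Returns NULL if n is zero or allocation fails. */
static struct hyp_pt *hyp_unpack(const struct hyp_packed_pt *buf, size_t n)
{
    struct hyp_pt *chunk;
    size_t i;
    if (n == 0)
        return NULL;
    chunk = malloc(n * sizeof *chunk);
    if (chunk == NULL)
        return NULL;
    for (i = 0; i < n; i++) {
        chunk[i].x = buf[i].x;
        chunk[i].y = buf[i].y;
        chunk[i].next = (buf[i].next_index >= 0)
                            ? &chunk[buf[i].next_index] : NULL;
    }
    return chunk;
}
```

Allocating the restored objects as one chunk keeps them contiguous, so a whole sub-interface can later be released or forwarded as a unit.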

3  Surface  and  Volume  Grid  Generation  in  3D 

The interface data structures for three dimensions are an extension [12] of those of
two dimensions. They are conceptually easy to understand. The basic data structures
in three dimensions include COMPONENT, POINT, NODE, TRI(ANGLE), BOND,
CURVE, SURFACE, and INTERFACE.

Decomposing an interface system into disjoint components separated by lower-dimensional
physical objects (curves in two dimensions and surfaces in three dimensions)
and specifying, modifying and moving dynamically such interfaces is the basis
of interface methods. A package of routines that provides facilities for defining, building,
and modifying such decompositions and for efficiently solving various physical
problems comprises the main algorithmic issues of the interface methods.

The  surfaces  are  defined  as  a  collection  of  contiguous  triangles.  After  propagation, 
regridding  may  be  needed  to  achieve  triangles  of  approximately  uniform  size  and 
good  aspect  ratios.  Local  interface  operations  of  merger  or  refinement  of  triangles 
can  accomplish  this.  For  volume  grids,  a  frontal  construction  is  more  suitable  than  a 
Delaunay  triangulation,  as  the  latter  does  not  respect  given  surfaces  unless  additional 
points  are  inserted  in  the  surface.  Interface  topology  is  computed  on  the  basis  of 
a precomputation of hashed lists. Self intersections are detected with the help of
these  hashed  lists  and  are  resolved  by  coboundary  component  information.  Interface 
topology  was  addressed  in;  we  discuss  here  only  the  ideas  of  generating  a  surface  grid 
and  an  unstructured  interior  volume  grid. 


3.1  Surface  Grid  Generation 

First, let us consider the surface grid generation. In three dimensions, an interface
consists of a set of surfaces. Each surface contains a set of triangles and is bounded
by curves. A curve contains a set of doubly linked line segments called bonds. For
a valid interface, there are no intersections between surfaces and no self-intersections
on each surface. Surfaces may connect along boundary curves only. Each triangle
has three pointers. They point to the three neighboring triangles connected at its edges.
If any edge of a triangle is on the boundary of the surface, the corresponding pointer
points to the bond of a boundary curve which coincides with that boundary edge.

We consider surfaces presented parametrically by a mapping onto a planar domain.
We generate the initial surfaces in three steps: (1) approximate each surface by
its parametric plane surface and generate coarse triangles for the plane surface; (2)
repeatedly divide each large triangle into two smaller ones of the same size until all
triangles have size less than the desired value; (3) map the coordinates of the vertices
of the triangles on the planar surface to obtain the desired non-planar surface shape.
The algorithm works for multiply connected surfaces as well. Due to the coarseness of
the triangles, the time spent in step (1) is less than that in steps (2) or (3). Both steps
(2) and (3) are O(N) operations, where N is the total number of triangles on the
interface at the end of the surface grid generation. Therefore the overall algorithm is
O(N). For reasons of numerical stability, one prefers to generate triangles with a good
aspect ratio. We achieved this goal by generating the coarse triangles with a good
aspect ratio in step (1) and dividing each triangle at the midpoint of its longest edge in
step (2). When a divided edge is a boundary edge, we also divided all triangles (on
other surfaces) which are connected to the bond which coincides with the boundary
edge. If the parametric map defining the surface has significant influence on the
aspect ratio or the size of the triangles, one should apply step (2) again after the
mapping. The overall operation is still O(N). The algorithm described above applies
to certain classes of interfaces for which a parametric mapping exists.

For surfaces of revolution, we apply a different algorithm. A surface of revolution is
defined by a curve in three dimensions, an axis of rotation and the angle of revolution.
The desired triangle size determines the bond length on the curve. Then bonds are
generated for the curve. The trace of a bond on the surface is a curved strip. The
curved strip will be covered by triangles and the vertices of the triangles lie on the
edges of the strip. We calculate the arc length of the curved edges of the strip, i.e.,
the arc length of the traces of the end points of the bonds on the surface of revolution.
From the arc length, we determine the positions of the vertices of the triangles. The
triangulation of the strip starts from the bond. The bond will be a (boundary) edge of
the first triangle on the strip. There are two points adjacent to the bond, one from each
of the edge curves of the strip. By choosing an adjacent point, we form one of two
possible triangles on the strip. We select the adjacent point which provides the better
aspect ratio. In the selected triangle, there is only one edge which cuts through the
strip. That edge will be one of the edges in the next triangle to be generated. Then in
the process of generating the second triangle, that edge plays the role of the bond in
the first triangle. We apply the same selection criterion recursively to obtain the next
triangle. This process is repeated until we reach the end of one of the curved edges of
the strip. We connect all points left on the other curved edge of the strip to the end
point of the finished edge of the strip to form the rest of the triangles. Then the strip is
completely covered by the triangles. This algorithm is O(N) also.

The combination of these two algorithms is sufficient to generate initial interfaces for
many problems we will consider. The physical quantities are initialized by the analytic
solution of the linearized governing equations in the case of fluid instability problems.
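
The strip triangulation for surfaces of revolution described above can be sketched as follows. This is only an illustration under stated assumptions: the names `aspect_quality` and `triangulate_strip` are hypothetical, the two curved edges of the strip are assumed to be given as already computed point lists, and the quality measure (4*sqrt(3)*area divided by the sum of squared edge lengths, equal to 1 for an equilateral triangle) is one common choice, since the text does not say how the aspect ratio is measured.

```python
from math import dist

def aspect_quality(p, q, r):
    """Assumed quality measure: 1 for an equilateral triangle, near 0 for a sliver."""
    a, b, c = dist(p, q), dist(q, r), dist(r, p)
    s = (a + b + c) / 2
    area = max(s * (s - a) * (s - b) * (s - c), 0.0) ** 0.5  # Heron's formula
    return 4 * 3 ** 0.5 * area / (a * a + b * b + c * c)

def triangulate_strip(left, right):
    """Cover a strip with triangles.

    left, right: point lists along the two curved edges of the strip;
    (left[0], right[0]) is the bond from which the triangulation starts.
    """
    i = j = 0
    triangles = []
    # Advance along whichever edge gives the better-shaped triangle.
    while i < len(left) - 1 and j < len(right) - 1:
        a, b = left[i], right[j]
        if aspect_quality(a, b, left[i + 1]) >= aspect_quality(a, b, right[j + 1]):
            triangles.append((a, b, left[i + 1]))
            i += 1
        else:
            triangles.append((a, b, right[j + 1]))
            j += 1
    # One edge is finished: connect the points left on the other edge
    # to the end point of the finished edge.
    while i < len(left) - 1:
        triangles.append((left[i], right[j], left[i + 1]))
        i += 1
    while j < len(right) - 1:
        triangles.append((left[i], right[j], right[j + 1]))
        j += 1
    return triangles
```

Each pass through a loop consumes one point, so the sketch produces len(left) + len(right) - 2 triangles in time linear in the number of points, in line with the O(N) count stated above.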


3.2 Volume Grid Generation


Now let us discuss the unstructured interface-fitting volume grid generation. This
is a method to generate an unstructured mesh in three dimensions (i.e., a set of
tetrahedra) in the mesh blocks cut by the interface. The interface is a collection
of outer and inner boundaries. In three dimensions, these boundaries are surfaces
of arbitrary shape, approximated by a set of triangles, which lie inside the regular
Cartesian grid and may divide the computational domain into any number of regions.
Grid generation is intended to fill with tetrahedra those rectangular blocks which are
cut by the interface. Its purpose is to allow interpolation of state values defined on
the interface and those defined on regular Cartesian nodes. In many computational
fluid problems, the state values on these two kinds of nodes are computed separately.
The essential feature of the interpolation algorithm is that interpolation averages only
combine states from a single component, or a single side of the interface.

The whole computational region is divided by grid lines, parallel to the coordinate
axes. One can view the region as being composed of mesh blocks (in two dimensions,
mesh rectangles). Unstructured grid generation takes place in a collection of mesh
blocks through which the interface cuts. Other blocks in the computational domain
are left intact, or more precisely, the grid in this latter case is nothing but the mesh
block itself. The name “hybrid grid” can be used: the unstructured grid (tetrahedra)
in the set of interface-influenced mesh blocks, and the structured grid (cube) in the
complementary set of blocks.

The major work concerned the unstructured grids. The main difficulty comes
from the fact that the interface can be of an arbitrary shape and that the algorithm
must be designed in such a way that it is independent of the interface. If Delaunay
triangulation [1] is used, more points than those already existing on the interface must
be added to preserve the geometrical shape of the surface. To achieve this goal, the
point distribution should be dense enough to specify the surface shape. This is
theoretically possible, but technically unattractive and impractical because of the
arbitrariness of the interface. For this reason, we chose not to use the Delaunay
triangulation method. New points are allowed in principle on the interface itself, but
not in the interior, or volume region. This restriction is based on the final usage of the
grid. As mentioned before, the grid is used for interpolation of state values between
two sets of states: those defined at points on the interface and those on the regular
Cartesian nodal points. Since the function values are defined only at the above
points, interpolation is needed to determine the value whenever a new point is added.
This is possible within any triangle of the surface. However, additional points would
tend to generate more tetrahedra and complicate the subsequent computation, so we
decided to make full use of the given points and to disallow the addition of any new
point.

The general approach, rather than a method, stems from the advancing front
method [13, 14]. As in the advancing front method, we share the same feature of
“advancing” the inner or outer boundaries forward by adding tetrahedra until the
region is filled. A base triangle is chosen from the front, then a vertex is determined by
some criterion to form a tetrahedron. What is different lies in the vertex determination
procedure. As said earlier, a vertex can only be one of the existing points; furthermore,
the tetrahedra thus generated should be valid, i.e., should not cut through other
existing tetrahedra or triangles.

The flow of the algorithm makes this clear. The interface is composed of
surfaces and each surface is composed of triangles. First, two kinds of intersection
points are determined: those of a surface triangle edge with a mesh block face, and
those of a mesh block edge with a triangle interior. These are zero-dimensional objects.
Second, from them the one-dimensional line segments on the mesh block faces and in
the mesh block interiors are computed. Then the two-dimensional triangles are
generated from the segments and, finally, the three-dimensional tetrahedra from the
two-dimensional triangles. Here are the main steps of the method:

Step 1. Zero-dimensional points are intersections of interfaces and blocks. Since
interfaces are composed of triangles, the points are either the intersections of a triangle
with a mesh block edge or of a triangle side with a mesh block face.

Step 2. One-dimensional segments are located in two places: on block faces and
in the interior of mesh blocks. On faces are the intersections of a triangle and a
rectangular face. One triangle will produce either one segment on a face or none at
all. In the interior of a mesh block, if there are any segments from a triangle, those
segments make a convex polygon, lying in the same plane as the triangle.

Step 3. Triangles are generated from the segments. At the start of this step,
the front consists of the sequence of straight line segments which form the convex
polygon or the connection of intersection points in Step 1 on the mesh block face.
During the generation process, whose detailed description is given later, any straight
line segment which is available to form an element side is termed active and kept in
a list, whereas any segment which is no longer active is removed from the list. This
process ceases when the list containing active segments becomes empty.

Step  4.  The  three  dimensional  tetrahedra  generation  strategy  is  a  direct  extension 
of  that  presented  above  for  triangles.  Instead  of  segments,  now  the  front  is  formed  by 
triangles.  The  front  is  updated  as  each  new  tetrahedron  is  generated  and  the  process 
terminates  when  the  front  is  empty. 
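
One of the two point types of Step 1, the intersection of a triangle side with a mesh block face, reduces to a segment against an axis-aligned rectangle. The following is a minimal sketch under that reading; the function name `edge_face_intersection` and its argument layout are hypothetical, and degenerate cases (a side lying in the plane of the face) are simply reported as no intersection.

```python
def edge_face_intersection(p, q, axis, coord, lo, hi):
    """Intersect the segment pq with one face of a mesh block.

    The face lies in the plane x[axis] == coord and is bounded in the other
    two coordinates by the block corners lo and hi (3-tuples).
    Returns the intersection point as a tuple, or None if there is none.
    """
    dp, dq = p[axis] - coord, q[axis] - coord
    if dp * dq > 0 or dp == dq:
        return None  # both end points on one side, or segment parallel to the face
    t = dp / (dp - dq)  # parameter of the crossing point along pq, in [0, 1]
    x = tuple(pi + t * (qi - pi) for pi, qi in zip(p, q))
    inside = all(lo[k] <= x[k] <= hi[k] for k in range(3) if k != axis)
    return x if inside else None

# A triangle side crossing the face x = 1 of the unit block:
hit = edge_face_intersection((0.0, 0.5, 0.5), (2.0, 0.5, 0.5),
                             0, 1.0, (0.0, 0.0, 0.0), (1.0, 1.0, 1.0))
```

The second point type, a mesh block edge against a triangle interior, would need a segment-triangle test instead and is not covered by this sketch.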

The  grid  generation  process  is  sketched  below: 

For the sake of simplicity, only the generation process in Step 3 is given. This is a
process in two dimensions. The process in three dimensions of Step 4 is essentially the
same, except for two substitutions: segment by triangle and triangle by tetrahedron.

G.0 Initially, all the segments are put into a list.

G.1 Select an active side from the list in the front as a base. In order to prevent
large elements from overshadowing the region of small elements, the side with the
smallest length in the active side list is chosen.

G.2 Take an existing point as the vertex. If the triangle made by this vertex
and the base is valid, i.e., does not intersect other triangles, this point is denoted as a
“candidate point” for this particular base. Of all such candidate points, choose the
one which (1) will produce the smallest number of new sides and (2), subject to
condition (1), has the shortest distance to the midpoint of the base.

G.3 Add new sides (if any) to the list. Delete any old side which has two triangles
attached to it.

G.4 If there are any sides left in the list, go to G.1.
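
Steps G.0 to G.4 can be sketched in two dimensions as follows. This is an illustrative reading of the steps, not the code used for the results: the function names are hypothetical, boundary sides are assumed to be given as directed index pairs with the region to be filled on their left, points are assumed to be in general position (no three collinear), and the validity test is the simple one written in the comments. It has only been exercised on convex polygons.

```python
from math import dist

def orient(a, b, c):
    """Twice the signed area of triangle abc (positive if counter-clockwise)."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])

def properly_cross(p, q, r, s):
    """True if segments pq and rs cross at a point interior to both."""
    return (orient(p, q, r) * orient(p, q, s) < 0
            and orient(r, s, p) * orient(r, s, q) < 0)

def advancing_front_2d(points, boundary):
    """points: list of (x, y); boundary: directed sides (i, j), region on the left."""
    front = set(boundary)                      # G.0: all sides start out active
    triangles = []
    while front:                               # G.4: repeat while sides remain
        # G.1: the shortest active side is the base
        i, j = min(front, key=lambda e: dist(points[e[0]], points[e[1]]))
        a, b = points[i], points[j]
        mid = ((a[0] + b[0]) / 2, (a[1] + b[1]) / 2)
        active = {v for e in front for v in e}
        best = None
        for k in active - {i, j}:              # G.2: only existing points qualify
            c = points[k]
            if orient(a, b, c) <= 0:
                continue                       # not on the unfilled side of the base
            if (i, k) in front or (k, j) in front:
                continue                       # triangle would lie outside the region
            if any(properly_cross(p, c, points[u], points[v])
                   for p in (a, b) for (u, v) in front):
                continue                       # a new side would cut the front
            if any(orient(a, b, points[m]) > 0 and orient(b, c, points[m]) > 0
                   and orient(c, a, points[m]) > 0 for m in active - {i, j, k}):
                continue                       # another active point lies inside
            new_sides = ((k, i) not in front) + ((j, k) not in front)
            score = (new_sides, dist(c, mid))  # fewest new sides, then nearest
            if best is None or score < best[0]:
                best = (score, k)
        if best is None:
            raise RuntimeError("no valid vertex for base %r" % ((i, j),))
        k = best[1]
        triangles.append((i, j, k))
        front.discard((i, j))                  # G.3: update the active list
        for closed, opened in (((k, i), (i, k)), ((j, k), (k, j))):
            if closed in front:
                front.discard(closed)          # side now has an element on both sides
            else:
                front.add(opened)              # a new active side
    return triangles
```

In this sketch a side is dropped from the list as soon as the region on its left is filled, which for interior sides is the moment the second triangle is attached and for boundary sides is the moment the first one is. For a convex polygon with n vertices the loop clips one corner per pass and returns n - 2 triangles.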

The  algorithm  was  written  for  general  interface  problems.  It  has  been  tested  on 
several  simple  interfaces.  Before  it  can  be  integrated  in  the  overall  computational 
code,  further  tests  are  needed  as  more  complicated  interfaces  become  available  from 
the  surface  generation  procedure  and  time-dependent  interface  dynamics. 

The volume grid construction is performed every time step. It must satisfy two
somewhat incompatible criteria: efficiency and robustness. Experience with
two-dimensional front tracking indicates that these criteria can be satisfied by use of
two (or more) algorithms. One algorithm must be fast, but is allowed to fail
occasionally. The other may be slow, but is only used when the first algorithm fails.
The volume grid construction proposed here appears to be robust but (relative to the
speed requirement of the application) may be slow. Other volume grid constructions,
not reported here, are under investigation; these are intended to be fast but not
necessarily robust.
Practical  experience  with  complex  dynamically  generated  fronts  will  be  needed  to 
assess  the  success  of  these  algorithms. 


4  Conclusions 

A previous study justifies the feasibility of performing parallel computations in three
dimensions from the point of view of available hardware and timing studies of
representative algorithms. This paper reaffirms this possibility from the perspective of
programmability by showing results.


Abstract Two dimensional, steady state, viscous, compressible flows with a streamline along which there is
a jump in the pressure are considered. The evidence for the existence of such flows is reviewed. In the case
of the linearized flow equations, linearized around a smooth ambient flow, detailed information concerning
the form of the discontinuity and its behavior near the left and right hand boundary is presented.

1.  Introduction  The  system  of  equations  governing  steady  state,  compressible,  viscous  flow  is  not  elliptic, 
because  the  continuity  equation  is  a  hyperbolic  equation  in  the  density.  Hence  there  arises  the  possibility 
that  there  are  flows  with  interior  discontinuities.  The  purpose  of  this  note  is  to  review  our  ongoing  work  on 
the  existence  and  properties  of  these  putative  flows. 

Although  existence  of  such  flows  has  yet  to  be  rigorously  established,  a  picture  of  such  flows  is  beginning 
to  emerge.  The  picture  includes  the  features  that  the  flows  do  not  appear  “spontaneously”  in  the  interior  of 
the  flow  field,  as  do  shock  waves.  Rather,  the  discontinuities  are  created  by  a  discontinuity  in  the  pressure  on 
an  inflow  portion  of  the  boundary.  Also,  the  curve  of  discontinuity  is  the  streamline  of  the  flow  that  emanates 
from  the  boundary  point  where  the  boundary  pressure  is  discontinuous,  and  (at  least  in  the  linearized  case), 
the  magnitude  of  the  pressure  jump  decays  as  one  moves  along  the  streamline  into  the  flow  region. 

We  shall  be  considering  the  system 

(1.1a)  $L_1(U,V,P) = -(2\mu+\lambda)U_{xx} - \mu U_{yy} - (\mu+\lambda)V_{xy} + (\rho U^2)_x + (\rho UV)_y + P_x = 0,$

(1.1b)  $L_2(U,V,P) = -(\mu+\lambda)U_{xy} - \mu V_{xx} - (2\mu+\lambda)V_{yy} + (\rho UV)_x + (\rho V^2)_y + P_y = 0,$

(1.1c)  $L_3(U,V,P) = (\rho U)_x + (\rho V)_y = 0,$

in a region $\Omega$ of the $xy$ plane. The quantities $U$, $V$ are the components of the velocity field, $P$ represents the
pressure, and $\rho$ represents the density. The latter two quantities are linked by the thermodynamic relations

(1.1d)  $P = P(\rho), \quad \rho = \rho(P).$

Thus, we are neglecting the effect of variations in internal energy on the flow. (We believe that including
internal energy as a variable in the problem would not affect the conclusions of this study.) The system (1.1)
must be supplemented by boundary conditions to determine the solution. Let $\Gamma$ denote the boundary of $\Omega$,
and let $n$ denote the outward pointing unit normal to $\Gamma$. Our boundary conditions are then

(1.2a)  $U$, $V$ given on $\Gamma$,

(1.2b)  $P$ given on $\Gamma_{in} := \{(x,y) \in \Gamma : [U,V]^T \cdot n < 0\}$.

We note that the boundary region $\Gamma_{in}$ depends on the specified boundary values of $U$ and $V$. If, for example,
$U = V = 0$ on $\Gamma$, then $\Gamma_{in}$ is empty, and there is no specification of pressure. In such a case, according to
our understanding there will be no discontinuous solutions of (1.1), (1.2).

We shall consider, in addition to the system (1.1), the linearization of this system about a given smooth
ambient flow. Let $U$, $V$, $P$, and $\rho$ be the ambient flow; that is, suppose these functions satisfy (1.1a-d). Let
$u$, $v$, and $p$ be the linearized dependent variables. We shall regard $\rho$ as a function of $P$ according to (1.1d),
and we set $\rho' = d\rho/dP$. The linearized equations are

(1.3a)  $L_1(u,v,p) = -(2\mu+\lambda)u_{xx} - \mu u_{yy} - (\mu+\lambda)v_{xy} + \rho U u_x + \rho V u_y + p_x + A = 0,$

(1.3b)  $L_2(u,v,p) = -(\mu+\lambda)u_{xy} - \mu v_{xx} - (2\mu+\lambda)v_{yy} + \rho U v_x + \rho V v_y + p_y + B = 0,$

(1.3c)  $L_3(u,v,p) = \rho' U p_x + \rho' V p_y + \rho(u_x + v_y) + C = 0.$

In these formulas, $\rho(P)$ and $\rho'(P)$ are evaluated at the ambient pressure, and $A$, $B$, and $C$ contain
undifferentiated terms involving $u$, $v$, and $p$. For example, $C = \rho_x u + \rho_y v + (U_x + V_y)\rho' p$. Each term of $A$, $B$, or $C$
contains derivatives of the ambient variables. Thus, in the case of uniform ambient flow, these lowest order
terms all vanish.

* Supported in part by the U. S. Army Research Office.

The appropriate boundary conditions for (1.3) are the specification of $u$ and $v$ on $\Gamma$, and the
specification of $p$ on $\Gamma_{in}$. Note that $\Gamma_{in}$ depends on the ambient flow, the flow about which the linearization is
taken. If the ambient flow vanishes on $\Gamma$, $\Gamma_{in} = \emptyset$ and there is no specification of pressure. In such a case,
our theory will not give discontinuous solutions; the discontinuities are produced by a discontinuity in the
pressure that is specified on $\Gamma_{in}$.

Some results on the existence of solutions to (1.1) are contained in Valli [5] and Veiga [2]. They show
that if the boundary data is small and smooth, there is a smooth solution to the system. The boundary
condition (1.2) is implicitly contained in the treatment by Geymonat [9] of the linearized transient problem.
D. Hoff [3,4] has considered the time dependent, viscous, compressible flow equations in one space dimension.
He shows that there are indeed solutions for which the pressure has a jump discontinuity; however he also
shows that this discontinuity decays as time goes to infinity.

2. Jump conditions Suppose the system (1.1) has a solution that takes a jump across a curve $C$. Are there
jump conditions, analogous to the Rankine-Hugoniot conditions, that must hold on the curve? To answer
this question we must first define a weak solution to the system (1.1) in a domain $\Omega$. Let the smooth curve $C$
divide $\Omega$ into two subdomains, $\Omega_1$ and $\Omega_2$. Suppose $[U,V,P]$ are smooth functions in each of the subdomains
$\Omega_i$. We say that $[U,V,P]$ is a weak solution to (1.1) provided that for each $\phi \in C_0^\infty(\Omega)$ one has the integral
identities

$$\iint_\Omega \{ U[-(2\mu+\lambda)\phi_{xx} - \mu\phi_{yy}] - (\mu+\lambda)V\phi_{xy} - \rho U^2\phi_x - \rho UV\phi_y - P\phi_x \}\,dx\,dy = 0,$$

$$\iint_\Omega \{ -(\mu+\lambda)U\phi_{xy} + V[-\mu\phi_{xx} - (2\mu+\lambda)\phi_{yy}] - \rho UV\phi_x - \rho V^2\phi_y - P\phi_y \}\,dx\,dy = 0,$$

$$\iint_\Omega \{ \rho U\phi_x + \rho V\phi_y \}\,dx\,dy = 0.$$

Let $x(s), y(s)$ be a parametrization of a curve $C$ of discontinuity, let $\dot{\ } = d/ds$, and let $\delta$ denote the jump in
a quantity across $C$. As stated in [8], the jump conditions are that $C$ is a streamline of the flow and that

(J1)  $U\dot y = V\dot x,$

(J2)  $\delta U = \delta V = \delta\dot U = \delta\dot V = 0,$

(J3)  $(2\mu+\lambda)\,\dot y\,\delta U_x - \mu\,\dot x\,\delta U_y + (\mu+\lambda)\,\dot y\,\delta V_y = \dot y\,\delta P,$

(J4)  $\mu\,\dot y\,\delta V_x - (2\mu+\lambda)\,\dot x\,\delta V_y + (\mu+\lambda)\,\dot y\,\delta U_y = -\dot x\,\delta P.$

The condition (J1) means that $C$ is a streamline of the flow. The conditions (J3) and (J4) say that the net
normal fluid stress acting on an infinitesimal element of fluid $e$ that straddles the curve $C$ is balanced by the
net hydrostatic pressure acting on $e$. Although the width of $e$ is infinitesimal, the jump in pressure creates a
non-zero net hydrostatic pressure, which in turn requires a non-zero net normal stress, and hence a jump in
the derivatives of $U$ and $V$.

The conditions (J1) and (J2) imply that $\dot x\,\delta U_x + \dot y\,\delta U_y = 0$, $\dot x\,\delta V_x + \dot y\,\delta V_y = 0$. These two equations,
together with (J3) and (J4), give 4 equations for the jumps in the 4 first derivatives of $U$ and $V$. Solving
these equations and setting $\dot x = U$, $\dot y = V$, we obtain

$$\delta U_x = \frac{V^2}{(2\mu+\lambda)(U^2+V^2)}\,\delta P,$$

$$\delta U_y = \delta V_x = \frac{-UV}{(2\mu+\lambda)(U^2+V^2)}\,\delta P,$$

$$\delta V_y = \frac{U^2}{(2\mu+\lambda)(U^2+V^2)}\,\delta P.$$

The jump conditions (J1) - (J4) are derived in [8] for the system (1.1). Similar jump conditions are
derived in [7] for the linearized system (1.3).
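
The solved jumps can be checked by direct substitution. The sketch below does this in exact rational arithmetic; the function names are hypothetical, and the four equations are taken to be the two tangential relations together with (J3) and (J4), with $\dot x = U$ and $\dot y = V$.

```python
from fractions import Fraction

def derivative_jumps(mu, lam, U, V, dP):
    """Jumps (dUx, dUy, dVx, dVy) in the first derivatives, from the solved formulas."""
    k = dP / ((2 * mu + lam) * (U * U + V * V))
    return V * V * k, -U * V * k, -U * V * k, U * U * k

def jump_residuals(mu, lam, U, V, dP):
    """Residuals of the four linear equations; all should vanish."""
    dUx, dUy, dVx, dVy = derivative_jumps(mu, lam, U, V, dP)
    xdot, ydot = U, V  # the curve is a streamline, parametrized so that (x', y') = (U, V)
    return (
        xdot * dUx + ydot * dUy,                      # tangential derivative of U is continuous
        xdot * dVx + ydot * dVy,                      # tangential derivative of V is continuous
        (2 * mu + lam) * ydot * dUx - mu * xdot * dUy
        + (mu + lam) * ydot * dVy - ydot * dP,        # (J3)
        mu * ydot * dVx - (2 * mu + lam) * xdot * dVy
        + (mu + lam) * ydot * dUy + xdot * dP,        # (J4)
    )

# Arbitrary sample values, chosen only for the check:
residuals = jump_residuals(Fraction(3, 2), Fraction(-1, 3), Fraction(2),
                           Fraction(5, 7), Fraction(4, 9))
```

With rational inputs the residuals come out exactly zero, so the check does not depend on floating-point tolerances.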


3. Nonlinear analysis In [1] we have constructed a simplification of the system (1.1) that still retains the
“elliptic-hyperbolic” character of the original flow equations, and we have constructed an existence theorem
for a discontinuous solution of the simplified system. Here, we briefly describe the results of this analysis.
To obtain the simplified system we set $\mu = -\lambda = 1$ and we drop the convective terms in (1.1a,b). More
crucially, we replace the continuity equation by a modified “continuity” equation, $U\rho_x + V\rho_y = 0$. Thus,
we have dropped the term $\rho U_x + \rho V_y$ from the continuity equation. This modified “continuity” equation
has no physical meaning. Our only justification in considering it is that the elliptic-hyperbolic character of
the system is unchanged, and that we are able to demonstrate the existence of a solution of the modified
system with an interior discontinuity. With the assumption that the system is barotropic, the density
$\rho = \rho(P)$ is a function of pressure only and the system can be simplified further. Writing $\rho_x = (d\rho/dP)P_x$,
$\rho_y = (d\rho/dP)P_y$, we obtain from the modified form of (1.1c) the equation $UP_x + VP_y = 0$. We make a further
notational change. The solution that we obtain will be such that the flow is close to uniform flow in the $x$
direction. We therefore replace the unknown $U$ by $1 + U$. This leaves the modified momentum equations
unchanged. The modified compressible flow equations studied in this paper are therefore


(3.1a)  $-U_{xx} - U_{yy} + P_x = 0,$

(3.1b)  $-V_{xx} - V_{yy} + P_y = 0,$

(3.1c)  $(1+U)P_x + VP_y = 0.$

The system (3.1) is considered in the strip $D = (0,a) \times (-\infty,\infty)$. The width $a$ is chosen small
enough to satisfy certain inequalities. We impose zero boundary conditions for $U$ and $V$ on the sides of the
strip $D$, and we impose the boundary condition for the pressure on the side of the strip $D$ through which
the flow enters. Thus, we are led to the boundary conditions:

(3.2a)  $U(0,y) = U(a,y) = 0,$

(3.2b)  $V(0,y) = V(a,y) = 0,$

(3.2c)  $P(0,y) = P_0(y).$

The function $P_0(y)$ is chosen to have a simple jump discontinuity at $y = 0$, to vanish outside $[0,a]$, and to
be smooth for $y > 0$. We let $\delta P_0 = P_0(+0)$ denote the jump in $P_0(y)$ at $y = 0$. We show in [1] that the
system (3.1), (3.2) has a solution with the property that $P(x,y)$ is discontinuous across a curve $C$. The curve
$C$ is the characteristic of (3.1c) emanating from the point $(0,0)$, which is the point of discontinuity of $P_0$.
The first derivatives of $U$ and $V$ undergo jump discontinuities across $C$, and satisfy certain jump conditions
analogous to (J1) - (J4).

The existence proof reformulates the problem as a fixed point mapping $T$ in a certain Banach space $A$
and uses the Schauder fixed point theorem. To describe $A$, let $C^0(D)$ be the space of continuous functions
in $D$. Let

$$A = \{(U,V) : U \in C^0(D),\ V \in C^0(D),\ \|U\| + \|V\| < \infty\},$$

where

$$\|U\| = \sup_{(x,y)\in D} |U(x,y)| + \sup_{(x,y),(x,\bar y)\in D,\ y\ne\bar y} \frac{|U(x,y) - U(x,\bar y)|}{|y-\bar y|\,(|\ln|y-\bar y|| + 1)}$$

and similarly for $\|V\|$. It is easy to see that $A$ is a Banach space with the norm $\|(U,V)\| = \|U\| + \|V\|$.
The nonlinear map $T$ is defined as the composition of two maps, $T = T_2 \circ T_1$, where $T_1$ and $T_2$ are defined
as follows. Let $T_1 : (U,V) \to P$, where $P$ is obtained by solving

(3.3a)  $(1+U)P_x + VP_y = 0, \quad P(0,y) = P_0(y),$

and let $T_2 : P \to (U,V)$, where $U$, $V$ satisfy

(3.3b)  $-U_{xx} - U_{yy} + P_x = 0, \quad U(0,y) = U(a,y) = 0,$
        $-V_{xx} - V_{yy} + P_y = 0, \quad V(0,y) = V(a,y) = 0.$

The particular form of the norm $\|U\|$ reflects the elliptic-hyperbolic character of the system. The
modulus of continuity of $U$ in the $y$ variable is $r(|\ln r| + 1)$. This modulus of continuity is just enough to
guarantee the solvability of the hyperbolic equation (3.3a) and to provide a priori estimates that arise from
this solution. (The finiteness of $\|U\|$ and $\|V\|$ implies that the characteristic equation of (3.3a) satisfies the
Osgood criterion [6, Chapter III, Corollary 6.2], and hence is uniquely solvable.) On the other hand, this
modulus of continuity is provided by estimates for the weakly singular integrals that occur in the solution
of the elliptic equations (3.1b), and it seems that no better modulus of continuity arises from these weakly
singular integrals.
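
The hyperbolic half of the fixed point map, the step that produces $P$ from $(U,V)$ by solving $(1+U)P_x + VP_y = 0$ with $P(0,y) = P_0(y)$, can be illustrated numerically: $P$ is constant along the characteristics $dy/dx = V/(1+U)$, so its value at a point is the inflow value at the foot of the characteristic through that point. The sketch below is an illustration only; the function name and the sample fields are hypothetical, the velocity fields are passed as plain Python callables, and a classical fourth-order Runge-Kutta step stands in for whatever construction the proof uses.

```python
def transport_pressure(U, V, P0, x, y, steps=200):
    """Value P(x, y) of the solution of (1 + U) P_x + V P_y = 0, P(0, y) = P0(y).

    The characteristic through (x, y) is traced back to the inflow side x = 0;
    U and V are callables of (x, y) with 1 + U != 0 along the way.
    """
    def slope(s, z):
        return V(s, z) / (1.0 + U(s, z))

    h = -x / steps                     # negative step: march toward x = 0
    s, z = x, y
    for _ in range(steps):
        k1 = slope(s, z)
        k2 = slope(s + h / 2, z + h * k1 / 2)
        k3 = slope(s + h / 2, z + h * k2 / 2)
        k4 = slope(s + h, z + h * k3)
        z += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        s += h
    return P0(z)

# Hypothetical sample data: uniform fields and inflow pressure with a unit jump at y = 0.
U_field = lambda x, y: 0.0
V_field = lambda x, y: 0.5
P_inflow = lambda y: 1.0 if y >= 0 else 0.0
above = transport_pressure(U_field, V_field, P_inflow, 1.0, 0.6)
below = transport_pressure(U_field, V_field, P_inflow, 1.0, 0.4)
```

For these sample fields the characteristic from $(0,0)$ is the line $y = 0.5x$, and the two evaluation points lie on opposite sides of it, so the jump in the inflow pressure is carried into the interior along that line.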

4.  Linear  analysis  -  derivation  of  the  transformed  equations  In  [7]  we  have  exhibited  a  solution  to 
the  linearized  system  (1.3)  for  which  there  is  an  internal  pressure  discontinuity.  We  are  at  present  carrying 
this  analysis  further  to  give  a  detailed  description  of  the  discontinuity  of  the  linear  problem.  We  shall  now 
describe  this  work. 

We consider the linearization around a constant ambient flow, $U = \text{const} > 0$, $V = \text{const}$, $P = \text{const}$. In
this case the quantities $A = B = C = 0$. In addition we drop the convective terms in (1.3a,b), and we set
$\rho(P) = \rho'(P) = 1$. As a result we obtain the system

(4.1)  $-\Delta u + p_x = 0,$
       $-\Delta v + p_y = 0,$
       $U p_x + V p_y + u_x + v_y = 0.$

We consider the system (4.1) in the strip $0 < x < a$, $-\infty < y < \infty$, with the boundary conditions

(4.2)  $u(0,y) = u_0(y), \quad u(a,y) = u_a(y),$
       $v(0,y) = v_0(y), \quad v(a,y) = v_a(y),$
       $p(0,y) = p_0(y).$


We suppose that $u_0$, $v_0$, and $p_0$ are smooth functions except possibly at $y = 0$. Then the line $y = VU^{-1}x$ is
a possible discontinuity of the solution, and the solution satisfies the following jump conditions across this
line:

(4.3)  $\delta u = 0, \quad \delta v = 0,$
       $\delta u_x = \frac{V^2}{U^2+V^2}\,\delta p, \quad \delta u_y = \frac{-UV}{U^2+V^2}\,\delta p,$
       $\delta v_x = \frac{-UV}{U^2+V^2}\,\delta p, \quad \delta v_y = \frac{U^2}{U^2+V^2}\,\delta p.$

To solve the problem (4.1), (4.2) we take the Fourier transform with respect to $y$. For this, we must
recognize that the first derivatives of $p$ and the second derivatives of $u$ and $v$ are defined almost everywhere,
but are not the corresponding derivatives of $u$, $v$, and $p$ respectively in the distributional sense. To calculate
the Fourier transforms of these derivatives, we must take into account the jumps across the line $y = VU^{-1}x$.
We use the following fact, established through an integration by parts: if $q(y)$ is a smooth function for $y > y_0$
and for $y < y_0$, with one-sided limits at $y = y_0$, and if $q$ and $q'$ decay suitably at $y = \pm\infty$, where $q'(y)$
denotes the function defined for all values of $y$ except $y_0$ by the derivative of $q$, then

(4.4)  $(\mathcal{F}q')(t) = it\,\hat q(t) - \frac{1}{\sqrt{2\pi}}\,e^{-ity_0}\,[q(y_0+0) - q(y_0-0)] = it\,\hat q(t) - \frac{1}{\sqrt{2\pi}}\,e^{-ity_0}\,\delta q(y_0).$
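
A one-line example may make (4.4) concrete. Assume the transform convention $\hat q(t) = (2\pi)^{-1/2}\int q(y)e^{-ity}\,dy$, and take the hypothetical test function $q(y) = e^{-y}$ for $y > 0$, $q(y) = 0$ for $y < 0$, so that $y_0 = 0$ and $\delta q(0) = 1$. Away from the jump $q' = -q$, and both sides of (4.4) can be computed directly:

```latex
\begin{align*}
\hat q(t) &= \frac{1}{\sqrt{2\pi}}\int_0^\infty e^{-y}e^{-ity}\,dy
           = \frac{1}{\sqrt{2\pi}}\,\frac{1}{1+it},\\
(\mathcal{F}q')(t) &= -\hat q(t) = -\frac{1}{\sqrt{2\pi}}\,\frac{1}{1+it},\\
it\,\hat q(t) - \frac{1}{\sqrt{2\pi}}\,e^{-it\cdot 0}\,\delta q(0)
          &= \frac{1}{\sqrt{2\pi}}\left(\frac{it}{1+it} - 1\right)
           = -\frac{1}{\sqrt{2\pi}}\,\frac{1}{1+it}.
\end{align*}
```

The two sides agree, and dropping the jump term would leave the spurious value $it\,\hat q(t)$.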

Suppose $w(x,y)$ is smooth everywhere in $S$ except on the line $y = VU^{-1}x$. Then from (4.4) we obtain

(4.5)  $(\mathcal{F}w_y)(x,t) = it\,\hat w(x,t) - \beta\,\delta w(x),$

where we have set

$$(\delta w)(x) = w(x, VU^{-1}x + 0) - w(x, VU^{-1}x - 0), \qquad \beta = \frac{1}{\sqrt{2\pi}}\,e^{-itVU^{-1}x}.$$

A similar formula may be obtained for $\mathcal{F}w_x$ in the following way. We write

$$\hat w(x,t) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{VU^{-1}x} w(x,y)\,e^{-ity}\,dy + \frac{1}{\sqrt{2\pi}}\int_{VU^{-1}x}^{\infty} w(x,y)\,e^{-ity}\,dy.$$

Differentiating with respect to $x$ we obtain

(4.6)  $(\mathcal{F}w_x)(x,t) = \hat w_x(x,t) + VU^{-1}\beta\,\delta w(x).$

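Using (4.5) and (4.6) we obtain the transformed equations

$$-\hat u_{xx} + t^2\hat u + \hat p_x = \beta\{VU^{-1}\delta u_x - \delta u_y - VU^{-1}\delta p\},$$
$$-\hat v_{xx} + t^2\hat v + it\,\hat p = \beta\{VU^{-1}\delta v_x - \delta v_y + \delta p\},$$
$$U\hat p_x + itV\hat p + \hat u_x + it\,\hat v = 0.$$

Inserting the jump conditions (4.3) we obtain the same transformed equations as in the case of no jump:

(4.7)  $-\hat u_{xx} + t^2\hat u + \hat p_x = 0,$
       $-\hat v_{xx} + t^2\hat v + it\,\hat p = 0,$
       $U\hat p_x + itV\hat p + \hat u_x + it\,\hat v = 0,$

with the boundary conditions

(4.8)  $\hat u(0,t) = \hat u_0(t), \quad \hat u(a,t) = \hat u_a(t),$
       $\hat v(0,t) = \hat v_0(t), \quad \hat v(a,t) = \hat v_a(t),$
       $\hat p(0,t) = \hat p_0(t).$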
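
The disappearance of the jump terms can be verified in two lines. Using the jump relations $\delta u_x = V^2\delta p/(U^2+V^2)$, $\delta u_y = \delta v_x = -UV\,\delta p/(U^2+V^2)$ and $\delta v_y = U^2\delta p/(U^2+V^2)$, the two brackets multiplying the common factor from the Fourier transform of the jump become

```latex
\begin{align*}
VU^{-1}\,\delta u_x - \delta u_y - VU^{-1}\,\delta p
  &= \frac{V}{U}\cdot\frac{V^2 + U^2 - (U^2+V^2)}{U^2+V^2}\,\delta p = 0,\\
VU^{-1}\,\delta v_x - \delta v_y + \delta p
  &= \frac{-V^2 - U^2 + (U^2+V^2)}{U^2+V^2}\,\delta p = 0.
\end{align*}
```

In the continuity equation the two contributions $U\cdot VU^{-1}\,\delta p$ and $-V\,\delta p$ cancel without using the jump relations at all, which is why its transform carries no jump term.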
5. Linear analysis - solution of the transformed equations We seek a solution of the problem (4.7)
with an $x$ dependence of the form $e^{r(t)x}$. This leads to the quintic equation for $r(t)$,

$$(r^2 - t^2)^2\,(rU + itV + 1) = 0,$$

with roots

$$r = \pm t,\ \pm t,\ -\frac{1 + itV}{U}.$$
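
The quintic is the determinant of the 3 by 3 algebraic system obtained by inserting $(\hat u, \hat v, \hat p) = (a, b, c)\,e^{rx}$ into the transformed equations. The following sketch checks this numerically at an arbitrary complex $r$; the function names are hypothetical and the sample values are chosen only for the check.

```python
def symbol_determinant(r, t, U, V):
    """Determinant of the system for (a, b, c) when (u, v, p) = (a, b, c) exp(r x)."""
    k = t * t - r * r
    m = [[k, 0, r],                      # -u_xx + t^2 u + p_x = 0
         [0, k, 1j * t],                 # -v_xx + t^2 v + i t p = 0
         [r, 1j * t, r * U + 1j * t * V]]  # U p_x + i t V p + u_x + i t v = 0
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def quintic(r, t, U, V):
    """The factored form (r^2 - t^2)^2 (r U + i t V + 1)."""
    return (r * r - t * t) ** 2 * (r * U + 1j * t * V + 1)

t, U, V = 1.7, 0.9, -0.4
r = 0.3 + 0.2j
difference = abs(symbol_determinant(r, t, U, V) - quintic(r, t, U, V))
simple_root = -(1 + 1j * t * V) / U
```

The determinant and the factored form agree to rounding error, and the factored form vanishes at $r = \pm t$ and at `simple_root`.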


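This also leads to the general solution of (4.7) in the form

(5.1)  $[\hat u, \hat v, \hat p]^T = \sum_{j=1}^{5} c_j z_j,$

where the $z_j$ are given by

$$z_1 = \begin{bmatrix} 1 \\ i \\ 0 \end{bmatrix} e^{tx}, \quad
  z_2 = \begin{bmatrix} -1 \\ i \\ 0 \end{bmatrix} e^{-tx}, \quad
  z_3 = \begin{bmatrix} \alpha \\ \beta \\ 1 \end{bmatrix} e^{-(1+itV)U^{-1}x},$$

$$z_4 = \begin{bmatrix} -1 - 2t(U+iV) + tx \\ itx \\ 2t \end{bmatrix} e^{tx}, \quad
  z_5 = \begin{bmatrix} 1 - 2t(U-iV) + tx \\ -itx \\ 2t \end{bmatrix} e^{-tx},$$

$$\alpha = \frac{-U(1+itV)}{1 + 2itV - t^2(U^2+V^2)}, \qquad
  \beta = \frac{itU^2}{1 + 2itV - t^2(U^2+V^2)}.$$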
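
Each of the five modes can be checked against the transformed equations without symbolic software, because a function of the form $(c_0 + c_1 x)e^{rx}$ stays in that form under differentiation. The sketch below represents each component by its coefficient pair $(c_0, c_1)$; the helper names are hypothetical, and the first two modes are taken as $(1, i, 0)e^{tx}$ and $(-1, i, 0)e^{-tx}$, which is an assumed normalization.

```python
def deriv(c, r):
    """x-derivative of (c[0] + c[1] x) exp(r x), returned as a coefficient pair."""
    return (c[1] + r * c[0], r * c[1])

def mode_residuals(u, v, p, r, t, U, V):
    """Coefficients of the three left-hand sides of the transformed system for one mode."""
    ux, px = deriv(u, r), deriv(p, r)
    uxx, vxx = deriv(ux, r), deriv(deriv(v, r), r)
    e1 = [-uxx[k] + t * t * u[k] + px[k] for k in range(2)]
    e2 = [-vxx[k] + t * t * v[k] + 1j * t * p[k] for k in range(2)]
    e3 = [U * px[k] + 1j * t * V * p[k] + ux[k] + 1j * t * v[k] for k in range(2)]
    return e1 + e2 + e3

def modes(t, U, V):
    """The five modes as (u, v, p, r) with coefficient pairs for u, v, p."""
    gamma = (1 + 1j * t * V) / U
    den = 1 + 2j * t * V - t * t * (U * U + V * V)
    alpha, beta = -U * (1 + 1j * t * V) / den, 1j * t * U * U / den
    return [
        ((1, 0), (1j, 0), (0, 0), t),
        ((-1, 0), (1j, 0), (0, 0), -t),
        ((alpha, 0), (beta, 0), (1, 0), -gamma),
        ((-1 - 2 * t * (U + 1j * V), t), (0, 1j * t), (2 * t, 0), t),
        ((1 - 2 * t * (U - 1j * V), t), (0, -1j * t), (2 * t, 0), -t),
    ]

t, U, V = 1.3, 0.7, 0.4
worst = max(abs(z) for u, v, p, r in modes(t, U, V)
            for z in mode_residuals(u, v, p, r, t, U, V))
```

The largest residual is at rounding level for the sample values, which supports the reading of the two double-root modes as carrying a term linear in $x$.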

The coefficients $c_j$ are chosen to satisfy the boundary conditions (4.8). Imposing these boundary conditions,
we are led to the linear system

$$B\,[c_1, c_2, c_3, c_4, c_5]^T = [\hat u_0, \hat v_0, \hat p_0, \hat u_a, \hat v_a]^T,$$

where

$$B = \begin{bmatrix}
1 & -1 & iUVDt^{-1} & -2(U+iV)t & -2(U-iV)t \\
i & i & -iU^2Dt^{-1} & 0 & 0 \\
0 & 0 & 1 & 2t & 2t \\
p & -p^{-1} & iUVDt^{-1}q & [-2(U+iV)t + at]\,p & [-2(U-iV)t + at]\,p^{-1} \\
ip & ip^{-1} & -iU^2Dt^{-1}q & itap & -itap^{-1}
\end{bmatrix}$$

and where

$$p = e^{at}, \quad q = e^{-\gamma a}, \quad \gamma = (1+itV)U^{-1}, \quad D = (U^2+V^2)^{-1}.$$

[...] using Mathematica. Let $A = B^{-1} = [a_{ij}]$. Because of the fundamental importance
of this matrix for the linearized compressible flow equations, we record the following asymptotic forms for
the matrix entries, valid for large values of $|t|$. For $t \gg 0$, we get

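$a_{11} = \tfrac{1}{2}UD^2qt^{-1}p^{-1}(2U - a)(U + iV)$
$a_{12} = -\tfrac{1}{2}iUD^2qt^{-1}p^{-1}(2U - a)(U + iV)$
$a_{13} = \tfrac{1}{2}UDqt^{-1}p^{-1}(2U - a)$
$a_{14} = \tfrac{1}{2}aDp^{-1}(U - iV)$
$a_{15} = -\tfrac{1}{2}iDp^{-1}[2(U^2 + V^2) - a(U - iV)]$

$a_{21} = U^2D^2t^{-1}(U + iV)$
$a_{22} = -i$
$a_{23} = U^2Dt^{-1}$
$a_{24} = -\tfrac{1}{2}aD(U - iV)p^{-1}$
$a_{25} = \tfrac{1}{2}iD[2(U^2 + V^2) - a(U - iV)]p^{-1}$

$a_{31} = D(U + iV)$
$a_{32} = -iD(U + iV)$
$a_{33} = 1$
$a_{34} = -Dp^{-1}(a + 2iV)$
$a_{35} = iDp^{-1}(2U - a)$

$a_{41} = \tfrac{1}{2}UD^2qp^{-1}t^{-2}(U + iV)$
$a_{42} = -\tfrac{1}{2}iUD^2qp^{-1}t^{-2}(U + iV)$
$a_{43} = \tfrac{1}{2}UDqp^{-1}t^{-2}$
$a_{44} = -\tfrac{1}{2}Dt^{-1}p^{-1}(U - iV)$
$a_{45} = -\tfrac{1}{2}iDt^{-1}p^{-1}(U - iV)$

$a_{51} = -\tfrac{1}{2}Dt^{-1}(U + iV)$
$a_{52} = \tfrac{1}{2}iDt^{-1}(U + iV)$
$a_{53} = -\tfrac{1}{2}UDt^{-2}$
$a_{54} = \tfrac{1}{2}Dt^{-1}p^{-1}(U + iV + a)$
$a_{55} = -\tfrac{1}{2}iDt^{-1}p^{-1}(U + iV - a).$

Similarly, for $t \ll 0$, we get

$a_{11} = U^2D^2(U - iV)t^{-1}$
$a_{12} = -i$
$a_{13} = U^2Dt^{-1}$
$a_{14} = \tfrac{1}{2}aD(U + iV)p$
$a_{15} = \tfrac{1}{2}iD[2(U^2 + V^2) - a(U + iV)]p$

$a_{21} = \tfrac{1}{2}UD^2(2U - a)(U - iV)qt^{-1}p$
$a_{22} = \tfrac{1}{2}iUD^2(2U - a)(U - iV)qt^{-1}p$
$a_{23} = \tfrac{1}{2}UD(2U - a)qt^{-1}p$
$a_{24} = -\tfrac{1}{2}aD(U + iV)p$
$a_{25} = -\tfrac{1}{2}iD[2(U^2 + V^2) - a(U + iV)]p$

$a_{31} = D(U - iV)$
$a_{32} = iD(U - iV)$
$a_{33} = 1$
$a_{34} = D(2iV - a)p$
$a_{35} = -iD(2U - a)p$

$a_{41} = -\tfrac{1}{2}D(U - iV)t^{-1}$
$a_{42} = -\tfrac{1}{2}iD(U - iV)t^{-1}$
$a_{43} = \tfrac{1}{2}UDt^{-2}$
$a_{44} = \tfrac{1}{2}D(U - iV + a)t^{-1}p$
$a_{45} = \tfrac{1}{2}iD(U - iV - a)t^{-1}p$

$a_{51} = -\tfrac{1}{2}UD^2(U - iV)qt^{-2}p$
$a_{52} = -\tfrac{1}{2}iUD^2(U - iV)qt^{-2}p$
$a_{53} = -\tfrac{1}{2}UDqt^{-2}p$
$a_{54} = -\tfrac{1}{2}D(U + iV)t^{-1}p$
$a_{55} = \tfrac{1}{2}iD(U + iV)t^{-1}p.$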

We illustrate the use of these formulas by considering three special cases.

Case 1:  $p_0 \neq 0$, $u_0 = v_0 = u_a = v_a = 0$.

In this case, the solution to (4.7), (4.8) is given by

$$\hat{u} = \{a_{13}e^{tx} - a_{23}e^{-tx} + \alpha a_{33}e^{-(1+itV)U^{-1}x} + [-1 - 2t(U + iV) + tx]\,a_{43}e^{tx} + [1 - 2t(U - iV) + tx]\,a_{53}e^{-tx}\}\,\hat{p}_0,$$

$$\hat{v} = \{ia_{13}e^{tx} + ia_{23}e^{-tx} + \beta a_{33}e^{-(1+itV)U^{-1}x} + itx\,a_{43}e^{tx} - itx\,a_{53}e^{-tx}\}\,\hat{p}_0,$$

$$\hat{p} = \{a_{33}e^{-(1+itV)U^{-1}x} + 2t\,a_{43}e^{tx} + 2t\,a_{53}e^{-tx}\}\,\hat{p}_0.$$

Using the above asymptotic formulas, we obtain asymptotic expressions for $\hat{u}$, $\hat{v}$, and $\hat{p}$. These expressions are arranged into three groups, which we denote the discontinuous part, the left hand part, and the right hand part. In this way we arrive at the formulas

$$\hat{u} = \hat{u}_D + \hat{u}_L + \hat{u}_R + \hat{u}_{rem},$$
$$\hat{v} = \hat{v}_D + \hat{v}_L + \hat{v}_R + \hat{v}_{rem},$$
$$\hat{p} = \hat{p}_D + \hat{p}_L + \hat{p}_R + \hat{p}_{rem},$$


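where

$$\hat{u}_D(x,t) := UVD\,\frac{it}{t^2 + 1}\,e^{-(1+itV)U^{-1}x}\,\hat{p}_0,$$

$$\hat{u}_L(x,t) := -\tfrac{1}{2}UD\left\{\frac{x}{\sqrt{t^2 + 1}} + 2V\frac{it}{t^2 + 1}\right\}e^{-x\sqrt{t^2+1}}\,\hat{p}_0,$$

$$\hat{u}_R(x,t) := -\tfrac{1}{2}UDE\left\{\frac{a - x}{\sqrt{t^2 + 1}} + 2V\frac{it}{t^2 + 1}\right\}e^{-iaVU^{-1}t}\,e^{-(a-x)\sqrt{t^2+1}}\,\hat{p}_0,$$

$$\hat{v}_D(x,t) := -U^2D\,\frac{it}{t^2 + 1}\,e^{-(1+itV)U^{-1}x}\,\hat{p}_0,$$

$$\hat{v}_L(x,t) := \tfrac{1}{2}UD(2U + x)\,\frac{it}{t^2 + 1}\,e^{-x\sqrt{t^2+1}}\,\hat{p}_0,$$

$$\hat{v}_R(x,t) := \tfrac{1}{2}UDE(2U - a + x)\,\frac{it}{t^2 + 1}\,e^{-iaVU^{-1}t}\,e^{-(a-x)\sqrt{t^2+1}}\,\hat{p}_0,$$

$$\hat{p}_D(x,t) := e^{-(1+itV)U^{-1}x}\,\hat{p}_0,$$

$$\hat{p}_L(x,t) := -UD\,\frac{1}{\sqrt{t^2 + 1}}\,e^{-x\sqrt{t^2+1}}\,\hat{p}_0,$$

$$\hat{p}_R(x,t) := UDE\,\frac{1}{\sqrt{t^2 + 1}}\,e^{-iaVU^{-1}t}\,e^{-(a-x)\sqrt{t^2+1}}\,\hat{p}_0.$$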

Upon taking an inverse Fourier transform, one obtains for $u_D$, $v_D$, and $p_D$ the formulas

(5.1a)  $$u_D(x,y) = -\pi UVD\,e^{-U^{-1}x}\,e^{-(y - VU^{-1}x)} \int_{-\infty}^{y - VU^{-1}x} p_0(t)\,e^{t}\,dt + \pi UVD\,e^{-U^{-1}x}\,e^{(y - VU^{-1}x)} \int_{y - VU^{-1}x}^{\infty} p_0(t)\,e^{-t}\,dt,$$

(5.1b)  $$v_D(x,y) = \pi U^2D\,e^{-U^{-1}x}\,e^{-(y - VU^{-1}x)} \int_{-\infty}^{y - VU^{-1}x} p_0(t)\,e^{t}\,dt - \pi U^2D\,e^{-U^{-1}x}\,e^{(y - VU^{-1}x)} \int_{y - VU^{-1}x}^{\infty} p_0(t)\,e^{-t}\,dt,$$

(5.1c)  $$p_D(x,y) = 2\pi\,e^{-U^{-1}x}\,p_0(y - VU^{-1}x).$$

If $p_0(y)$ has a jump discontinuity at $y = 0$ and is smooth in $(-\infty,0)$ and $(0,\infty)$, then the formulas (5.1) yield a discontinuity of $p_D$ and of the first derivatives of $u_D$ and $v_D$ on the line $y = VU^{-1}x$. It may be verified that the triple $[u_D, v_D, p_D]$ satisfies the jump conditions (4.3). One can also obtain expressions for the left and right functions, involving the convolution of $p_0$ with certain kernels. We shall not write down these kernels. Instead, we note that because of the presence of the factor $\exp\{-x\sqrt{t^2+1}\}$ in the Fourier transform of the left hand functions, these functions are smooth for $x > 0$, and similarly, because of the presence of the factor $\exp\{-(a - x)\sqrt{t^2+1}\}$ in the right hand functions, these functions are smooth for $x < a$. These functions have the "purpose" of reconciling the values of the unknowns at the boundary where the jump discontinuity occurs. An analysis of the remainder terms shows that $p_{rem}$ is continuous and $u_{rem}$, $v_{rem}$ are continuously differentiable in the strip. We conclude that (5.1) displays the discontinuous behavior of the solution of (4.1), (4.2) caused by a discontinuity in the inflow pressure.

Case 2:  $u_0 \neq 0$, $v_0 = p_0 = u_a = v_a = 0$.

Again,  the  asymptotic  formulas  for  the  coefficients  yield  a  discontinuous  part,  a  left  hand  part,  a  right 
hand  part,  and  a  smoother  remainder.  The  formulas  for  the  discontinuous  part  are 


(5.2a) 


uD  =  " UVDHV ^i=  -  o, 


L  rr  H  \„-(.l+xtV)U-'x. 


pD  =  D{U  +  V 


VFTT 


-(\+itV)U-lx, 


The formulas (5.2a) and (5.2b) may be inverted to give for $u_D$ and $v_D$ the formulas

up(x,y)=—UV2D2e-u~'z  [°°  K0(\y  -  s  -  VU~l x\)u0(s)ds 
,  T  J- OO 

a  1  - 

--UV2D2e~u  'x  /  sgn(y-s-VU~lx)e-lv-’-vu~'^u0(s)ds, 

**  J  — OO 


1  f°° 

vp(x,y) -~U2V D2e~u  'z  /  I\0(\y  -  s  ~  VU~l x\)u0(s)ds 

^  J  —OO 

1  /■OO 

+  -U2V D2e~u~'z  /  sgn(y  -  s  -  VU~l T'u0(s)ds. 

^  J  —  CO 


The inversion of (5.3) involves a $\delta$ function, and so does not have meaning if $u_0$ has a discontinuity. If $u_0$ is continuous, but $u_0'$ has a discontinuity at $y = 0$, we obtain for $p_D$ the formula


2  f°° 

(5.3 c)  pD(x,y)  =  DVe-u~'zu0(y- VU~X x)  - -^DV e~u" x  /  A'0(|y-s- 

V2ir  J-cx 


VU-lx\)u'0(S)ds. 


Suppose $u_0'$ has a jump discontinuity at $y = 0$ and is smooth elsewhere. Then the functions $u_D$ and $v_D$ have continuous first derivatives, but their second derivatives become infinite on the line $y = VU^{-1}x$, and the function $p_D$ is continuous, but the first derivatives of $p_D$ become infinite on the line $y = VU^{-1}x$. We conclude that a jump in $u_0'$ does not produce a pressure discontinuity, but does produce a higher order singularity in the solution on the line $y = VU^{-1}x$.

Case 3:  $u_a \neq 0$, $u_0 = v_0 = p_0 = v_a = 0$.

In this case, the asymptotic formulas yield only a right hand part. There is no discontinuity in the interior resulting from a discontinuity in $u_a$.
ABSTRACT. Mesh equidistribution for static approximation problems is extended to a domain that admits time dependency. Instead of some quantity being equidistributed over a static mesh, the change in this quantity is equidistributed over a dynamic, moving mesh. The intended application is in numerical methods for solving time-dependent partial differential equations.

This mesh moving scheme is incorporated into a finite element code which already refines adaptively. Comparisons are made for stationary and moving meshes.

INTRODUCTION.  An  equidistributed  mesh  in  one  space  dimension  is  a  par¬ 
tition  of  a  given  domain  into  subintervals  such  that  some  given  quantity  is  uni¬ 
form  over  each  subinterval.  More  specifically,  given  an  interval  (a,b)  and  a 
positive  weight  function  w(x)  defined  on  (a,b),  then  an  equidistributed  mesh 
is  a  partition 

$$\{a = x_0 < x_1 < x_2 < \dots < x_{M-1} < x_M = b\}$$

such that

$$\int_{x_{j-1}}^{x_j} w(x)\,dx = \text{constant} = \frac{1}{M}\int_a^b w(x)\,dx, \qquad j = 1,2,\dots,M \qquad (1)$$

The  usual  application  of  such  a  mesh  is  for  approximating  functional  rela¬ 
tionships  to  a  certain  accuracy  with  a  minimum  number  of  mesh  points  by  choosing 
w(x)  appropriately  [1].  Equidistribution  strategies  have  also  been  used  in 
numerical  methods  for  solving  two-point  boundary  value  problems  [2,3].  This  is 
because  it  has  been  shown  [4,5]  that  the  task  of  selecting  a  mesh  to  minimize 
the  discretization  error  is  asymptotically  equivalent  to  equidistributing  the 
local  discretization  error. 
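
A static equidistributed mesh in the sense of Eq. (1) can be computed by tabulating the cumulative integral of w and inverting it at the levels j/M of the total. The following is a minimal sketch under stated assumptions: the function names, the trapezoidal quadrature, and the number of quadrature cells `n` are choices made here for illustration, not the procedure of the references cited above.

```python
import bisect


def cumulative_trapezoid(w, a, b, n=2000):
    """Tabulate x and the trapezoidal cumulative integral of w on n cells."""
    h = (b - a) / n
    xs = [a + k * h for k in range(n + 1)]
    cum = [0.0]
    for k in range(n):
        cum.append(cum[-1] + 0.5 * h * (w(xs[k]) + w(xs[k + 1])))
    return xs, cum


def invert(xs, cum, target):
    """Find x where the tabulated cumulative integral equals target."""
    k = min(max(bisect.bisect_left(cum, target), 1), len(cum) - 1)
    frac = (target - cum[k - 1]) / (cum[k] - cum[k - 1])
    return xs[k - 1] + frac * (xs[k] - xs[k - 1])


def equidistribute(w, a, b, M, n=2000):
    """Mesh with M subintervals, each carrying 1/M of the integral of w.

    Assumes w is strictly positive on (a, b), as required in the text.
    """
    xs, cum = cumulative_trapezoid(w, a, b, n)
    total = cum[-1]
    interior = [invert(xs, cum, total * j / M) for j in range(1, M)]
    return [a] + interior + [b]
```

For example, `equidistribute(lambda x: 1.0, 0.0, 1.0, 4)` returns a uniform mesh up to rounding, while a weight that peaks near a front concentrates points there.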

The  successes  in  the  above  fields  of  functional  approximation  and  numerical 
ordinary  differential  equations  have  led  some  investigators  to  consider  the  use 
of  equidistribution  strategies  for  generating  moving  meshes  in  the  field  of 
numerical  partial  differential  equations  (PDEs)  [6-8].  The  general  framework  is 
to  simply  reconsider  Eq.  (1)  with  a  time  dependency.  That  is  to  say,  the 
problem  is  now  to  determine  a  dynamic  mesh 

$$\{a = x_0 < x_1(t) < x_2(t) < \dots < x_{M-1}(t) < x_M = b\}$$

at time t so that

$$\int_{x_{j-1}(t)}^{x_j(t)} w(x,t)\,dx = c(t) = \frac{1}{M}\int_a^b w(x,t)\,dx, \qquad j = 1,2,\dots,M \qquad (2)$$

where the positive weight function w(x,t) is usually chosen to be a function of the solution of the underlying PDE. For example, w has been chosen to be proportional to the solution's gradient, curvature, or local discretization error.

When  applying  Eq.  (2)  in  some  numerical  scheme,  most  investigators  move  a 
fixed  number  of  points  so  as  to  follow  and  resolve  local  nonuniformities  in  the 
solution.  In  order  to  guarantee  a  certain  accuracy,  they  must  be  sure  that  this 
fixed  number  is  large  enough  to  approximate  the  solution  throughout  the  entire 
spatial domain for the entire temporal "life" of the solution. Some see this as
a  limitation  since  the  correct  number  of  fixed  points  necessary  is  not  generally 
known  a  priori. 

Also,  this  moving  mesh  is  not  operating  in  a  vacuum.  It  is  being  used  in 
conjunction  with  some  numerical  solution  procedure.  Since  the  accuracy  of  most 
such  procedures  can  depend  on  the  shape  of  the  space-time  grid,  sometimes  the 
equidistribution  law  can  be  too  dynamic  and  deform  the  grid  enough  so  as  to 
introduce  a  new,  even  larger  source  of  error.  This  can  happen  even  if  the 
equidistribution  law  doesn't  demand  much  moving  of  its  own  accord.  If  a  non- 
equidistributed  mesh  is  used  as  the  initial  mesh,  then  the  grid  can  deform 
drastically  as  the  moving  mesh  tries  to  relocate  to  the  proper  equidistributed 
positions.  In  order  to  avoid  these  difficulties,  some  investigators  have  aban¬ 
doned  moving  altogether  and  developed  local  refinement  methods  [3]. 

A  local  refinement  method  is  a  procedure  where  uniform  fine  grids  are  added 
to  coarse  grids  in  regions  where  the  solution  is  not  adequately  resolved. 
Although  they  can  guarantee  a  solution  to  a  prescribed  accuracy,  they  can  be 
costly,  as  they  involve  recomputing  the  solution,  and  they  are  not  as  good  as 
moving  mesh  methods  at  reducing  dispersive  errors  in  the  vicinity  of  wavefronts. 

The  choice  here  has  been  to  combine  local  refinement  with  mesh  moving  based 
on  equidistribution.  The  purpose  is  twofold.  First,  the  refinement  procedure 
is  incorporated  to  avoid  any  drastic  deformation  of  the  grid  by  the  moving  mesh 
as  well  as  guarantee  a  prescribed  accuracy.  Second,  the  mesh  moving  is  applied 
in  order  to  obtain  as  accurate  a  solution  as  possible  for  any  given  discretiza¬ 
tion  so  as  to  put  off  the  need  for  refinement  for  as  long  as  possible  and  thus 
to  reduce  the  costs  involved. 

Equation  (2)  as  it  stands,  however,  is  not  easily  partnered  with  a  refine¬ 
ment  scheme.  It  is  too  dependent  on  mesh  position  and  the  number  of  extant  mesh 
points.  Hence,  a  refinement  step  can  disrupt  the  nature  of  the  equidistribution 
and cause a drastic change in the mesh dynamics similar to that caused by a
"bad" initial mesh.

The approach reported here for overcoming this difficulty was to extend Eq. (2) in such a way that it works with the refinement procedure rather than against it. It seemed that the dynamics of Eq. (2) were based on the static spatial nature of Eq. (1) and that an extension was needed that incorporated more of the time dependency of the domain and solution process.
In the next section, this extension of Eq. (2) is presented as well as the
algorithm  for  coupling  the  refinement  and  moving  procedures.  Then,  in  the 
following  section,  results  on  a  series  of  test  cases  are  presented  for  com¬ 
parison.  In  the  Discussion  section,  the  characteristics  of  the  extended 
equidistribution  law  are  discussed  in  light  of  the  results  of  the  previous  sec¬ 
tion.  Finally,  in  the  last  section,  some  conclusions  are  presented. 

PROCEDURES. The basic principle behind this extension of Eq. (2) is to try to equidistribute temporal properties as well as spatial. To this end, consider a typical time interval of interest (0,T) and a discretization

$$\{0 = t_0 < t_1 < t_2 < \dots < t_{N-1} < t_N = T\}$$

of that interval. Then, given any time level $t_{n-1}$ and any mesh

$$\{a = x_0^{n-1} < x_1^{n-1} < x_2^{n-1} < \dots < x_{M-1}^{n-1} < x_M^{n-1} = b\}$$

at that time level, require the new mesh

$$\{a = x_0^{n} < x_1^{n} < x_2^{n} < \dots < x_{M-1}^{n} < x_M^{n} = b\}$$

at the next time level $t_n$ to satisfy

$$\int_{x_{j-1}^{n}}^{x_j^{n}} w(x,t_n)\,dx = \int_{x_{j-1}^{n-1}}^{x_j^{n-1}} w(x,t_{n-1})\,dx + \frac{1}{M}\left\{\int_a^b w(x,t_n)\,dx - \int_a^b w(x,t_{n-1})\,dx\right\}, \qquad j = 1,2,\dots,M \qquad (3)$$

Note that Eq. (3) is not an equidistribution law in the sense of Eqs. (1) and (2). No quantity is being held constant over any subinterval by the enforcement of Eq. (3). Rather, it is the change in the quantity w over each new subinterval $(x_{j-1}^{n}, x_j^{n})$ that is allowed to vary by a constant amount (which is proportional to the total change in w from $t_{n-1}$ to $t_n$) when compared to its value over the old subinterval $(x_{j-1}^{n-1}, x_j^{n-1})$. In a sense then, it is the time change of this quantity that is being equidistributed.

Note also that the relationship between old and new meshes is not as dramatic as in Eq. (2). If $\{x_j^{n-1}\}$ is a "bad" mesh, e.g., suppose it was readjusted by a refinement procedure, then Eq. (3) simply requires that

$$\int_{x_{j-1}^{n}}^{x_j^{n}} w(x,t_n)\,dx$$

differs from

$$\int_{x_{j-1}^{n-1}}^{x_j^{n-1}} w(x,t_{n-1})\,dx$$

by a constant amount over each subinterval and does not require any drastic readjustment to a new equidistributed position.
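
One way to realize a single mesh move under Eq. (3) is to sum the relation over the first j subintervals, which gives a target value for the cumulative integral of w at the new time level, and then invert that cumulative integral. The sketch below is only an illustration: the function names, the trapezoidal quadrature, and the resolution `n` are assumptions made here, and the weights are assumed positive with every per-interval target remaining positive so that the new nodes stay ordered.

```python
import bisect


def cumulative(w, a, b, n=4000):
    """Tabulate x and the trapezoidal cumulative integral of w on n cells."""
    h = (b - a) / n
    xs = [a + k * h for k in range(n + 1)]
    cum = [0.0]
    for k in range(n):
        cum.append(cum[-1] + 0.5 * h * (w(xs[k]) + w(xs[k + 1])))
    return xs, cum


def interp(xq, xs, ys):
    """Piecewise-linear interpolation of the increasing table (xs, ys) at xq."""
    k = min(max(bisect.bisect_left(xs, xq), 1), len(xs) - 1)
    f = (xq - xs[k - 1]) / (xs[k] - xs[k - 1])
    return ys[k - 1] + f * (ys[k] - ys[k - 1])


def move_mesh(old_mesh, w_old, w_new, n=4000):
    """New mesh satisfying Eq. (3), given weights at t_{n-1} and t_n."""
    a, b = old_mesh[0], old_mesh[-1]
    M = len(old_mesh) - 1
    xs, cum_old = cumulative(w_old, a, b, n)
    _, cum_new = cumulative(w_new, a, b, n)
    shift = (cum_new[-1] - cum_old[-1]) / M  # per-interval share of total change
    new_mesh = [a]
    for j in range(1, M):
        # Eq. (3) summed over intervals 1..j: old cumulative value plus j shares.
        target = interp(old_mesh[j], xs, cum_old) + j * shift
        new_mesh.append(interp(target, cum_new, xs))  # invert the new cumulative
    new_mesh.append(b)
    return new_mesh
```

If the weight does not change between time levels, the shift is zero and the mesh is returned unchanged, even when the old mesh is far from equidistributed. This is the behavior the text contrasts with Eq. (2).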

In order to test the performance of Eq. (3) and its postulated properties, it was incorporated into a numerical PDE solver that already implemented an automatic refinement strategy. The overall solution algorithm is as follows:

1.  Move  mesh  to  next  time  level  according  to  Eq.  (3). 

2.  Solve PDE using finite elements in space and finite differences in
time.

3.  Estimate  error  that  occurred  in  the  solution  process. 

4.  If  the  error  is  less  than  or  equal  to  a  prescribed  tolerance  and  the 
time  level  is  less  than  T,  then  go  to  step  1. 

5.  If  the  error  is  greater  than  the  tolerance,  refine  in  either  space  or 
time  or  both,  then  go  to  step  2. 

6.  If  the  time  level  is  greater  than  or  equal  to  T,  then  stop. 
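
The six steps above can be sketched as a simple control loop. Everything in this sketch is hypothetical scaffolding: the four callables (`move_mesh`, `solve_step`, `estimate_error`, `refine`) stand in for the mesh mover, the finite element solver, the error estimator, and the refinement procedure, none of which are specified in enough detail here to implement.

```python
def adaptive_march(mesh, T, dt, tol, move_mesh, solve_step, estimate_error, refine):
    """Advance from t = 0 to t >= T following steps 1-6 of the algorithm.

    The inner loop repeats until the error estimate meets the tolerance, so
    the caller's refine() must eventually reduce the estimated error.
    """
    t = 0.0
    log = []
    while t < T:                                      # step 6: stop once t >= T
        mesh = move_mesh(mesh, t, dt)                 # step 1
        while True:
            solution = solve_step(mesh, t, dt)        # step 2
            err = estimate_error(solution, mesh, dt)  # step 3
            if err <= tol:                            # step 4: accept the step
                break
            mesh, dt = refine(mesh, dt)               # step 5: refine, re-solve
        t += dt
        log.append((t, len(mesh) - 1, dt, err))
    return log
```

Note that the mesh is moved once per accepted step and is not moved again after a refinement, matching the "go to step 2" branch of step 5.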

RESULTS. The following PDE was solved numerically for all test cases:


ut  -  “xU  +  15  u)  =  256  u*x  '  0  <  x  <  1 

u(x,0) = tanh 10(x-1) , 0 ≤ x ≤ 1

u(0,t) = tanh 10(-1+t) , t ≥ 0


t  >  0 


u(1,t) = tanh 10t , t ≥ 0

The  exact  solution,  u(x,t)  =  tanh  10(x-l+t),  is  simply  a  wavefront  that  moves 
through  the  spatial  domain  from  right  to  left  as  time  progresses.  Optimally, 
the  mesh  should  try  to  follow  the  front  as  it  moves  across  the  interval  (0,1). 


For all cases, the L2 norm of the error was prescribed to be less than a tolerance of 0.01, an initial uniform mesh with 11 points (M = 11) and an initial time step of 0.05 (Δt = 0.05) were input, and the solution process was allowed to proceed for 75 time steps.


Case 2. For this case, movement was based on the first time derivative of the solution (w(x,t) = |ut(x,t)|). Mesh trajectories are shown in Figure 2. At the end of 75 time steps, N = 40 and Δt = 0.00579.


Figure  2 


Mesh Trajectories for Case 2. Stars on x-axis indicate the mesh input before any moving or refining. Horizontal lines indicate a temporal refinement has occurred (actual values of Δt are not shown).


Case  3.  For  this  case,  movement  was  based  on  the  second  spatial  derivative 
of  the  solution  (w(x,t)  =  |  uxx(x,t)|  ).  Mesh  trajectories  are  shown  in  Figure  3. 
At the end of 75 time steps, N = 23 and Δt = 0.00201.


Figure  3 

Mesh Trajectories for Case 3. Stars on x-axis indicate the mesh input before any moving or refining. Horizontal lines indicate a temporal refinement has occurred (actual values of Δt are not shown).
DISCUSSION. Overall, the results are very encouraging. The mesh trajectories flow smoothly with whatever solution characteristic the equidistribution law is based on. This is true even when the initial mesh is unrelated to the equidistribution rule and when the refinement procedure alters the mesh (see Figures 2 and 3). This is exactly as desired and postulated.

Furthermore,  it  seems  that  mesh  moving  can  decrease  the  amount  of  refine¬ 
ment  necessary  for  a  given  problem  as  hoped.  This  is  evident  when  comparing 
case  1  with  cases  2  and  3. 

In  case  2,  the  level  of  temporal  refinement  is  less  than  in  case  1  for  the 
same  number  of  time  steps  and  the  same  tolerance  level.  This  is  as  expected 
since  the  temporal  component  of  the  error  is  proportional  to  a  time  derivative 
of  the  solution.  Hence,  movement  based  on  equidistributing  this  error  should 
reduce  the  temporal  refinement  necessary. 

Similarly, in case 3, the level of spatial refinement is less than in case 1. Once again, this is as expected since the movement here is based on a quantity proportional to the spatial component of the error.

CONCLUSIONS.  Whether  or  not  this  new  moving  scheme  will  develop  into  a 
robust  numerical  procedure  is  still  uncertain.  There  are  still  stability 
questions  to  be  answered  as  well  as  some  implementation  difficulties  not 
addressed  here. 

However,  the  results  presented  here  give  credence  to  the  notion  that  mesh 
moving  and  refinement  schemes  can  be  successfully  combined.  Refinement  proce¬ 
dures  do  not  have  to  interfere  with  mesh  movement  and  mesh  movement  can  be  per¬ 
formed  to  reduce  the  levels  of  refinement  necessary  to  solve  a  problem  to  a 
given  tolerance. 
ABSTRACT. Consider the adaptive solution of two-dimensional vector systems of
hyperbolic  and  elliptic  partial  differential  equations  on  shared-memory  parallel  comput¬ 
ers.  Hyperbolic  systems  are  approximated  by  an  explicit  finite  volume  technique  and 
solved  by  a  recursive  local  mesh  refinement  procedure  on  a  tree-structured  grid.  Local 
refinement  of  the  time  steps  and  spatial  cells  of  a  coarse  base  mesh  is  performed  in 
regions  where  a  refinement  indicator  exceeds  a  prescribed  tolerance.  Computational 
procedures  that  sequentially  traverse  the  tree  while  processing  solutions  on  each  grid  in 
parallel,  that  process  solutions  at  the  same  tree  level  in  parallel,  and  that  dynamically 
assign  processors  to  nodes  of  the  tree  have  been  developed  and  applied  to  an  example. 
Computational  results  comparing  a  variety  of  heuristic  processor  load  balancing  tech¬ 
niques  and  refinement  strategies  are  presented. 

For  elliptic  problems,  the  spatial  domain  is  discretized  using  a  finite  quadtree 
mesh  generation  procedure  and  the  differential  system  is  discretized  by  a  finite 
element-Galerkin  technique  with  a  hierarchical  piecewise  polynomial  basis.  Adaptive 
mesh  refinement  and  order  enrichment  strategies  are  based  on  control  of  estimates  of 
local and global discretization errors. Resulting linear algebraic systems are solved by a conjugate gradient technique with a symmetric successive over-relaxation preconditioner. Stiffness matrix assembly and linear system solution are processed in parallel with computations scheduled on noncontiguous quadrants of the tree in order to minimize process synchronization. Determining noncontiguous regions by coloring the regular finite quadtree structure is far simpler than coloring elements of the unstructured mesh that the finite quadtree procedure generates. We describe a linear-time complexity coloring procedure that uses a maximum of six colors.

* This research was partially supported by the U. S. Air Force Office of Scientific Research, Air Force Systems Command, USAF, under Grant Number AFOSR-90-0194; by the SDIO/IST under management of the U. S. Army Research Office under Contract Number DAAL03-90-C--0096; and by the National Science Foundation under Grant Numbers CDA-8805910 and CCR-8920694.

1. INTRODUCTION. Partial differential equations that arise in scientific and engineering applications typically feature solutions that develop, evolve, and decay on diverse temporal and spatial scales. The Fokker-Planck equation of mathematical physics may be used to illustrate this phenomenon. Perspective renditions of its solution u as a function of two spatial arguments x and y are shown at four times t in Figure 1 [20]. As time progresses, a single "spike" in the probability density arising from an initial Maxwell-Boltzmann distribution evolves into the two spikes shown. Conventional fixed-step and fixed-order finite difference and finite element techniques for solving such problems would either require excessive computing resources or fail to adequately resolve nonuniform behavior. As a result, they are gradually being replaced by adaptive methods that offer greater efficiency, reliability, and robustness by automatically refining, coarsening, or relocating meshes or by varying the order of numerical accuracy.

Adaptive software for ordinary differential equations has existed for some time and procedures that vary both mesh spacing and order of accuracy are in common use for both initial [17] and boundary [7] value problems. The situation is far more difficult for partial differential equations due to the greater diversity of phenomena that can occur; however, some production-ready adaptive software has appeared for elliptic problems [12]. The state of the art for transient problems lags that for elliptic systems but some projects are underway [16]. Adaptive strategies will either have to be retrofitted into an existing software system for solving partial differential equations or have to be coupled with pre- and post-processing software tools before widespread use occurs.

With  an  adaptive  procedure,  an  initial  crude  approximate  solution  generated  on  a 
coarse  mesh  with  a  low-order  numerical  method  is  enriched  until  a  prescribed  accuracy 
level  is  attained.  Adaptive  strategies  in  current  practice  are  classified  as  h-,  p-,  or  r- 
refinement  when,  respectively,  computational  meshes  are  refined  or  coarsened  in 
regions  of  the  problem  domain  that  require  more  or  less  resolution  [6,  12],  the  order  of 
accuracy  is  varied  in  different  regions  [10],  or  a  fixed-topology  mesh  is  redistributed 
[5].  These  basic  enrichment  methods  may  be  used  alone  or  in  combination.  The  par¬ 
ticular  combination  of  h-  and  p-refinement,  for  example,  has  been  shown  to  yield 
exponential  convergence  rates  in  certain  situations  [9]. 

Figure 1. Solution u(x,y,t) of the Fokker-Planck equation at times t = 4 (upper left), 10 (upper right), 20 (lower left), and 100 (lower right) obtained by Moore and Flaherty [20]. Solutions having magnitudes greater than 0.1 have been omitted in order to emphasize fine-scale structure.

Enrichment indicators, which are frequently estimates of the local discretization error of the numerical scheme, are used to control the adaptive process. Resources (finer meshes, higher-order methods, etc.) are introduced in regions having large enrichment indicators and deleted from regions where indicators are low. Using estimates of the discretization error as enrichment indicators also enables the calculation of local and global accuracy measures which should become a standard part of every scientific computation. Estimates of the local discretization error are typically obtained by using either h- or p-refinement. Thus, one uses the difference between solutions computed either on two meshes or with two distinct orders of accuracy to furnish an error estimate. Special "superconvergence" points where solutions converge faster than they do globally can be used to significantly reduce computational cost [2].
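
The two-mesh idea can be illustrated in its simplest form with piecewise-linear interpolation: on each cell, compare the coarse-mesh interpolant with the interpolant on the bisected mesh at the cell midpoint, and flag cells where the difference exceeds a tolerance. This is a toy sketch of an h-refinement indicator under assumptions made here (the function names and the use of interpolation in place of a PDE solution are illustrative), not the estimators of the cited references.

```python
import math


def two_mesh_indicators(f, mesh):
    """Per-cell difference between coarse and bisected linear interpolants."""
    indicators = []
    for x0, x1 in zip(mesh, mesh[1:]):
        mid = 0.5 * (x0 + x1)
        coarse = 0.5 * (f(x0) + f(x1))  # coarse interpolant evaluated at mid
        fine = f(mid)                   # bisected-mesh interpolant at mid
        indicators.append(abs(fine - coarse))
    return indicators


def cells_to_refine(indicators, tol):
    """Indices of cells whose indicator exceeds the tolerance."""
    return [j for j, e in enumerate(indicators) if e > tol]


def front(x):
    """Sample profile with a steep front at x = 0.5 (illustrative only)."""
    return math.tanh(10.0 * (x - 0.5))
```

On a uniform ten-cell mesh of (0,1), the indicators for `front` are largest in the cells adjacent to x = 0.5 and decay rapidly away from it, so only the cells near the front are selected for refinement.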

Parallel procedures are becoming increasingly important both as hardware systems become available and as problem complexity increases. Furthermore, the efficiency afforded by adaptive strategies cannot be ignored in a parallel computational environment since the demand to model nature more accurately is always beyond hardware capabilities. Models of parallel computation are based on distributed-memory and shared-memory architectures. Distributed-memory systems tend to have large numbers of relatively simple processing elements connected in a network. Available memory on these fine-grained systems is distributed with the processing elements at the nodes of the network, so data access is by message passing. Balancing communication and synchronizing processing is extremely important because processing elements are typically operating in lock-step fashion in order to improve throughput and processor utilization. Shared-memory systems involve a more coarse-grained level of parallelism with relatively few processors operating asynchronously and communicating with a global memory, although variations are common. For example, processing elements may have a local cache memory in order to reduce bus contention and may have vector capabilities, thus providing a hierarchy of coarse- and fine-grained parallelism.

Our goal is to develop parallel adaptive methods for partial differential equations. Fortunately, our adaptive software utilizes hierarchical (tree) data structures that have many embedded parallel constructs. Transient hyperbolic problems may generally be solved using explicit numerical techniques which greatly simplify processor communication. Experiments, reported in Section 2, with a variety of tree traversal strategies on an adaptive mesh refinement finite difference scheme [6] indicate that the dynamic load balancing scheme of assigning grid-vertex computations at a given tree level to processors as they become available provides the best parallel performance. Static load balancing strategies, which either traverse the tree of grids serially while processing solutions on each grid in parallel or traverse the tree in parallel while processing solutions on grids at the same tree level, are also discussed. These alternatives to dynamic processor assignment may provide better performance on hierarchical memory computers.


For  elliptic  problems,  system  assembly  and  solution  are  processed  in  parallel  with 
computations  scheduled  on  noncontiguous  tree  quadrants  in  order  to  minimize  process 
synchronization.  “Coloring”  the  elements  of  a  mesh  so  as  to  avoid  memory  conten¬ 
tion  on  a  shared-memory  computer  is  far  simpler  when  an  underlying  tree  structure  is 
present  than  for  more  general  unstructured  grids  that  the  finite  quadtree  structure  gen¬ 
erates.  The  six-color  procedure,  described  in  Section  3,  for  the  finite  element  solution 
scheme  on  quadtree-structured  grids  displays  a  high  degree  of  parallelism  when  piece- 
wise  linear  approximations  are  used.  Unfortunately,  the  same  procedure  does  not  do 
as  well  with  higher-order  piecewise  polynomial  approximations;  however,  an  element 
edge  coloring  procedure  may  improve  performance. 


2.  HYPERBOLIC  SYSTEMS.  Consider  a  system  of  two-dimensional  conservation 
laws  in  m  variables  on  a  rectangular  domain  having  the  form 

$$u_t + f_x(x,y,t,u) + g_y(x,y,t,u) = 0, \qquad (x,y) \in \Omega, \quad t > 0, \qquad (1a)$$

subject to the initial conditions

$$u(x,y,0) = u^0(x,y), \qquad (x,y) \in \Omega \cup \partial\Omega, \qquad (1b)$$

and appropriate well-posed boundary conditions. The functions $u$, $f$, $g$, and $u^0$ are m-vectors, $x$ and $y$ are spatial coordinates, $t$ denotes time, and $\partial\Omega$ is the boundary of $\Omega$.

Our research is based on a serial adaptive hr-refinement algorithm of Arney and Flaherty [6]. We forego mesh motion at present and briefly describe an h-refinement procedure that utilizes their strategy. The problem (1) is solved on a coarse rectangular "base" mesh for a sequence of base-mesh time slices of duration $\Delta t_n$, $n = 0, 1, \dots$, by an explicit finite difference, finite volume, or finite element scheme. For a base-mesh time step, say from $t_n$ to $t_{n+1} = t_n + \Delta t_n$, a discrete solution is generated on the base mesh along with a set of local enrichment indicators which, in this case, are refinement indicators. Cells of the mesh where refinement indicators at $t_{n+1}$ fail to satisfy a prescribed tolerance are identified and grouped into rectangular clusters. After ensuring that clusters have an adequate percentage of high-refinement-indicator cells and subsequently enlarging the clusters by a one-element buffer to provide a transition between regions of high and low refinement indicators, cells of the base mesh are bisected in space and time to create finer meshes that are associated with each rectangular cluster. Local problems are solved on the finer meshes and the refinement procedure is repeated until refinement indicators satisfy the prescribed unit-step criteria. After finding an acceptable solution on the base mesh, the integration continues with, possibly, a new base-mesh time step $\Delta t_{n+1}$. Data management involves the use of a tree structure with nodes of the tree corresponding to meshes at each refinement level for the current base-mesh time step. The base mesh is the root of the tree and finer grids are regarded as offspring of coarser ones.

With  an  aim  of  maintaining  generality  at  the  possible  expense  of  accuracy  and 
performance,  we  discretize  (1)  using  the  Richtmyer  two-step  version  of  the  Lax- 
Wendroff  method  [23],  which  we  describe  for  a  one-dimensional  problem  as  follows. 
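Introduce a mesh on $\Omega$ having spacing $\Delta x_j = x_{j+1} - x_j$ and let the discrete
approximation of $\mathbf{u}(x_j, t_n)$ be denoted as $\mathbf{U}_j^n$. Predicted solutions at cell centers
are generated by the Lax-Friedrichs scheme, i.e.,

$$ \mathbf{U}_{j+1/2}^{n+1/2} = \frac{1}{2}\left(\mathbf{U}_{j+1}^n + \mathbf{U}_j^n\right) - \frac{\Delta t_n}{2\Delta x_j}\left(\mathbf{f}_{j+1}^n - \mathbf{f}_j^n\right). \tag{2a} $$

This provisional solution is then corrected by the leapfrog scheme

$$ \mathbf{U}_j^{n+1} = \mathbf{U}_j^n - \frac{2\Delta t_n}{\Delta x_j + \Delta x_{j-1}}\left(\mathbf{f}_{j+1/2}^{n+1/2} - \mathbf{f}_{j-1/2}^{n+1/2}\right). \tag{2b} $$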
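
As an illustration, the two-step scheme (2) may be sketched in Python for a scalar problem. The function name `richtmyer_step`, the uniform spacing, and the periodic boundary treatment are assumptions made for brevity; they are not part of the adaptive algorithm described here.

```python
import numpy as np


def richtmyer_step(u, flux, dx, dt):
    """Advance u by one time step dt on a uniform periodic mesh of spacing dx."""
    f = flux(u)
    # Predictor (2a): Lax-Friedrichs values at x_{j+1/2}, t_{n+1/2}.
    u_half = 0.5 * (np.roll(u, -1) + u) - dt / (2.0 * dx) * (np.roll(f, -1) - f)
    f_half = flux(u_half)
    # Corrector (2b): leapfrog; on a uniform mesh 2*dt/(dx_j + dx_{j-1}) = dt/dx.
    return u - dt / dx * (f_half - np.roll(f_half, 1))


# Linear advection u_t + u_x = 0 of a sine wave over one period.
n = 200
dx = 1.0 / n
x = dx * np.arange(n)
u = np.sin(2.0 * np.pi * x)
for _ in range(2 * n):                      # dt = dx/2, so 2n steps reach t = 1
    u = richtmyer_step(u, lambda v: v, dx, 0.5 * dx)
max_error = np.max(np.abs(u - np.sin(2.0 * np.pi * x)))
```

With a Courant number of one and linear flux the step reduces to an exact shift by one cell, which gives a convenient check of an implementation.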


Following Arney et al. [4, 6], refinement indicators are selected as estimates of the
local discretization error obtained by Richardson’s extrapolation (h-refinement) on a
mesh having half the spatial and temporal spacing of the mesh used to generate the
solution. Fine-mesh solutions generated as part of this error estimation process may
subsequently be used on finer meshes when refinement is necessary. Initial and boundary
data for refined meshes are determined by piecewise bilinear polynomial interpolation
from acceptable solutions on the finest available meshes.
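
The idea behind a Richardson-type estimate can be sketched as follows. This is only a generic illustration under the assumption that the error behaves like $C h^p$; the function name and the default values of `order` and `ratio` are hypothetical, and the indicators of [4, 6] may be scaled differently.

```python
def richardson_error_estimate(u_coarse, u_fine, order=2, ratio=2):
    """Estimate the error in u_coarse from a second solution u_fine computed
    with the mesh spacing reduced by `ratio`, assuming error ~ C * h**order."""
    scale = ratio ** order
    return (u_coarse - u_fine) * scale / (scale - 1.0)


# Synthetic check: exact value 1, error constant C = 0.3, h = 0.1, order 2.
coarse = 1.0 + 0.3 * 0.1 ** 2
fine = 1.0 + 0.3 * 0.05 ** 2
estimate = richardson_error_estimate(coarse, fine)   # close to 0.3 * 0.1**2
```
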

Parallel procedures are developed for the adaptive h-refinement solution scheme
described above using a P-processor concurrent read exclusive write (CREW)
shared-memory MIMD computer. We consider both static and dynamic strategies for
balancing processor loading. As the names imply, with static load balancing, processors are
assigned  tasks  a  priori  with  the  goal  of  having  them  all  terminate  at  approximately  the 
same  time,  whereas,  with  dynamic  load  balancing,  available  processors  are  assigned 
tasks  from  a  task  queue.  Two  possible  static  load  balancing  techniques  come  to  mind: 
(i) serial depth-first traversal of the tree of grids with solutions on each grid being generated
in parallel and (ii) parallel generation of solutions on all grids that are at the
same  tree  level.  With  the  depth-first  traversal  procedure,  each  grid  is  statically  divided 
into  P  subregions  and  a  processor  is  assigned  to  each  subregion.  With  the  parallel 
tree  traversal  procedure,  the  P  processors  are  distributed  among  all  grids  at  a  particular 
tree  level  so  as  to  balance  loading.  Thus,  parallelism  occurs  both  within  a  grid  and 
across  the  breadth  of  the  tree  with  this  strategy.  In  both  cases,  the  parallel  solution 
process  proceeds  from  one  base-mesh  time  step  to  the  next. 
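
A static division of one grid among P processors, as used with the depth-first traversal, might look like the sketch below. Splitting the cells into contiguous index ranges is an assumption made for illustration; the text does not specify the shape of the subregions.

```python
def static_partition(n_cells, n_procs):
    """Split cell indices 0..n_cells-1 into n_procs contiguous ranges whose
    sizes differ by at most one; returns a list of (start, stop) pairs."""
    base, extra = divmod(n_cells, n_procs)
    bounds = []
    start = 0
    for p in range(n_procs):
        stop = start + base + (1 if p < extra else 0)
        bounds.append((start, stop))
        start = stop
    return bounds


# Ten cells shared by three processors.
ranges = static_partition(10, 3)
```
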

Serial  depth-first  traversal  of  the  tree  leads  to  a  highly  structured  algorithm  that 
has a straightforward design because the same procedure is used on all grids. Balancing
processor loading on rectangular grids is nearly perfect with an explicit finite
difference  scheme  like  (2)  and  similar  behavior  could  be  expected  for  geometrically 
complex  regions.  Load  imbalance  occurs  due  to  differences  in  the  time  required  to 
compute  initial  data.  Other  than  at  t  =  0,  initial  data  is  determined  by  interpolating 
solutions  from  the  finest  grid  at  the  end  of  the  previous  base-mesh  time  step  to  the 
present  grid.  Tree  traversal,  required  to  determine  the  correct  solution  vertices  for  the 
interpolation,  would  generally  take  different  times  in  different  regions  due  to  variations 
in  tree  depth.  This  defect  might  be  remedied  by  using  either  a  more  sophisticated 
domain  decomposition  technique  or  a  more  complex  data  structure  to  store  the  tree  of 
grids. 

The  serial  depth-first  traversal  procedure  becomes  inefficient  when  P  is  of  the 
order of the number of elements in a grid. This situation can be avoided by refining
grids by more than a binary factor, thus maintaining a shallow tree depth. Lowering
the efficiency of clusters by including a greater percentage of low-refinement-indicator
cells will also increase grid size but diminish optimal grid usage. The inefficiency
cited here should not be a factor on data-parallel computers and the serial tree traversal
procedure  might  also  be  viable  there. 

The  parallel  tree-traversal  procedure  requires  complex  dynamic  scheduling  to 
assign  processors  to  grids.  One  possibility  is  to  estimate  the  work  remaining  to  reduce 
error  estimates  to  prescribed  tolerances  and  to  assign  processors  to  subgrids  so  as  to 
balance this load. Were such a heuristic technique successful, the parallel tree
traversal procedure would not degrade in efficiency when the number of elements on a grid
is O(P).

Consider a situation where Q processors are used to obtain a solution on a grid at
tree level $l - 1$ and suppose that refinement indicators dictate the creation of L grids
$G_{l,i}$, $i = 1, 2, \ldots, L$, at level $l$. Further assume that

i. the prescribed local refinement tolerance at level $l - 1$ is $\tau_{l-1}$;

ii. the areas of $G_{l,i}$ are $M_{l,i}$, $i = 1, 2, \ldots, L$;

iii. estimates $E_{l,i}$ of the discretization error are available for $G_{l,i}$, $i = 1, 2, \ldots, L$;
and

iv. the convergence rate of the numerical scheme is known as a function of the local
mesh spacing.

The  Richtmyer  two-step  scheme  (2)  has  a  quadratic  convergence  rate  which  we  use  to 
illustrate  the  load  balancing  technique;  however,  the  approach  easily  extends  to  other 
convergence  rates. 


In order to satisfy the prescribed accuracy criterion, $G_{l,i}$ should be refined by a
factor of $\sqrt{E_{l,i}/\tau_{l-1}}$. The time step on $G_{l,i}$ must be reduced by a factor of
$\sqrt{E_{l,i}/\tau_{l-1}}$ in order to satisfy the Courant condition. Hence, the expected work
$W_{l,i}$ to find an acceptable solution on $G_{l,i}$ is

$$ W_{l,i} = M_{l,i} \left[ \frac{E_{l,i}}{\tau_{l-1}} \right]^{3/2}. \tag{3} $$


The Q available processors should be allocated so as to balance the time required to
complete the expected work on each of the L grids at level $l$. Thus, assign $Q_i$
processors to grid $G_{l,i}$, $i = 1, 2, \ldots, L$, so that

$$ \frac{W_{l,1}}{Q_1} = \frac{W_{l,2}}{Q_2} = \cdots = \frac{W_{l,L}}{Q_L}, \qquad \sum_{i=1}^{L} Q_i = Q. \tag{4a,b} $$
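
The allocation rule can be sketched as follows. The work estimate assumes a scheme of order `order` in `dim` space dimensions, so that the refinement factor is $(E/\tau)^{1/\text{order}}$ and the work grows with its $(\text{dim}+1)$st power. Because (4) generally has no integer solution, the sketch rounds by a largest-remainder rule and gives every grid at least one processor. That rounding rule and both function names are assumptions, not part of the procedure described above.

```python
def expected_work(area, err, tol, order=2, dim=2):
    """Work estimate: grid area times (refinement factor)**(dim + 1),
    counting dim space directions plus the reduced time step."""
    factor = (err / tol) ** (1.0 / order)
    return area * factor ** (dim + 1)


def allocate_processors(work, n_procs):
    """Integer processor counts, roughly proportional to work, summing to n_procs."""
    if n_procs < len(work):
        raise ValueError("need at least one processor per grid")
    total = float(sum(work))
    shares = [n_procs * w / total for w in work]
    alloc = [int(s) for s in shares]
    # Hand out the processors lost to truncation, largest remainder first.
    by_remainder = sorted(range(len(work)),
                          key=lambda i: shares[i] - alloc[i], reverse=True)
    for i in by_remainder[: n_procs - sum(alloc)]:
        alloc[i] += 1
    # Every grid needs a processor; take one from the best-served grid.
    for i in range(len(alloc)):
        if alloc[i] == 0:
            donor = max(range(len(alloc)), key=lambda k: alloc[k])
            alloc[donor] -= 1
            alloc[i] = 1
    return alloc


counts = allocate_processors([5.0, 3.0, 1.0], 15)
```
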


The  quality  of  load  balancing  using  this  approach  will  depend  on  the  accuracy  of 
the discretization error estimate. Previous investigations [4, 6] revealed that error estimates
were generally better than 80 percent of the actual error for a wide range of
mesh  spacings  and  problems.  Equation  (3)  can  be  used  to  select  refinement  factors 
other  than  binary  and,  indeed,  to  select  different  refinement  levels  for  different  meshes 
at  a  given  tree  level.  This  consideration  combined  with  over-refinement  to  a  tolerance 
somewhat  less  than  the  prescribed  tolerance  should  maintain  a  shallow  tree  depth  and 
enhance  parallelism  at  the  expense  of  grid  optimality. 

Simple dynamic load balancing can take full advantage of the CREW shared-memory
MIMD environment. One just maintains a queue of mesh points at a given
tree level and computes solutions at these points as processors become available. Load
balancing is perfect except for any inherent system hardware anomalies. Balancing
processor loads on geometrically complex regions is as simple as on rectangular
regions because mesh points are processed on a first-come, first-served basis
independently of the grid to which they belong. Nonuniformities in initial data also introduce
no problems and neither does the relationship of P to the number of cells in a grid.
Finally,  complex  processor  scheduling  based  on  accurate  error  estimates  is  avoided. 
This  strategy,  however,  might  not  be  appropriate  for  hierarchical  or  distributed-memory 
computers. 
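
The queue-based strategy can be sketched with Python threads. This is only an illustration of the scheduling pattern: the function `dynamic_schedule` and its arguments are hypothetical, and in CPython the threads do not run compute-bound work truly in parallel.

```python
import queue
import threading


def dynamic_schedule(tasks, n_workers, work):
    """Process every task with `work`, letting idle workers pull the next
    task from a shared queue; returns a dict mapping task -> result."""
    pending = queue.Queue()
    for task in tasks:
        pending.put(task)
    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            try:
                task = pending.get_nowait()
            except queue.Empty:
                return                      # no work left for this worker
            value = work(task)
            with lock:                      # exclusive write to shared results
                results[task] = value

    threads = [threading.Thread(target=worker) for _ in range(n_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


squares = dynamic_schedule(range(20), 4, lambda i: i * i)
```
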

Binary refinement of space-time grids may be optimal in using the fewest mesh
points; however, tree depth tends to be large and this introduces serial overhead into a
parallel procedure. As previously suggested, serial overhead can be reduced by keeping
tree depth shallow and to do this we perform M-ary instead of binary refinement.
The value of M is chosen adaptively for different clusters so that the prescribed
tolerance is satisfied after a single refinement step. Thus, if $\tau_0$ is the prescribed local
discretization error tolerance, then choose M for grid $G_{1,i}$ as the first even integer
greater than $\sqrt{E_{1,i}/\tau_0}$. Having a good a priori knowledge of the work required on each
cluster, processors can be distributed among the grids according to (4) to effectively
balance loading. Of course, the refinement tolerance may not be satisfied after
performing one level of M-ary refinement. Should this occur, we perform additional
levels of 2-ary refinement until accuracy requirements are satisfied. The terms “binary”
and “2-ary” refinement have been used to distinguish differences in our methods of
checking the refinement condition. With binary refinement, the refinement condition is
checked after each of the two finer time steps but with 2-ary refinement, the condition
is only checked after the second time step. As a result, the fine grids remain
unchanged for both of the two finer time steps with 2-ary refinement.
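
The selection of M can be sketched as below. The refinement factor is taken as $(E/\tau)^{1/p}$ with $p = 2$ for the quadratically convergent scheme (2); this scaling and the function name `choose_m` are assumptions made for illustration.

```python
import math


def choose_m(err, tol, order=2):
    """First even integer strictly greater than (err / tol)**(1 / order)."""
    factor = (err / tol) ** (1.0 / order)
    return 2 * math.floor(factor / 2.0) + 2


m_large = choose_m(0.09, 0.01)    # refinement factor about 3
m_small = choose_m(0.005, 0.01)   # error already below the tolerance
```

Even when the estimate is below the tolerance the rule returns 2, so a cluster that has been flagged is always refined at least once.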

The efficiency of this mesh refinement strategy and of the serial depth-first
traversal and dynamic balancing techniques is appraised in an example. Performance of
the parallel traversal procedure was not as good as either of these schemes and results
are not presented for it. Computer codes based on these algorithms have been
implemented on a 16-processor Sequent Balance 21000 shared-memory parallel computer.
Parallelism  on  this  system  is  supported  through  the  use  of  a  parallel  programming 
library  that  permits  the  creation  of  parallel  processes  and  enforces  synchronization  and 
communication  using  barriers  and  hardware  locks.  CPU  time  and  parallel  speed  up  are 
used  as  performance  measures. 

Example  1.  Consider  the  linear  scalar  differential  equation 

$$ u_t + 2u_x + 2u_y = 0, \quad 0 < x, y < 1, \quad t > 0, \tag{5a} $$

with initial and Dirichlet boundary data specified so that the exact solution is

$$ u(x,y,t) = \tfrac{1}{2}\left[1 - \tanh(100x - 10y - 180t + 10)\right]. \tag{5b} $$

The  solution  (5b)  is  a  relatively  steep  but  smooth  wave  that  moves  at  an  angle  of  45 
degrees  across  the  square  domain  as  time  progresses. 

Adaptive refinement is controlled by using an approximation of the local
discretization error in the $L^1$ norm as a refinement indicator. Exact errors for this scalar
problem are also measured in $L^1$ as

$$ \|e(\cdot,\cdot,t)\|_1 = \iint_\Omega |Pu(x,y,t) - U(x,y,t)| \, dx \, dy, \tag{6} $$

where $U(x,y,t)$ is a piecewise constant representation of the discrete solution and
$Pu(x,y,t)$ is a projection onto the space of piecewise constant functions obtained by
using values at cell centers.
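
For a rectangular mesh the error measure (6) reduces to a weighted sum over cells. The sketch below assumes cell values stored in an array indexed by (x-cell, y-cell) and uses hypothetical function names; the exact solution is (5b).

```python
import numpy as np


def exact_solution(x, y, t):
    """Exact solution (5b) of Example 1."""
    return 0.5 * (1.0 - np.tanh(100.0 * x - 10.0 * y - 180.0 * t + 10.0))


def l1_error(U, t, x_edges, y_edges, exact=exact_solution):
    """Discrete L1 error (6): |exact(cell center) - U| weighted by cell area."""
    xc = 0.5 * (x_edges[:-1] + x_edges[1:])
    yc = 0.5 * (y_edges[:-1] + y_edges[1:])
    X, Y = np.meshgrid(xc, yc, indexing="ij")
    areas = np.outer(np.diff(x_edges), np.diff(y_edges))
    return float(np.sum(np.abs(exact(X, Y, t) - U) * areas))


# A 10 x 10 mesh on the unit square with cell values offset by 0.1 everywhere.
edges = np.linspace(0.0, 1.0, 11)
centers = 0.5 * (edges[:-1] + edges[1:])
Xc, Yc = np.meshgrid(centers, centers, indexing="ij")
U = exact_solution(Xc, Yc, 0.0) + 0.1
err = l1_error(U, 0.0, edges, edges)
```
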

Our first experiment involves the solution of (5) for $0 < t \le 0.35$ on 10 × 10,
25 × 25, and 45 × 45 uniform grids having initial time steps of 0.017, 0.007, and 0.004,
respectively. No spatial refinement was performed and the static and dynamic load
balancing strategies were used. CPU times and parallel speed ups for each base mesh
for the two load balancing techniques are shown in Figure 2. Speed ups with 15
processors and the static load balancing technique (shown in the upper portion of Figure 2)
are in excess of 51, 75, and 87 percent of ideal with the 10 × 10, 25 × 25, and 45 × 45
base meshes, respectively. Speed up increases dramatically as the mesh is made finer
due to smaller data granularity. Similar speed up data for the three base meshes with
the dynamic load balancing technique (shown in the lower portion of Figure 2) are 53,
77, and 90 percent of ideal. The static load balancing strategy takes slightly more time
than the dynamic technique, except in the uniprocessor case where they are identical,
because of load imbalances on the P subdomains due to differences in the times
required to generate initial and boundary data.

Our second experiment involves solving (5) for $0 < t \le 0.35$ on a 10 × 10 base
mesh having an initial time step of 0.017 using dynamic load balancing and adaptive
h-refinement with either binary refinement or M-ary followed by 2-ary refinement.
Refinement tolerances of 0.012, 0.006, and 0.003 were selected. The resulting CPU
times and parallel speed ups for each adaptive strategy are presented in Figure 3.
Maximum speed ups shown in the upper portion of Figure 3 for the binary refinement
strategy are in excess of 82, 86, and 72 percent of ideal for tolerances of 0.012, 0.006,
and 0.003, respectively. Initially, parallel performance improves as the tolerance is
decreased due to the finer data granularity; however, the performance ultimately
degrades due to the serial overhead incurred when managing a more complex data
structure. Maximum speed ups for the more sophisticated M-ary followed by 2-ary
refinement strategy shown in the lower portion of Figure 3 are in excess of 88, 82, and
73 percent of ideal for the three decreasing tolerances. Speed ups for this refinement
strategy are only marginally better than those for the binary refinement technique, but
the CPU times for the M-ary strategy are much less than those for the binary
refinement strategy. For example, CPU times with 15 processors and a tolerance of
0.003 were 226.11 and 182.73 for the binary and M-ary strategies, respectively.
Maintaining a shallow tree has clearly increased performance by reducing the serial
overhead associated with its management.

Speed  up  is  not  an  appropriate  measure  of  the  complexity  required  to  solve  a 
problem  to  a  prescribed  level  of  accuracy.  Tradeoffs  occur  between  the  higher  degree 
of  parallelism  possible  with  a  uniform  mesh  solution  and  the  greater  sequential 
efficiency of an adaptive procedure. In order to gauge the differential, we generated
uniform mesh and adaptive mesh solutions of (5) on various processor configurations
and to varying levels of accuracy for both static serial tree traversal and dynamic load
balancing strategies. Computations on uniform grids ranged from a 5 × 5 mesh to a
45 × 45 mesh. All adaptive computations used a 10 × 10 base mesh, M-ary followed
by 2-ary refinement, and tolerances ranging from 0.012 to 0.003.

Figure 2. CPU time (left) and parallel speed up (right) for Example 1 on
uniform meshes without adaptivity using static (top) and dynamic (bottom)
load balancing.

Figure 3. CPU time (left) and parallel speed up (right) for Example 1 using
dynamic load balancing and adaptive h-refinement with local binary
refinement (top) and M-ary followed by 2-ary refinement (bottom).

Results for the global $L^1$ error as a function of CPU time are presented in Figure
4 for computations performed on 1, 4, 8, and 15 processor systems. Static and
dynamic load balancing strategies are shown in the upper and lower portions of the
figure, respectively. For each strategy, the upper set of curves, displaying non-adaptive
results, is much less efficient and converges at a much slower rate than the adaptive
solutions shown in the lower set of curves. The adaptive solutions are converging at a
rate of approximately 1.4 relative to CPU time while the non-adaptive solutions are
converging at a rate of approximately 0.4. These results demonstrate a strong
preference for adaptive methods for all but the largest tolerances. Note that the CPU times
are identical for the two load balancing strategies when only one processor is used for
both non-adaptive and adaptive solutions because the configuration reduces to that of a
uniprocessor system. Also note that the global $L^1$ error for a particular choice of base
mesh (for non-adaptive methods) or local refinement tolerance (for adaptive methods)
is independent of the number of processors used.
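
A convergence rate relative to CPU time, in the sense used above, can be read off as the negative slope of a straight-line fit on log-log axes. The sketch below uses synthetic data constructed to decay like CPU time to the power -1.4; the numbers are illustrative only and are not measurements from the experiments.

```python
import numpy as np


def convergence_rate(cpu_times, errors):
    """Negative slope of the least-squares line through (log time, log error)."""
    slope, _intercept = np.polyfit(np.log(cpu_times), np.log(errors), 1)
    return -slope


# Synthetic data only: errors proportional to time**(-1.4).
times = np.array([1.0, 2.0, 4.0, 8.0])
errors = 3.0 * times ** (-1.4)
rate = convergence_rate(times, errors)
```
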


3. ELLIPTIC SYSTEMS. With the goal of describing a strategy for solving linear
algebraic systems resulting from the finite element discretization of elliptic systems, let
us consider a two-dimensional linear elliptic problem in m variables having the form

$$ -[\mathbf{D}(x,y)\mathbf{u}_x]_x - [\mathbf{D}(x,y)\mathbf{u}_y]_y + \mathbf{Q}(x,y)\mathbf{u} = \mathbf{f}(x,y), \quad (x,y) \in \Omega, \tag{7a} $$

$$ u_i = c_i^E(x,y), \quad (x,y) \in \partial\Omega_i^E, \qquad (\mathbf{D}\mathbf{u}_\nu)_i = c_i^N(x,y), \quad (x,y) \in \partial\Omega_i^N, \qquad i = 1, 2, \ldots, m. \tag{7b,c} $$

The diffusion $\mathbf{D}$ and source $\mathbf{Q}$ are positive definite and positive semi-definite m × m
matrices, respectively, $\partial\Omega = \partial\Omega_i^E \cup \partial\Omega_i^N$, $i = 1, 2, \ldots, m$, and $\nu$ is a unit outer
normal to $\partial\Omega$.


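The Galerkin form of (7) consists of determining $\mathbf{u} \in H_E^1$ satisfying

$$ A(\mathbf{v},\mathbf{u}) - (\mathbf{v},\mathbf{f}) = \sum_{i=1}^{m} \int_{\partial\Omega_i^N} v_i c_i^N \, ds, \quad \text{for all } \mathbf{v} \in H_0^1, \tag{8a} $$

where

$$ A(\mathbf{v},\mathbf{u}) = \iint_\Omega \left[\mathbf{v}_x^T \mathbf{D}\mathbf{u}_x + \mathbf{v}_y^T \mathbf{D}\mathbf{u}_y + \mathbf{v}^T \mathbf{Q}\mathbf{u}\right] dx \, dy, \qquad (\mathbf{v},\mathbf{u}) = \iint_\Omega \mathbf{v}^T \mathbf{u} \, dx \, dy. \tag{8b,c} $$

As usual, the Sobolev space $H^1$ consists of functions having first partial derivatives in
$L^2$. The subscripts E and 0 further restrict functions to satisfy the essential boundary
conditions (7b) and trivial versions of (7b), respectively. Finite element solutions of
(8) are constructed by approximating $H^1$ by a finite-dimensional subspace $S^{N,p}$ and
determining $\mathbf{U} \in S_E^{N,p}$ such that

$$ A(\mathbf{V},\mathbf{U}) - (\mathbf{V},\mathbf{f}) = \sum_{i=1}^{m} \int_{\partial\Omega_i^N} V_i c_i^N \, ds, \quad \text{for all } \mathbf{V} \in S_0^{N,p}. \tag{9} $$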


Selecting $S^{N,p}$ as a space of continuous piecewise $p$th-degree hierarchical
polynomials [24] with respect to the partition of $\Omega$ into triangular finite elements,
substituting these approximations into (9), and evaluating the integrals by quadrature
rules yields a sparse, symmetric, positive definite, N-dimensional linear system of the form

$$ K X = b, \tag{10} $$

where X is an N-vector of Galerkin coordinates.

Figure 4. Global $L^1$ error as a function of CPU time for Example 1 using
non-adaptive methods (upper set of curves) and adaptive h-refinement
methods (lower set of curves) with static (top) and dynamic (bottom) load
balancing. Computations are shown for systems using 1, 4, 8, and 15
processors (right to left for each set of curves).

Meshes of triangular or quadrilateral elements are created automatically on $\Omega$ by
using the finite quadtree procedure [11]. This structure is somewhat different from the
tree of grids described in Section 2. With this technique, $\Omega$ is embedded in a square
“universe” that may be recursively quartered to create a set of disjoint squares called
quadrants. Data associated with these quadrants are managed by using a hierarchical
tree structure with the original square universe regarded as the root and with smaller
quadrants created by subdivision regarded as offspring of larger ones. Quadrants
intersecting $\partial\Omega$ are recursively quartered until a prescribed spatial resolution of $\Omega$ has
been obtained. At this stage, quadrants that are leaf nodes of the tree and intersect
$\Omega \cup \partial\Omega$ are further divided into small sets of triangular or quadrilateral elements.
Severe mesh gradation is avoided by imposing a maximal one-level difference between
quadrants sharing a common edge. This implies a maximal two-level difference between
quadrants sharing a common vertex. A final “smoothing” of the triangular or
quadrilateral mesh improves element shapes and further reduces mesh gradation near $\partial\Omega$.
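
The recursive quartering of boundary quadrants can be sketched for a simple case. The example below assumes a circular domain boundary and a fixed maximum level; it does not enforce the one-level difference rule, generate elements, or smooth the mesh, and the names `Quadrant`, `crosses_circle`, `subdivide`, and `leaves` are hypothetical.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Quadrant:
    x: float            # lower-left corner
    y: float
    size: float         # edge length
    level: int          # depth in the tree (root = 0)
    children: list = field(default_factory=list)


def crosses_circle(q, cx, cy, r):
    """True if the circle of radius r about (cx, cy) passes through square q."""
    nx = min(max(cx, q.x), q.x + q.size)            # nearest point of the square
    ny = min(max(cy, q.y), q.y + q.size)
    near = math.hypot(nx - cx, ny - cy)
    fx = max(abs(q.x - cx), abs(q.x + q.size - cx))  # farthest corner offsets
    fy = max(abs(q.y - cy), abs(q.y + q.size - cy))
    return near < r < math.hypot(fx, fy)


def subdivide(q, cx, cy, r, max_level):
    """Recursively quarter quadrants that intersect the boundary."""
    if q.level < max_level and crosses_circle(q, cx, cy, r):
        h = q.size / 2.0
        q.children = [Quadrant(q.x + i * h, q.y + j * h, h, q.level + 1)
                      for j in (0, 1) for i in (0, 1)]
        for child in q.children:
            subdivide(child, cx, cy, r, max_level)


def leaves(q):
    """Terminal quadrants of the tree rooted at q."""
    if not q.children:
        return [q]
    return [leaf for child in q.children for leaf in leaves(child)]


root = Quadrant(0.0, 0.0, 1.0, 0)                   # the square "universe"
subdivide(root, 0.5, 0.5, 0.3, max_level=3)
terminal = leaves(root)
```
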

A simple example involving a domain consisting of a rectangle and a quarter
circle, as shown in Figure 5, will illustrate the finite quadtree process. In the upper left
portion of the figure, the square universe containing the problem domain is quartered,
creating the one-level tree structure shown at the upper right. Were this deemed to be
a satisfactory geometrical resolution, a mesh of five triangles could be created. As
shown, the triangular elements are associated with quadrants of the tree structure. In
the lower portion of Figure 5, the quadrant containing the circular arc is quartered and
the resulting quadrant that intersects the circular arc is quartered again to create the
three-level tree shown in the lower right portion of the figure. A triangular mesh
generated on this tree structure is also shown.

Arbitrarily  complex  two-dimensional  domains  may  be  discretized  in  this  manner 
and  generally  produce  unstructured  grids;  however,  the  underlying  tree  of  quadrants 
remains regular. Adaptive mesh refinement is easily accomplished by subdividing
appropriate leaf-node quadrants and generating a new mesh of triangular or
quadrilateral elements locally, thus unifying the mesh generation and adaptive solution
phases of the problem under a common tree data structure.

Preconditioned conjugate gradient (PCG) iteration is an efficient means of solving
the linear algebraic systems (10) that result from the finite element discretization of
self-adjoint elliptic partial differential systems [8]. The key steps in the PCG
procedure [22] involve (i) matrix-vector multiplication of the form

$$ q = K p \tag{11a} $$

and (ii) solving linear systems of the form

$$ \tilde{K} d = r, \tag{11b} $$

where r and p are the residual vector and conjugate search direction, respectively. The
preconditioning matrix $\tilde{K}$ may be selected to reduce computational cost. The
element-by-element (EBE) and symmetric successive over-relaxation (SSOR)
preconditionings are in common use and seem appropriate for use with quadtree-structured
grids. The EBE preconditioning is an approximate factorization of the stiffness matrix
K into a product of elemental matrices. If the grid has been “colored” so as to
segregate non-contiguous elements, then (11b) can be solved in parallel on elements
having the same color. Since the matrix-vector multiplication (11a) can also be
performed in an element-by-element fashion, the entire PCG solution can be done in
parallel on non-contiguous elements. While this simple approach has been used in
several applications [18, 19, 21], we found the SSOR preconditioning to be more
efficient in every instance [13] and, therefore, shall not discuss EBE preconditionings
any further.

Figure 5. Finite quadtree mesh generation for a domain consisting of a
rectangle and a quarter circle. One-level and three-level tree structures and
their associated meshes of triangular elements are shown at the top and
bottom of the figure, respectively. The figure legend distinguishes boundary
quadrants, interior quadrants, exterior quadrants, and finite elements.

SOR and SSOR iteration have been used for the parallel solution of the five-point
difference approximation of Poisson’s equation on rectangular meshes by numbering
the discrete equations and unknowns in “checkerboard” order [1]. With this ordering,
unknowns at “red” mesh points are only coupled to those at “black” mesh points and
vice versa; thus, solutions at all red points can proceed in parallel, followed by a
similar solution at all black points. Preserving symmetry, as with the
SSOR iteration, will make the SOR method a suitable preconditioning for the PCG
method. Adams and Ortega [1] describe multicolor orderings on rectangular grids
using several finite element and finite difference stencils. However, multicolor
orderings for unstructured meshes are more difficult since nodal connectivity and difference
stencils for high-degree polynomial approximations can be quite complex. The
computational effort can be reduced when using quadtree-structured grids by considering
multicolor orderings for block SSOR preconditionings at the quadrant level. To be
specific, partition the stiffness matrix K by quadrants as

$$ K = D - L - L^T \tag{12a} $$

where

$$ D = \begin{bmatrix} K_{1,1} & & & \\ & K_{2,2} & & \\ & & \ddots & \\ & & & K_{Q,Q} \end{bmatrix}, \qquad L = -\begin{bmatrix} 0 & & & \\ K_{2,1} & 0 & & \\ \vdots & & \ddots & \\ K_{Q,1} & K_{Q,2} & \cdots & 0 \end{bmatrix}. $$

Nontrivial entries in a diagonal block $K_{i,i}$ arise from Galerkin coordinates that are
connected through the finite element basis to other unknowns in quadrant i. Nontrivial
contributions to block $K_{i,j}$ of the lower triangular matrix L arise when the support of
the basis associated with a Galerkin coordinate in quadrant i intersects quadrant j.

Using an SSOR preconditioning, the solution of (11b) would be computed according
to the two-step procedure

$$ X^{n+1/2} = \omega\left(L X^{n+1/2} + L^T X^n + r\right) + (1 - \omega) X^n, \tag{13a} $$

$$ X^{n+1} = \omega\left(L^T X^{n+1} + L X^{n+1/2} + r\right) + (1 - \omega) X^{n+1/2}, \quad n = 1, 2, \ldots, M. \tag{13b} $$

Thus, each block SSOR iteration consists of two block SOR steps; one having the
reverse ordering of the other. Typically, M = 3 SSOR steps are performed between
each PCG step.
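
The interplay of the PCG steps (11) with an SSOR preconditioner can be sketched with dense NumPy arrays. This is a simplified illustration: it uses pointwise rather than quadrant-block sweeps, it divides by the diagonal of K explicitly (an assumption about how (13) is scaled), and it starts each preconditioning solve from zero. The function names, the relaxation parameter, and the test matrix are hypothetical.

```python
import numpy as np


def ssor_apply(K, r, omega=1.0, sweeps=3):
    """Approximate the solution d of K d = r by `sweeps` symmetric SOR
    iterations (a forward sweep followed by a backward sweep)."""
    n = len(r)
    d = np.zeros(n)
    for _ in range(sweeps):
        for order in (range(n), range(n - 1, -1, -1)):
            for i in order:
                sigma = K[i] @ d - K[i, i] * d[i]       # off-diagonal part
                d[i] = (1.0 - omega) * d[i] + omega * (r[i] - sigma) / K[i, i]
    return d


def pcg(K, b, precond, tol=1e-10, max_iter=200):
    """Preconditioned conjugate gradients for a symmetric positive definite K."""
    x = np.zeros_like(b)
    r = b - K @ x
    d = precond(r)
    p = d.copy()
    rd = r @ d
    for k in range(max_iter):
        if np.linalg.norm(r) <= tol * np.linalg.norm(b):
            return x, k
        q = K @ p                       # step (11a)
        alpha = rd / (p @ q)
        x += alpha * p
        r -= alpha * q
        d = precond(r)                  # step (11b), solved approximately
        rd_new = r @ d
        p = d + (rd_new / rd) * p
        rd = rd_new
    return x, max_iter


# Small test problem: the 1D Laplacian tridiag(-1, 2, -1).
n = 20
K = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
b = np.ones(n)
x, iterations = pcg(K, b, lambda r: ssor_apply(K, r))
```
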

Suppose that the Q quadrants of a finite quadtree structure are separated into $\gamma$
disjoint sets. Then, using the symmetric $\gamma$-color block SSOR ordering, we would
sweep the quadrants in the order $C_1, C_2, \ldots, C_\gamma, C_\gamma, C_{\gamma-1}, \ldots, C_1$, where $C_i$ is the set
of quadrants having color i. Because quadrants rather than nodes are colored, a node
can be connected to other nodes having the same color. Thus, the forward and
backward SOR sweeps may differ for a color $C_i$, $i = 1, 2, \ldots, \gamma$. During an SOR sweep,
unknowns lying on quadrant boundaries are updated as many times as the number of
quadrants containing them.
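
Given a coloring, the symmetric sweep order can be generated as below. Quadrants within one color set may be processed in parallel, with a synchronization between sets. The function name and the dictionary representation of the coloring are assumptions made for illustration.

```python
def symmetric_sweep(colors):
    """colors maps quadrant -> color index; returns the color sets in the
    order C_1, ..., C_gamma, C_gamma, ..., C_1."""
    groups = {}
    for quadrant, color in colors.items():
        groups.setdefault(color, []).append(quadrant)
    forward = [sorted(groups[c]) for c in sorted(groups)]
    return forward + forward[::-1]


sweep = symmetric_sweep({"q0": 1, "q1": 2, "q2": 1, "q3": 3})
```
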

Coloring  the  regular  quadrants  of  a  finite  quadtree  is  far  simpler  than  coloring  the 
elements  of  a  mesh.  Differences  in  the  small  number  of  elements  within  quadrants 
having  the  same  color  may  cause  some  load  imbalance  and  this  effect  will  have  to  be 
investigated.  Naturally,  coloring  procedures  that  use  the  fewest  colors  increase  data 
granularity  and  reduce  the  need  for  process  synchronization.  At  the  same  time,  the 
cost of the coloring algorithm should not be the dominant computational cost. With
these views in mind, we developed an eight-color procedure that has linear time
complexity [13]. This procedure only required a simple breadth-first traversal of the
quadtree, but performance never exceeded that of the six-color procedure which is
described in the following paragraphs. Four-color procedures are undoubtedly
possible, but we have not formulated any. Their complexity, unlike that of the eight- and
six-color procedures, may be nonlinear.

With  the  aim  of  constructing  a  quadtree  coloring  procedure  using  a  maximum  of 
six  colors,  let  us  define  a  binary  directed  graph  called  a  “quasi-binary  tree”  from  the 
finite quadtree by using the following recursive assertive algorithm.

i. The root of the quadtree corresponds to the root of the quasi-binary tree.

ii. Every terminal quadrant is associated with a node in the quasi-binary tree;
however, not every quasi-binary tree node must correspond to a quadrant.

iii.  In  the  planar  representation  of  the  quadtree,  nodes  across  a  common  horizontal 
edge  are  connected  in  the  quasi-binary  tree. 

iv.  When  a  quadrant  is  divided,  its  parent  node  in  the  quasi-binary  tree  becomes  the 
root  of  a  subtree. 

Planar representations of simple quadtrees and their quasi-binary tree representations
are illustrated in Figure 6. The leftmost quadtree illustrates root-node and
offspring  construction  of  the  quasi-binary  tree.  Connection  of  nodes  across  horizontal 
edges  is  shown  with  and  without  quadrant  division  in  all  three  illustrations.  Subtree 
definitions  according  to  assertion  (iv)  are  shown  in  the  center  and  rightmost  quadtrees. 

From Figure 6, we see that the column-order traversal of a finite quadtree is the
depth-first traversal of its associated quasi-binary tree. Let us define six colors divided
into three sets a, b, and c of two disjoint colors that alternate through the columns in
a column-order traversal of the quadtree. Whenever left and right quasi-binary tree
branches merge, column-order traversal continues using the color set associated with
the left branch. Two of the three color streams, say a and b, are passed to a node of
the quasi-binary tree. At each branching, the color stream a and the third color stream
c are passed to the left offspring while the streams a and b are passed in reverse
order to the right offspring. Additional details and a correctness proof of this
algorithm will appear in [15].

Figure 6. Planar representations of three quadtrees and their associated
quasi-binary trees.

Computational  experiments  of  Benantar  et  al.  [13]  demonstrate  the  excellent 
parallelism  that  may  be  obtained  by  the  six-color  SSOR  PCG  procedure  with  piecewise 
linear  finite  element  approximations.  However,  higher-order  polynomial  bases  create 
additional  possibilities  for  processor  load  imbalance  with  coloring  at  the  quadrant 
level.  Let  us  illustrate  this  with  a  simple  problem.  As  in  Section  2,  a  16-processor 
Sequent  Balance  21000  computer  was  used  for  the  experiment. 

Example 2. Consider the Dirichlet problem

$$u_{xx} + u_{yy} = f(x,y), \quad (x,y) \in \Omega, \qquad (14a)$$

$$u = 0, \quad (x,y) \in \partial\Omega, \qquad (14b)$$

with $\Omega = \{ (x,y) \mid -3 < x, y < 3 \}$. We solved this problem on a 400-element mesh
using piecewise linear, quadratic, and cubic approximations. Adaptive p-refinement
with the polynomial degree p restricted to be 1, 2, or 3 was also performed. Parallel
speed up and processor idle time resulting from the need to synchronize at the
completion of each color are shown in Figure 7.
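
The idle time discussed here arises because every processor must wait at the end of each color for the slowest one. A minimal sketch of that accounting follows. It assumes that each quadrant carries a known work estimate and that the quadrants of one color are handed out greedily to the least-loaded processor; the function name `color_sweep_idle`, the greedy assignment, and the sample work values are illustrative assumptions and not the scheduling used in the reported experiments.

```python
import heapq

def color_sweep_idle(work_by_color, n_procs):
    """Return (elapsed, idle) for a color-by-color sweep with a barrier after each color.

    work_by_color: list of lists; work_by_color[c] holds the work estimate of
                   every quadrant that received color c (hypothetical units).
    n_procs:       number of processors sharing the sweep.
    """
    elapsed = 0.0
    idle = 0.0
    for work in work_by_color:
        loads = [0.0] * n_procs            # per-processor load for this color
        heapq.heapify(loads)
        # Greedy assignment: largest quadrant first, to the least-loaded processor.
        for w in sorted(work, reverse=True):
            heapq.heappush(loads, heapq.heappop(loads) + w)
        finish = max(loads)                # the barrier waits for the slowest processor
        elapsed += finish
        idle += sum(finish - load for load in loads)
    return elapsed, idle

# Uniform work per quadrant versus one expensive (high-degree) quadrant in a color.
uniform = [[1.0] * 8, [1.0] * 8]
mixed = [[1.0] * 7 + [9.0], [1.0] * 8]
print(color_sweep_idle(uniform, 4))
print(color_sweep_idle(mixed, 4))
```

With uniform work the barrier costs nothing, whereas a single expensive quadrant within one color leaves the other processors waiting, which is the kind of imbalance that coloring at the quadrant level can introduce for higher-order bases.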

Parallel performance degrades as polynomial degree increases, with the adaptive
strategy having the poorest performance. Adaptive algorithms typically have serial
logic which limits speed up. Of course, speed up is not the only measure of
complexity and an adaptive solution strategy could require less CPU time to solve the problem
to a given level of accuracy. Nevertheless, additional research is necessary to improve
performance with high-order and adaptive strategies.

Figure 7. Parallel speed up (left) and processor idle time (right) for the
finite element solution of Example 2 using piecewise linear, quadratic, and
cubic approximations as well as adaptive p-refinement.

Using  a  hierarchical  basis,  all  Galerkin  coordinates  for  polynomial  degrees  higher 
than  one  are  associated  with  mesh  points  that  are  either  along  element  edges  or  within 
elements. Thus, the Galerkin coordinates for continuous piecewise linear approximations
are the only ones associated with element vertices. Parallel performance could,
therefore,  be  improved  by  coloring  element  edges  rather  than  quadrants  and  we  have 
designed  a  three-color  procedure  having  linear  time  complexity  to  do  this  [15].  Since 
hierarchical  bases  add  incremental  corrections  as  the  polynomial  degree  is  increased, 
one  could  conceive  an  algorithm  where  quadrant  coloring  is  used  with  the  piecewise 
linear  portion  of  the  approximation  and  edge  coloring  is  used  for  higher-degree 
approximations. 

4. DISCUSSION. High-order and hp-refinement strategies have the highest
convergence rates on serial processors. Successful use of adaptive strategies in parallel
environments depends heavily on the efficient implementation of these procedures on
shared- and distributed-memory computers. The edge coloring procedure alluded to in
Section 3 should provide some improvement over existing strategies on shared-memory
systems, but no procedure is available for using hp-refinement on data-parallel
computers. High-order and hp-refinement techniques are being added to our collection of
methods for solving hyperbolic systems using the finite element methods of Cockburn
and Shu [14]. The p-hierarchical Legendre polynomial basis embedded in these
methods should also furnish error estimates similar to those that we have developed for
parabolic systems [3]. These techniques are far more efficient than Richardson’s
extrapolation.

Our h-refinement procedure for hyperbolic systems could be improved by
beginning each base-mesh time step with an adaptively chosen mesh that utilizes known
nonuniformities in the solution discovered during the previous base-mesh time step.
Processors would still have to be scheduled to balance loads in this case and
procedures for doing this are unavailable. Finally, parallel procedures for distributed
memory systems and procedures for three-dimensional problems are of great interest.
ABSTRACT

This paper gives a methodology for constructing real time acceptors from
examples of their behavior. The technique involves constructing a behavior graph
and factoring it using Hartmanis-Stearns decomposition theory. It provides a
learning mechanism that can find a general representation of a class given
examples of that class.

INTRODUCTION 

This paper describes a method for automatically creating a real time
acceptor P from examples of its behavior B. As an illustration, suppose the
desired acceptor is to accept the strings a!a, aa!aa, ab!ba, ba!ab, ... and reject all
others. The methodology begins with these examples and creates a general P
which will enter a final state if and only if the input is a string from the general
set $\{ w!w^R \mid w \in \{a,b\}^* \}$.

This  research  is  a  project  in  learning  theory  and  attempts  to  find 
methodologies  that  generalize  from  examples  to  find  a  representation  of  the  total 
class.  The  paper  defines  the  concepts  of  data  structure  and  control  structure  for  a 
real  time  acceptor  and  then  shows  that  the  behavior  graph  for  the  acceptor  is  the 
product  of  the  data  structure  and  control  structure  graphs.  Synthesis  is  done  by 
creating  the  behavior  graph  from  the  examples  and  factoring  it  into  two  graphs, 
the  data  and  control  structures  for  the  acceptor. 

REAL  TIME  ACCEPTORS 

A data structure is defined to be a Moore type deterministic state machine

$$D = (S_D, \Sigma_D, O_D, \delta_D, d_0, \lambda_D)$$

where

• $S_D$ is a finite or infinite set of states.

• $\Sigma_D$ is a finite set of symbols called the input alphabet of D, or the instruction set
of D.

• $\delta_D : S_D \times \Sigma_D \to S_D$ is a mapping by which D changes states, called the
transition function of D.

• $d_0 \in S_D$ is the initial state of D.

• $O_D$ is a finite set of output symbols that we interpret as the test values on the
states of D.

• $\lambda_D : S_D \to O_D$ is a mapping that assigns to each state of D a test value from $O_D$.

Conceptually,  a  data  structure  has  a  state  and  receives  instructions  from 
the  outside  world  which  modify  its  state.  It  is  also  capable  of  yielding  values 
which  depend  on  its  state.  For  example,  a  pushdown  stack  might  begin  in  the 
empty state and then receive sequentially the inputs a and b. Its output would
then  be  the  item  at  the  top  of  the  stack,  b  in  this  case.  The  state  graph  of  Figure  1 
shows  the  initial  part  of  the  infinite  graph  for  a  stack. 


Figure  1.  The  stack  data  structure. 


A control structure is a finite deterministic state machine

$$C = (S_C, \Sigma_C, O_C, \delta_C, c_0, \lambda_C, F_C)$$

where

• $S_C$ is a finite nonempty set of states.

• $\Sigma_C$ is a finite set of symbols called the input alphabet to C. $\Sigma_C$ is the cross
product of two alphabets: the finite nonempty input alphabet from the outside
world, $\Sigma$, and the output alphabet of the data structure (DS). So we have

$$\Sigma_C = \Sigma \times O_D$$

• $O_C$ is a set of instructions that can be issued to D. So $O_C = \Sigma_D$.

• $\delta_C : S_C \times \Sigma_C \to S_C$ is a mapping by which C changes states, called the transition
function of C.

• $c_0$ is the initial state of C.

• $\lambda_C : S_C \times \Sigma_C \to \Sigma_D$ is a mapping that assigns to each transition of C an
instruction to D, called the instruction assignment function.

• $F_C \subseteq S_C$ is a set of final states of C.

The task of the control structure is to receive inputs from the outside world,
make modifications to the data structure, and yield an output indicating
acceptance or rejection of the string. A real time acceptor P operates as shown in
Figure 2 and follows three rules. Suppose the acceptor P has its control structure
C in state c, its data structure D in state d, and receives input a.

1. C changes states from c to some state c' using its transition function $\delta_C$. The
next state depends on a and $\lambda_D(d)$, so we have

$$\delta_C(c, (a, \lambda_D(d))) = c'$$

2. While C is changing states, it sends an instruction to the DS. The instruction
to be sent depends on the current state of C, the input symbol, and the test
value of the current state of D and is given by

$$\lambda_C(c, (a, \lambda_D(d))) = I$$

3. On receiving the instruction, D will change states from the current state, d,
to some state d' using its transition function

$$\delta_D(d, I) = d'$$

FIGURE 2. The acceptor with its control structure C
and data structure D.

An  example  control  structure  which  employs  the  stack  data  structure  and 
accepts the set described above, $w!w^R$, is shown in Figure 3. Its operation is easy
to  follow.  It  begins  in  the  top  center  state  with  its  associated  stack  empty  and 
moves  left  or  right  depending  on  whether  the  first  symbol  is  an  a  or  b.  For 
example,  on  the  string  ab!ba,  it  moves  left  without  changing  the  stack  after  the 
first  symbol  a.  When  the  b  arrives,  it  pushes  that  on  the  stack  without  changing 
its  control  state.  On  input !,  it  transitions  down  one  state,  and  on  b  and  a,  it  pops 
the  stack  and  then  (with  the  stack  empty)  moves  to  the  final  state  at  the  center. 
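
The three operating rules can be sketched in a few lines of Python. The controller below is a hypothetical stand-in written to mimic the behavior described for Figure 3 (the first symbol is remembered in the control state, later symbols go on the stack); its state names, the instruction encoding, and the decision to accept the single string "!" (the case where w is empty) are assumptions and are not taken from the figure.

```python
EMPTY = "empty"
NOP, POP = ("nop",), ("pop",)

def lam_D(d):
    """Test value of stack state d: the top symbol, or EMPTY."""
    return d[-1] if d else EMPTY

def delta_D(d, instr):
    """Stack transition under an instruction issued by the controller."""
    if instr[0] == "push":
        return d + (instr[1],)
    if instr == POP:
        return d[:-1]
    return d  # NOP leaves the stack unchanged

def control(c, a, v):
    """Return (next control state, instruction) for state c, input a, test value v."""
    if c == "start":
        if a in ("a", "b"):
            return ("read", a), NOP      # first symbol is kept in the control state
        if a == "!":
            return "final", NOP          # assumption: w may be the empty string
    elif c[0] == "read":
        if a in ("a", "b"):
            return c, ("push", a)
        if a == "!":
            return ("match", c[1]), NOP
    elif c[0] == "match":
        if v == EMPTY:
            if a == c[1]:
                return "final", NOP      # last symbol matches the remembered first one
        elif a == v:
            return c, POP
    return "dead", NOP                   # any other combination rejects

def accepts(string):
    """Run the acceptor in real time: one control and one data step per input symbol."""
    c, d = "start", ()
    for a in string:
        c_next, instr = control(c, a, lam_D(d))   # rules 1 and 2
        d = delta_D(d, instr)                     # rule 3
        c = c_next
    return c == "final"

for s in ["a!a", "aa!aa", "ab!ba", "ba!ab", "ab!ab"]:
    print(s, accepts(s))
```

Because the controller only sees the test value of the stack before the instruction is applied, remembering the first symbol in the control state lets it recognize the last input symbol exactly when the stack is already empty.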


FIGURE  3.  Control  structure. 


SYNTHESIS 

A  graph  cross  product  operation  will  be  defined  in  this  section  for  the 
control  structure  and  data  structure  graphs.  Then  we  will  observe  that  the 
behavior  graph  has  the  same  structure  as  this  cross  product  graph.  The 
synthesis  procedure  thus  involves  creating  the  behavior  graph  and  then  factoring 
it  into  the  appropriate  control  and  data  structures. 

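Let control structure C and data structure D be

$$C = (S_C, \Sigma_C, O_C, \delta_C, c_0, \lambda_C, F_C)$$

and

$$D = (S_D, \Sigma_D, O_D, \delta_D, d_0, \lambda_D)$$

The product C×D is defined to be the Moore type state machine

$$M_{C \times D} = (S_{C \times D}, \Sigma, \delta_{C \times D}, p_0, F_{C \times D})$$

where

• $S_{C \times D} \subseteq S_C \times S_D$.

• $\forall a \in \Sigma$, $\delta_{C \times D}((c,d), a) = (\delta_C(c, (a, \lambda_D(d))), \delta_D(d, I))$, where $I = \lambda_C(c, (a, \lambda_D(d)))$.

• $p_0 = (c_0, d_0)$.

• $(c,d) \in F_{C \times D} \iff c \in F_C$.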

For  example,  if  the  graphs  of  Figures  1  and  3  are  combined  using  this  cross 
product operation, the graph of Figure 4 results. Figure 4 contains no control or
data  structure  information;  it  is  simply  a  graph  of  all  possible  behaviors  for  the 
automaton  with  Figures  1  and  3  as  its  data  and  control  structures. 
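
The product construction can be explored mechanically. The sketch below builds the part of the product graph reachable within a bounded number of inputs, since the stack makes the full graph infinite. The controller is the same hypothetical stand-in for Figure 3 described by the prose (first symbol kept in the control state); the names `product_graph` and `max_depth` and the encoding of states are assumptions made for illustration.

```python
from collections import deque

EMPTY, NOP, POP = "empty", ("nop",), ("pop",)

def lam_D(d):
    """Test value of stack state d."""
    return d[-1] if d else EMPTY

def delta_D(d, instr):
    """Stack transition under an instruction."""
    if instr[0] == "push":
        return d + (instr[1],)
    return d[:-1] if instr == POP else d

def control(c, a, v):
    """Hypothetical controller for w!w^R: returns (next state, instruction)."""
    if c == "start":
        if a in ("a", "b"):
            return ("read", a), NOP
        if a == "!":
            return "final", NOP
    elif c[0] == "read":
        if a in ("a", "b"):
            return c, ("push", a)
        if a == "!":
            return ("match", c[1]), NOP
    elif c[0] == "match":
        if v == EMPTY:
            if a == c[1]:
                return "final", NOP
        elif a == v:
            return c, POP
    return "dead", NOP

def product_graph(c0, d0, alphabet, max_depth):
    """Breadth-first construction of the reachable part of C x D up to max_depth inputs."""
    nodes, edges = {(c0, d0)}, {}
    frontier = deque([((c0, d0), 0)])
    while frontier:
        (c, d), depth = frontier.popleft()
        if depth == max_depth:
            continue
        for a in alphabet:
            c_next, instr = control(c, a, lam_D(d))
            node = (c_next, delta_D(d, instr))     # product transition on input a
            edges[((c, d), a)] = node
            if node not in nodes:
                nodes.add(node)
                frontier.append((node, depth + 1))
    return nodes, edges

nodes, edges = product_graph("start", (), "ab!", 3)
print(len(nodes), "product states reachable within 3 inputs")
```

Each node is a pair (c, d), and each edge is labeled only by the outside input, so the resulting graph records behavior without exposing which part of a node came from the controller and which from the data structure.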
Figure 4. The cross product graph $M_{C \times D}$.

All of this is summarized in the following theorem where L(X) is defined to
be the language of X.

Theorem 1. $L(B) = L(M_{C \times D})$

The proof appears in Fahmy [88].

The next question asks under what conditions such a decomposition of the
behavior graph can occur. The answer is given by an extension to Hartmanis-Stearns
decomposition theory (Hartmanis and Stearns [66]).

First some definitions must be made. Let b=(c,d) represent the behavior
graph node b that comes from composing control structure state c and data
structure state d in the cross product operation of C×D. If $b_1 = (c_1, d_1)$ and $b_2 = (c_2, d_2)$,
then four partitions $\pi_C$, $\pi_D$, $\rho_{CD}$, and $\rho_{DC}$ over the set of states of B can be defined.

$b_1 \equiv b_2 \ (\pi_C)$ if and only if $c_1 = c_2$.

$b_1 \equiv b_2 \ (\pi_D)$ if and only if $d_1 = d_2$.

$b_1 \equiv b_2 \ (\rho_{CD})$ if and only if $\forall a \in \Sigma \ \forall v \in O_D$, $\lambda_C(c_1, (a,v)) = \lambda_C(c_2, (a,v))$.

$b_1 \equiv b_2 \ (\rho_{DC})$ if and only if $\lambda_D(d_1) = \lambda_D(d_2)$.

The decomposition theorem is as follows:

Theorem 2. Given a Behavior Graph, B, four partitions $\pi_C$, $\pi_D$, $\rho_{CD}$ and $\rho_{DC}$ over
the set of states of B, and a data structure D', there exists a program
P = (C, D') such that L(P) = L(B) and the four partitions are the
associated partitions of program P over the BG B if and only if

1. $\pi_C \cdot \rho_{DC} \le M(\pi_C)$

2. $\pi_D \cdot \rho_{CD} \le M(\pi_D)$

3. $\pi_C \le \rho_{CD}$

4. $\pi_D \le \rho_{DC}$

5. $\pi_C \cdot \pi_D = 0$

6. The number of blocks of $\pi_C$ is finite.

7. $\pi_C \le \pi_F$.

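8. The machine

$$D = (S_D, \Sigma_D, O_D, \delta_D, d_0, \lambda_D)$$

defined by

• $S_D = \pi_D$.

• $\Sigma_D = \rho_{CD} \times \rho_{DC} \times \Sigma$.

• $O_D = \rho_{DC}$.

• $d_0 = P_{\pi_D}(b_0)$.

• $\lambda_D(P_{\pi_D}) = P_{\rho_{DC}}[P_{\pi_D}]$.

• $\delta_D(P_{\pi_D}(b), (P_{\rho_{CD}}(b), P_{\rho_{DC}}(b), a)) = P_{\pi_D}(\delta(b,a))$.

is isomorphic to a submachine of D'.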
The $M(\pi)$ operator is the Hartmanis-Stearns maximum sum operation over
the partitions $\pi'$ such that $(\pi', \pi)$ is a partition pair. The proof of the theorem and
considerable elaboration appear in Fahmy [88].
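
The partition operations that appear in the theorem are easy to compute on a finite portion of a behavior graph. The sketch below implements the standard partition product (nonempty block intersections) and the refinement order, and checks them on four toy states. It reads conditions 4 and 5 as "the data-state partition refines the test-value partition" and "the product of the control and data partitions is the zero partition"; that reading, the helper names, and the toy states are assumptions for illustration only.

```python
def partition_by(states, key):
    """Partition `states` into blocks of states that share the same key(state)."""
    blocks = {}
    for s in states:
        blocks.setdefault(key(s), set()).add(s)
    return [frozenset(b) for b in blocks.values()]

def partition_product(p1, p2):
    """Partition product: the nonempty pairwise intersections of blocks."""
    return [b1 & b2 for b1 in p1 for b2 in p2 if b1 & b2]

def refines(p1, p2):
    """True when p1 <= p2, i.e. every block of p1 lies inside some block of p2."""
    return all(any(b1 <= b2 for b2 in p2) for b1 in p1)

def is_zero(p):
    """The zero partition places every state in a block of its own."""
    return all(len(b) == 1 for b in p)

# Toy behavior-graph states written as (control state, data state) pairs.
states = [("c0", "d0"), ("c0", "d1"), ("c1", "d0"), ("c1", "d1")]
test_value = {"d0": "empty", "d1": "nonempty"}      # hypothetical test values

pi_C = partition_by(states, lambda b: b[0])
pi_D = partition_by(states, lambda b: b[1])
rho_DC = partition_by(states, lambda b: test_value[b[1]])

print(is_zero(partition_product(pi_C, pi_D)))   # control and data state identify a node
print(refines(pi_D, rho_DC))                    # same data state implies same test value
```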

The  synthesis  procedure  is  derived  from  the  decomposition  theorem  and 
follows  these  steps: 

1.  Construct  as  much  of  the  behavior  graph  as  possible  from  the  given 
examples. 

2.  Select  a  proposed  data  structure  for  the  computation. 

3. Construct a partition $\pi_C$ on the states of B that will define the state
transitions for the target controller.

4. Attempt to complete the factorization by building $\pi_D$ which meets the
constraints of Theorem 2.

5. Repeat steps 3 and 4 until a well-formed factorization is found. The partitions $\pi_C$
and $\pi_D$ then define the state assignments and transitions for the controller
and data structure.
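
Step 1 of the procedure can be illustrated with a prefix tree built from the accepted example strings. Treating the partial behavior graph as a prefix tree is an assumption made here for the sake of a concrete sketch; the function names and the integer node labels are likewise hypothetical, and the factoring steps 2 to 5 are not shown.

```python
def build_behavior_graph(positive_examples):
    """Build a prefix-tree approximation of the behavior graph.

    Nodes are integers (0 is the initial state); edges[(node, symbol)] gives the
    successor node; final holds the nodes reached by a complete accepted example.
    """
    edges, final, next_id = {}, set(), 1
    for string in positive_examples:
        node = 0
        for symbol in string:
            if (node, symbol) not in edges:
                edges[(node, symbol)] = next_id   # new behavior not seen before
                next_id += 1
            node = edges[(node, symbol)]
        final.add(node)
    return edges, final

def accepted(edges, final, string):
    """Follow the string through the partial graph; unknown behavior is rejected."""
    node = 0
    for symbol in string:
        node = edges.get((node, symbol))
        if node is None:
            return False
    return node in final

examples = ["a!a", "aa!aa", "ab!ba", "ba!ab"]
edges, final = build_behavior_graph(examples)
print(len(edges), "edges,", len(final), "final nodes")
```

Such a graph accepts exactly the examples it was given; the generalization to the whole class comes from the subsequent factoring into control and data structure partitions.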

Fahmy  [88]  gives  a  convergence  theorem  for  this  learning  methodology  and 
a number of examples of its usage. A programmed version of this approach
was able to construct controllers of the complexity shown in Figure 3 from behaviors
such as the one illustrated in Figure 4 in a few seconds.
Abstract

An  Automatic  Deduction  System  for  NonMonotonic  Reasoning  in  the  Modal  Quantificational 
Logic  Z  is  discussed.  This  system  can  deduce  nonmonotonic  consequences  specified  by 
several  other  theories  of  nonmonotonic  reasoning,  and  in  the  variable  free  case 
provides  a  decision  procedure  for  each  of  these  theories.  These  theories  include 
Moore’s  autoepistemic  logic,  nonconstructive  default  logic,  Reiter’s  default  logic, 

McCarthy’s  Parallel  Circumscription  with  both  fixed  and  variable  predicates,  and  the 
closed  world  assumption.  The  computational  properties  of  these  theories  are  compared. 

1.  Introduction 

A  computer  program  which  can  prove  theorems  in  several  different  theories  of  nonmonotonic 
reasoning  is  described.  This  is  done  by  implementing  a  deduction  system  for  the  modal 
quantificational logic Z [Brown86a], by showing how every (finite) set of nonlogical
axioms and defaults in any of these theories is represented by a sentence of Z which
has essentially the same meaning [Brown89a], and by using an automatic deduction system
to  deduce  the  appropriate  consequences  of  the  representation  in  Z  of  such  axiom  sets  of 
these  theories  of  nonmonotonic  reasoning.  The  proof  procedures  are  briefly  introduced 
in  section  2.  Discussions  of  the  solution  methods  and  traces  of  example  proofs  are 
given  in  section  3.  The  results  obtained  are  given  in  section  4.  The  implementation  is 
discussed  in  section  5  and  some  conclusions  are  drawn  in  section  6. 


2.  The  Proof  Procedure 

The modal quantificational logic Z consists of the following symbols: falsity: F,
truth: T, and: ∧, or: ∨, forall: ∀, for some: ∃, not: ¬, necessary: [], synonymous: =,
and the following defined symbols:

(α → β)      =df ((¬α)∨β)                          α implies β
(α ↔ β)      =df ((α→β)∧(β→α))                     α iff β
(<> α)       =df (¬[](¬α))                         α is logically possible
(α ≡ β)      =df ([](α ↔ β))                       α is synonymous to β
([β]α)       =df ([](β → α))                       β entails α
(<β>α)       =df (<>(β ∧ α))                       α is possible with β
(WORLD α)    =df ((<>α)∧(∀γ(([α]γ)∨([α](¬γ)))))    for γ not in α; α is a world
(GEN α)      =df (∨i=1,n (∃x1i...xmi(α≡(πi x1i...xmi))))
                 where all the predicates are π1...πn; α is a generator
(δ1[=]δ2)    =df ([](δ1=δ2))                       δ1 necessarily equals δ2


The  axioms  and  inference  rules  of  Z  include  those  of  first  order  logic  [Mendelson] 
plus  the  following  inference  rules  and  axioms  about  necessity  []: 


R0: from α infer ([] α)
A1: (([]p)→p)
A2: (([p]q)→(([]p)→([]q)))
A3: (([]p)∨([]¬[]p))
A4: ((∀w((WORLD w)→([w]p)))→([]p))
A5: (WORLD(∀p((GEN p)→(p↔[]α)))) for every expression α
A6: (¬((π1 x1...xn)≡(π2 y1...ym)))
    where π1 and π2 are different predicates.
A7: (((π x1...xn)≡(π y1...yn))↔(T∧(x1[=]y1)∧...∧(xn[=]yn)))

A8: (¬((φ1 x1...xn)[=](φ2 y1...ym)))
    where φ1 and φ2 are different functions
A9: (((φ x1...xn)[=](φ y1...yn))↔(T∧(x1[=]y1)∧...∧(xn[=]yn)))

The laws R0, A1, A2 and A3 constitute an S5 modal logic which with the nonmodal laws is
similar to [Carnap46,Bressan]. A4 says that a proposition is logically necessary if it
is entailed by every world proposition. Laws A5, A6, and A7 axiomatize the predicates.
A5 is the key axiom which says that any exhaustive conjunction of negated or unnegated
distinct generators is a world if there is a sentence α expressible in the formal
language of Z which holds when p is an unnegated generator of that conjunction. It
extends the A5 axiom used in [Brown86a] to handling quantifiers over arbitrary domains.
Laws A8 and A9 axiomatize the functions. The axiom scheme A5, which has a recursively
enumerable number of instances, expresses what is logically possible to the extent that
it can be so expressed. What is possible with respect to a knowledgebase K is expressed
by the defined symbol <K>p which means K and p together are logically possible. For
example, the sentence (<>(¬((∃x(P x))→(∀x(P x))))) can be derived from axioms A5 and A6 as
follows. Assuming P is one of the predicates and letting α be (p≡(P A)), axiom A5 of Z
becomes: (WORLD(∀p(((∃x(p≡(P x)))∨β)→(p↔[](p≡(P A)))))) where β is the rest of the GEN
definition. This gives: (WORLD((∀x((P x)↔([]((P x)≡(P A)))))∧(∀p(β→(p↔[](p≡(P A)))))))
which by axiom A6 and since worlds are logically possible implies (<>(∀x((P x)↔[](x[=]A))))
which implies: (<>((P A)∧(¬(P B)))) for a function B and hence (<>(¬((∃x(P x))→(∀x(P x))))).

Nonmonotonic  reasoning  is  performed  in  Z  by  solving  equivalences.  Generally,  some 
variable  such  as  K  represents  the  meaning  of  the  knowledgebase  which  is  then 
axiomatized  by  asserting  that  K  is  synonymous  to  the  conjunction  of  all  the  axioms  of 
the  theory  and  all  the  reflective  statements  such  as  any  defaults.  Thus  the  reflective 
equivalence: K≡α where K may occur within α, axiomatizes the knowledgebase K. Any
equation of the form K≡β where K does not occur in β and which implies K≡α, is a
solution to the original equivalence.

3.  Examples 

Two  partial  traces  of  example  deductions  performed  by  our  automatic  deduction  system 
are  listed  below.  The  traces  are  intermixed  with  a  discussion  of  some  of  the  derived 
rules  of  inference  of  Z  which  were  used.  The  deductions  are  carried  out  by  Symeval-like 
techniques  [Brown86b]  of  replacing  expressions  by  logically  equivalent  expressions  in 
an  inner  to  outer  manner  similar  to  LISP  evaluation.  Each  application  shows  the  name  of 
the rule, the lexical level, the input to the rule: >, the intermediate output of the
rule: -, and the output of the rule after evaluation: <. The number before the lexical
level in some rules is useful only when more than one output is produced and can be
ignored in this example, as these rules, when successfully applied, use the cut! symbol to
delete any alternative rules that might also be applicable. The first example is a
propositional reflective equation involving two defaults which states that if a is
possible then not b and that if b is possible then not a:

INPUT: (≡ k(∧(→(<k> a)(¬ b))(→(<k> b)(¬ a))))

The first implication is rewritten into a disjunction as follows:

<k>def 1>.(<k> a)
1 1<.(<> (∧ k a))
→def 1>.(→ (<> (∧ k a)) (¬ b))
¬<> 2>..(¬ (<> (∧ k a)))
¬∧demorgan 3>...(¬ (∧ k a))
5 3<...(∨ (¬ k) (¬ a))
4 2<..([] (∨ (¬ k) (¬ a)))
3 1<.(∨ ([] (∨ (¬ k) (¬ a))) (¬ b))

The second implication is also written as a disjunction and then the rules about ≡ are
applied, which causes case analysis on the expression ([](∨(¬ k)(¬ a))):

≡nm[]0 1>.(≡ k(∧(∨([](∨(¬ k)(¬ a)))(¬ b))(∨([](∨(¬ k)(¬ b)))(¬ a))))
1 1-..(cut!(let((+exp(∧(∨ $t(¬ b))(∨([](∨(¬ k)(¬ b)))(¬ a))))
                (-exp(∧(∨ nil(¬ b))(∨([](∨(¬ k)(¬ b)))(¬ a))))
                (r(¬(∨(¬ a)))))
        (∨(if(free? k +exp)(∧(≡ k +exp)(¬(<>(∧ k r))))
              (∧(≡ k +exp)(¬(<>(∧ +exp r)))))
          (if(free? k -exp)(∧(≡ k -exp)(<>(∧ k r)))
              (∧(≡ k -exp)(<>(∧ -exp r)))))))

+exp and -exp are the right-hand side of the equation when the default is or is not
the case respectively. The other modal expression ([](∨(¬ k)(¬ b))) is now analyzed in
each of the first two cases. The +exp branch is analyzed first:

≡nm[]0 2>..(≡ k(∨([](∨(¬ k)(¬ b)))(¬ a)))
2 2-...(cut!(let((+exp(∨ $t(¬ a)))(-exp(∨ nil(¬ a)))(r(¬(∨(¬ b)))))
         (∨(if(free? k +exp)(∧(≡ k +exp)(¬(<>(∧ k r))))
               (∧(≡ k +exp)(¬(<>(∧ +exp r)))))
           (if(free? k -exp)(∧(≡ k -exp)(<>(∧ k r)))
               (∧(≡ k -exp)(<>(∧ -exp r)))))))
2 2<..(≡ k(¬ a))
∧≡sub 2>..(∧(≡ k(¬ a))(¬(<>(∧ k a))))
3 2-...(cut!(∧(≡ k(¬ a))(¬(<>(∧(¬ a) a)))))
3 2<..(≡ k(¬ a))

The -exp branch is processed similarly, contributing another alternative to the final
two solutions of the original equation:

≡nm[]0 2>..(≡ k(∧(¬ b)(∨([](∨(¬ k)(¬ b)))(¬ a))))
4 2<..(≡ k(¬ b))
2>..(∧(≡ k(¬ b))(<>(∧ k a)))
5 2<..(≡ k(¬ b))
1 1<.(∨(≡ k(¬ a))(≡ k(¬ b)))

OUTPUT: (∨(≡ k(¬ a))(≡ k(¬ b)))

TIME: 17.294 secs., CLOSURES: 61 els.

The  case  analysis  technique  used  in  the  above  example  can  be  generalized  to  provide  a 
decision  procedure  for  computing  all  the  fixed  points  of  any  variable  free  nonmonotonic 
equation in Z as given by Theorem Z5 [Brown89a] below. We define a group of modalized
subexpressions to be a propositional combination of expressions each beginning with a
modal symbol. For example, ([]A) and (([]A)∧((<>B)∨(<>C))) are both groups whereas
((<>A)→B) is not.

Theorem  Z5:  Decidability  of  variable  free  nonmonotonic  equations  of  Z: 

The fixed points of any nonmonotonic equation: K≡(f K) containing no variables
(except K) are decidable if every K in (f K) occurs within the scope of a modal symbol.

There are at most 2^n solutions to such an equation where n is the number of groups of
modalized subexpressions in (f K).
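
The bound of 2^n solutions suggests a direct, if naive, way to see the case analysis at work: try every truth assignment to the modal groups, compute the candidate K that results, and keep the assignments that K itself confirms. The sketch below does this semantically for the first example, representing K by the set of truth assignments to the atoms a and b that satisfy it. It is an illustration under the assumption that logical possibility coincides with propositional satisfiability over independent atoms; it is not the rewriting-based deduction system described in the text, and all names in it are hypothetical.

```python
from itertools import product

ATOMS = ("a", "b")
WORLDS = [dict(zip(ATOMS, bits)) for bits in product((False, True), repeat=len(ATOMS))]

def solve(rhs, groups):
    """Enumerate truth assignments to the modal groups and keep the consistent ones.

    rhs(world, g):  truth value of the right-hand side in `world` when the modal
                    groups take the truth values in the tuple g.
    groups[i](k):   actual truth value of modal group i when K denotes the set of
                    worlds k (a frozenset of indices into WORLDS).
    """
    solutions = []
    for g in product((False, True), repeat=len(groups)):
        k = frozenset(i for i, w in enumerate(WORLDS) if rhs(w, g))
        if all(group(k) == value for group, value in zip(groups, g)):
            solutions.append(k)
    return solutions

def possible_with(k, atom):
    """<K>atom: some world satisfies both K and the atom."""
    return any(WORLDS[i][atom] for i in k)

# K ≡ ((<K>a → ¬b) ∧ (<K>b → ¬a)), the two-default example of Section 3.
groups = [lambda k: possible_with(k, "a"), lambda k: possible_with(k, "b")]

def rhs(w, g):
    return ((not g[0]) or (not w["b"])) and ((not g[1]) or (not w["a"]))

for k in solve(rhs, groups):
    print(sorted((WORLDS[i]["a"], WORLDS[i]["b"]) for i in k))
```

Of the four assignments to the two groups, two are self-consistent: one yields the worlds in which a is false and the other the worlds in which b is false, in agreement with the two alternatives reported in the OUTPUT line of the trace.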

Many  equations  with  universal  and  existential  quantifiers  can  also  be  solved  by  the 
equation  solver.  This  is  done  by  splitting  the  quantified  expression  into  those 
instances  of  it  which  are  true  in  the  theory,  and  those  instances  of  it  which  are  false 
in the theory. This is done in the following example which states that Tweety (i.e. tw)
is  a  bird  and  that  if  it  is  possible  for  birds  to  fly  then  they  do  so: 

INPUT: (≡ k(∧(bird tw)(∀(x)(→(∧(bird x)(<k>(fly x)))(fly x)))))

∧division 1>.(∧(bird tw)(∀(x)(∨([](∨(¬ k)(¬(fly x))))(¬(bird x))(fly x))))
1 1-..(cut!(let((y(make-symbol "Y")))
        (∧(bird tw)
          (∀ nil(∨([](∨(¬ k)(¬(fly tw))))(¬(bird tw))(fly tw)))
          (∀(y)(→(¬([=] y tw))
                 (∨([](∨(¬ k)(¬(fly y))))
                   (¬(bird y))(fly y)))))))

This  rule  factors  out  of  the  quantified  default  an  instance  of  it  dealing  with  the 
particular  case  of  Tweety  whose  being  a  bird  appears  in  the  conjunction. 

1 1<.(∧(bird tw)(∨([](∨(¬ k)(¬(fly tw))))(fly tw))
       (∀(y)(∨([=] y tw)([](∨(¬ k)(¬(fly y))))
              (¬(bird y))(fly y))))

Next,  the  case  analysis  rule  is  applied  over  the  unquantified  default: 

≡nm[]0 1>.(≡ k(∧(bird tw)(∨([](∨(¬ k)(¬(fly tw))))(fly tw))
              (∀(y)(∨([=] y tw)([](∨(¬ k)(¬(fly y))))
                     (¬(bird y))(fly y)))))
2 1-..(cut!(let((+exp(∧(bird tw)(∨ $t(fly tw))
                      (∀(y)(∨([=] y tw)([](∨(¬ k)(¬(fly y))))
                             (¬(bird y))(fly y)))))
                (-exp(∧(bird tw)(∨ nil(fly tw))
                      (∀(y)(∨([=] y tw)([](∨(¬ k)(¬(fly y))))
                             (¬(bird y))(fly y)))))
                (r(¬(∨(¬(fly tw))))))
        (∨(if(free? k +exp)(∧(≡ k +exp)(¬(<>(∧ k r))))
              (∧(≡ k +exp)(¬(<>(∧ +exp r)))))
          (if(free? k -exp)(∧(≡ k -exp)(<>(∧ k r)))
              (∧(≡ k -exp)(<>(∧ -exp r)))))))

Working over the branch corresponding to +exp, the system tries to eliminate
the quantified default:

≡k[]elim 2>..(≡ k(∧(bird tw)(∀(y)(∨([=] y tw)([](∨(¬ k)(¬(fly y))))
                                  (¬(bird y))(fly y)))))
3 2-...(define qexp(∀(y)(∨([=] y tw)(¬(bird y))(fly y))))
3 2-...(if(eq?(k-elim k
                (∧(bird tw) qexp
                  (∀(y)(∨@(extract modalized?
                            (([=] y tw)(¬(bird y))(fly y)))
                          (¬(∨(¬(fly y)))))))) $t)
          (cut!(≡ k(∧(bird tw) qexp)))
          $fail)

The elimination rule works in the following way: first it assumes that negation is
factored out of ([](∨(¬ K)(¬(fly y)))) giving (¬(<>(∧ K(fly y)))). Then K is
replaced by an expression stronger than itself and its possibility is tested. If
it is possible, as in the present case, then the conjunct is eliminated, leaving an
expression free of defaults; otherwise it fails:

3 2<..(≡ k(∧(bird tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))

Since this is a solution, it is substituted back into the original possibility test:

∧≡sub 2>..(∧(≡ k(∧(bird tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))
            (¬(<>(∧ k(fly tw)))))

The new possibility expression is sent to a heuristic procedure that implements
axiom A5 of Z Modal Logic based on theorem ZP1 of [Brown89a] which only succeeds if
it can build a world in which the body is the case:

<>axiom5 3>...(<>(∧(bird tw)(fly tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))
5 3-....(<a5>(∧(bird tw)(fly tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))
5 3<...$t

Since  it  is  possible,  its  negation  produces  $f  which  in  turn  prunes  the  branch 
corresponding  to  this  case. 

4 2<..nil

A similar treatment of the default is successfully applied over the other branch:

2>..(≡ k(∧(bird tw)(fly tw)
          (∀(y)(∨([=] y tw)([](∨(¬ k)(¬(fly y))))
                 (¬(bird y))(fly y)))))
6 2<..(≡ k(∧(bird tw)(fly tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))

Substituting over the possibility test qualifying this case we get:

2>..(∧(≡ k(∧(bird tw)(fly tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))
      (<>(∧ k(fly tw))))
<>axiom5 3>...(<>(∧(bird tw)(fly tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))
8 3<...$t
7 2<..(≡ k(∧(bird tw)(fly tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))

This branch is then returned as the only solution of the original problem:

2 1<.(≡ k(∧(bird tw)(fly tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))

OUTPUT: (≡ k(∧(bird tw)(fly tw)(∀(y)(∨([=] y tw)(¬(bird y))(fly y)))))

TIME: 27.117 secs., CLOSURES: 556 els.

4.  Results 

The  deduction  system  has  been  used  to  produce  all  the  fixed  points  of  sets  of  axioms 
in [Moore]’s Autoepistemic Logic, our Nonconstructive Default Logic, and [Reiter]’s
Default Logic. It also has been used to compute the result of [McCarthy]’s parallel
Circumscription  with  both  fixed  and  variable  predicates  and  the  Closed  World  Assumption. 
Results  obtained  for  each  of  these  systems  are  described  in  the  following  subsections. 

4.1  Autoepistemic  Logic  and  NonConstructive  Default  Logic 

The fixed point equation of a theory g in Autoepistemic Logic [Moore, Konolige] is:

k=(classical-theorems-of(g∪{(L a):a∈k}∪{(¬(L a)):(¬(a∈k))}))

If the number of groups of modal statements in g is finite then the meaning of the
intersection of those fixed points entails the meanings of the same sentences
(not containing L) as does: (∃K(K∧(K≡(G2∧(∧i=1,n((([K]Ai)∧(∧j=1,mi(<K>Bij)))→Ci))))))
where (G2∧(∧i=1,n((([K]Ai)∧(∧j=1,mi(<K>Bij)))→Ci))) is the meaning of g written in a
normal form with L replaced by [K], and [K] does not occur in G2, Ai, Bij, or Ci.

Autoepistemic  fixed  points  with  all  sentences  containing  L  eliminated  are  equivalent 
to  the  fixed  points  of  our  Nonconstructive  Default  Logic.  The  fixed  point  equation 
of  our  Nonconstructive  Default  Logic,  called  NCD,  is: 

k=(classical-theorems-of(g∪{(ci):((ai)∈k)∧(∧j=1,mi(¬((¬bij)∈k)))}))
where for each i, the sentences ai, bij, and ci constitute a default. If the number of
defaults n is finite then the meaning of the intersection of the fixed points entails
the meanings of the same sentences as does:

(∃K(K∧(K≡(G∧(∧i=1,n((([K]Ai)∧(∧j=1,mi(<K>Bij)))→Ci))))))
where G is the meaning of the sentences in g, and Ai, Bij, Ci respectively are the
meanings of the sentences ai, bij, ci in the defaults. This Z representation can be
generalized to the case where quantified variables cross modal scopes with the
equation: (∃K(K∧(K≡(G∧(∧i=1,n(∀x1...xmi((([K]Ai)∧(∧j=1,mi(<K>Bij)))→Ci)))))))
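
For a finite propositional theory, the NCD fixed point equation can be checked by
brute force: every fixed point is the closure of g together with the conclusions of
some subset of the defaults, so each subset can be tried in turn. The following Python
sketch does this, representing a theory k by its set of models. All names
(ncd_fixed_points, models, entails, consistent) are hypothetical, and the two defaults
are an assumed propositional encoding of the Gelfond-Przymusinska example; this is
not the deduction system described in this paper.

```python
from itertools import combinations, product

ATOMS = ("r", "s")

def models(formulas):
    """Set of truth assignments over ATOMS satisfying every formula."""
    worlds = (dict(zip(ATOMS, v)) for v in product([False, True], repeat=len(ATOMS)))
    return frozenset(tuple(w[a] for a in ATOMS)
                     for w in worlds if all(f(w) for f in formulas))

def entails(k, f):
    """f is in k: f holds in every model of k."""
    return all(f(dict(zip(ATOMS, v))) for v in k)

def consistent(k, f):
    """The negation of f is not in k: f holds in some model of k."""
    return any(f(dict(zip(ATOMS, v))) for v in k)

def ncd_fixed_points(g, defaults):
    """defaults: list of (a, [b1, ..., bm], c). Returns the model sets of all fixed points."""
    found = []
    for size in range(len(defaults) + 1):
        for chosen in combinations(range(len(defaults)), size):
            # Candidate k: classical closure of g plus the chosen conclusions.
            k = models(list(g) + [defaults[i][2] for i in chosen])
            # Defaults whose condition (ai in k, no negated bij in k) holds in k.
            applicable = tuple(i for i, (a, bs, c) in enumerate(defaults)
                               if entails(k, a) and all(consistent(k, b) for b in bs))
            if applicable == chosen and k not in found:
                found.append(k)
    return found

TRUE = lambda w: True
R = lambda w: w["r"]
S = lambda w: w["s"]
NOT_R = lambda w: not w["r"]

gp_defaults = [(TRUE, [NOT_R], S),   # if (not r) is possible then s
               (R, [], R)]           # if r is known then r
fixed_points = ncd_fixed_points([], gp_defaults)
```

Under this encoding the search finds two fixed points, the theories of s and of r.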

Figure  1  gives  examples  in  the  Z  representation  of  Autoepistemic  Logic  and 
Nonconstructive Default Logic. An interesting example given at the bottom of this
figure  is  the  Gelfond-Przymusinska  example  which  has  2  fixed  points  in  Autoepistemic 
Logic,  as  is  computed  by  our  deduction  system.  It  is  interesting  because  it  only  has  1 
fixed  point  in  Reiter’s  default  logic  as  is  computed  in  figure  3,  contradicting  the 
claim  in  [Konolige]  that  the  two  systems  were  identical. 


(Nonmonotonic Nonconstructive-Propositional-Examples)

mo.icu  KM.I...M.C  k  (*  (-  b  (-  .)  (-  c>>>  <*  (<  k  >  .)  .)  (•  (<  k  »  b)  b)  <«  (<  k  >  c)  c)» 

solution,  ( ~  ( ■  k  { ^  a  c  ( ■*  b)))  ( ■  k  ( m  b  (-  a)  (-  c)))) 

TIME!  1.131  «••».,  CLO  SURE!  i  IBS  *1*. 

PAOBLCM  NCO.AaltarZ.2«(  —  k  ( ~  ( *•  ( <  k  >  c)  ( **  d) )  ( ■*  (  <  k  >  d)  ( ■*  e))  ( ■•  ( <  k  >  e)  (-  f)  >>■) 

•  OLUTIOMi  (■  k  {**  (-  d)  (-  f))) 

TIMEi  .$15  it*!.,  CLOSUACSi  153  all. 

pRoittM  MCO.k«lt«r2.Si( ■  k  (-{*(<»<  >  a)  b))  (*  <<  k  >  b)  (-  •)))) 

SOLUTION,  (■  k  (-»  a))  (■  k  b ) ) ) 

TIME.  .362  CLOSUACSi  62  •!«. 

PAOSLCM  NCO.AaltarZ.«i("  k  ,  .  ,  .  .  »*  (<  u 

(~  (*  (<  k  >  «)  a)  <*  (*([k3  b)  «k  >  c))  c)  <  *  (-  <C  k  ]  (-  d  •))  (<  k  >  el)  e) 

(*  (-  (C  k  3  c  a))  (<  k  >  (-  a))  ( <  k  >  ( -  d  a)))  f)  b  (-  c  d  a))  (-  (-  a  c)  (  e)))) 

solutiomi  (■  k  (~  a  b  c  (■*  e))) 

TIMEi  27.570  S««S.,  CLOSUACSi  3624  all. 


PROIKM  NCO.AallarZ.9i  \  /_ 

(•  k  <-  <-  <-  ([  k  )  a)  (<  k  >  (3  (x)  (pr  x))))  (3  (x)  (pr  x) ) )  (-  (<  k  >  a)  a)  <-  (<  k  >  ( -  a))  (  a)))) 

solutioni  (•  (■  k  ( -  a))  { ■  k  ( •*  a  (3  (x)  ( pr  «))))) 

TIME,  2.742  ft.,  CLOSUACSi  440  ait. 

PAOBLCM  NCO.AaltarZ.il  ( ■  k  (■•  (<  k  >  a)  (  **  a)  >  ) 

solutiomi  nil 

TIMEi  -095  iaai.,  CLOSUACSi  17  all. 

PflOSltM  NCO.Aaltar  4.1:  (  »  k  .  /  .  h,»  hj 

<-  <-  (~  (C  k  ]  (-  e  r ) )  (<  k  >  (-  0  f))>  (~  a  T) )  U  k  ]  a)  (<  k  >  b),  b) 

(-  (-  (C  k  ]  (-  a  e) )  (<  k  >  c))  c)  (-  (<  k  >  (-  e))  ('  e))  (-  c  d)  <-  t-  a  b)  e)  (  e  d, 

<♦  d  f))> 

SOLUTION,  (-  k  (-  a  d  f  (-  b)  <-  e)))  (■  k  (-  a  b  c  d  e  f))) 

TIME,  20.515  taai.,  CLOSUACSi  2455  all. 

PAOBLCM  NC0.Aalt«r3.1i(  ■  k  t  ,  ...  ,  .  .  .v  , 

(~  (*  (<  k  >  a)  a)  (-  (-  (C  k  ]  (3  (x  y)  (bp  x  y)))  (<  k  >  c))  c) 

(•  (-  (C  k  J  (-  d  a))  (<  k  >  e))  e)  ,  ,  .  ...... 

(-  (-  (t  k  )  {•  c  e))  (<  k  >  (3  (x)  (V  (y)  (~  (Tp  x  y)  (9  x)))>>> 

(3  (x)  (V  (y)  <-  ito  x  y)  {9  x))))) 

(3  ( f 2)  (W  (M)  (bp  x  ( f 2  x))))  (-  (-  c)  d  •)  <-  c)  C-  «)  -)/>) 

sol  u  tioni  <~  (■  k  <~  a  c  (-  e)  (3  (#*2)  (V  (x)  (bp  x  ( f  2  x)))))) 

(■  k  (-  a  e  C-  c)  (3  (f2)  (W  (x)  (bp  x  ( f  2  x))))))  #  #  .  rr> 

(■  k  (-  c  d  e  (-  a)  (3  (x)  <~  (9  x)  (V  (y)  (fp  x  y))))  (3  CT2)  (W  (x)  (bp  x  ( f  2  x))))))) 

TIME.  58.030  a  a  a  1 . ,  CLOSUACSi  0736  all. 


PAOiLCM  Malta  aiaaku  (■  k  - 

(  -  (lablock  bl )  ( *•  (  <  k  >  (-  (lablock  bl )  ) )  ( -  (tsblock  bl  )  )  ) 

(-  (<  k  >  (-  (iablock  b2)))  ( -  (isblock  b2))) 

( -  ( <  k  >  ('  (isblock  b3) ) )  (-  (iablock  b3))))) 
s0luT10M;  (■  k  (-  (isblock  bl)  <-  (<3block  b2))  ( -  (isblock  b3)))> 

TlMCi  .673  mi.,  CLOSUACSi  175  all. 

PAOBLCM  Calfaaa-Priy"i«»l«»fc  ai  (■  k  (•(*(<  k  >  (  -  r))  3)  (  "•  (  (  k  ]  r)  r))) 

SOLUTION)  (  ■*'  (■  k  r)  (■  k  3)) 

TIMCI  .904  •«•».,  CLOSUACSi  78  all. 
Figure 1


Figure  2  gives  examples  which  are  not  representable  in  Autoepistemic  Logic  or 
NonConstructive  Default  logic,  but  which  can  be  viewed  as  extensions  to  these  two 
systems  to  the  case  where  variables  are  allowed  to  cross  modal  scopes. 
(Nonmonotonic Nonconstructive-Quantificational-Examples)


PROBLEM  «v«r  Oafaalta  .(■  k  ( -  (3  (x)  (q  *))  (-  (p  bl ) )  (V  (x)  (♦  (<  k  >  (p  *))  (p  x))))) 

SOLUTION!  (■  k  (~  (-  ( o  bl))  (V  (y>  (-  ( [■)  v  bl )  (p  y)))  (3  <*)  (q  x)))) 

TIMS.  .978  «««*.,  ctosukC*i.^92  «... 

PROBLEM  MiCtflky'l  IlMku 

<•  k  (-  (fablock  bl)  (isblock  b2)  (isblock  b3)  (V  (x)  (*  (<  k  >  (-  (Itblock  x)))  (lablock  x)))))) 

SOLUTION! 

(■  k  (~  (Ublock  bl)  (  lablock  b2)  ((sblock  b3)  (V  (y)  ( ~  (  [=)  y  bl )  ( £*}  y  b2)  ( (*)  y  b3)  (-<  (lablock  y)))))) 

TIME.  2.128  ccosunct.  441  «l|. 

PROtLCM  TW.PIUII  (■  k  (~  (bird  twenty)  (V  <x)  <*  (-  (bird  r.)  (<  k  >  (fly  *)))  (fly  *))))) 

SOLUTION.  <•  k  (~  (bird  tueety)  (fly  tueety)  (V  (y)  (~  ( [«)  y  tueety)  (^  (bird  y))  (fly  y))))) 

time.  2.236  iitiii  closures.  556  «it. 

PROtLCM  CW.^riltu  (■  k 

(bird  tueety)  (bird  chilly)  (^  (fly  chilly))  (8  (x)  (*  (bird  x)  <<  k  >  (fly  x)))  (riy  x))))) 

SOLUTION.  (■  k 

('*  (bird  chilly)  (bird  tueety)  (fly  tuecty)  (-*  (fly  chilly)) 

(8  (y)  (~  (  [=)  y  chilly)  (  («]  y  tuecty)  (-  (bird  y))  (fly  y))))) 

time.  6.885  closures.  1589  «is. 

proslem  0|iJ«mU««  SttaUritAilM.  (»  k  ( *  ( *  (fly  tueety)  (fly  chilly))  (V  (x)  (■•  (<  k  >  (-  (fly  x)))  (-«  (fly  x)))))) 

solution.  ( -  (•  k  ( *  (fly  chilly)  (*•  (fly  tueety))  (V  (y)  (»  (  (»]  y  chilly)  ( («)  y  tueety)  (-  (fly  y)))))) 

(■  k  (-■  (fly  tueety)  (-  (fly  chilly))  (V  (y)  (-  ( (*)  y  chilly)  ( (*)  y  tueety)  (-  (fly  y))))))) 

Time.  7.896  *«•«.,  closures.  898 

PROtLCM  C*afli«  May  OifAtlll  (M.Oira.it'i  PSD.  Exampla).  (»  k 

(*  (prof  fr)  (nd  fr) 

(W  (x)  (*•  (-  (prof  x)  ( <  k  >  (phd  x)))  (phd  x))) 

(V  (x)  (-  (-  ( nd  x)  (<  k  >  (-  (phd  x))))  (-  (phd  x)))))) 

SOLUTION.  ( - 

(■  k 

(-  (nd  fr)  (phd  fr)  (prof  fr)  (V  (y)  (~  (  (  =  ]  y  fr)  (-  (prof  y)>  (phd  y))) 

(V  (y)  (-  (  (=)  y  fr)  (-  (nd  y) )  (-  (phd  y)>)))) 

(■  k 

(-  (nd  fr)  (prof  fr)  ( -  (phd  fr))  (W  (y)  (~  ( (=)  y  fr)  ( -  (prof  y))  (phd  y) ) ) 

(V#  (y)  (-  ([  =  )  y  fr)  (-  (nd  y))  <-  (phd  y))))))) 
t.mc.  12.400  «*«•.,  closure s.  3486  *i«. 

PROtLCM  O.irlikti  l.l  Aka«rn«li(>  k 

(-  (V  (x)  ( •»  ( *  (bird  k)  (^  (ab  x)))  (fly  x)))  (V  (x)  ( ••  (ostrich  x)  ( ab  x))) 

(W  (x)  (-  (<  k  >  (-  (ab  x)))  (*.  (ab  x)))>>) 

SOLUTION.  (■  k 

(-  (V  <x)  (-  (ab  x))>  (V  (x)  (  -  ( -»  (ostrich  x))  ( ab  x)))  (V  (x)  ( •  (^  (bird  x))  ( ab  x)  (fly  x))))) 

TIME.  .497  mi.,  CLOSURES.  87  all. 

PROtLCM  Gaaaia.atfc  u.  Nllnaa  1.  (■  k 

(~  (bird  tueety)  (-  (ostrich  son))  (V  (x)  (-*  (ostrich  x)  (bird  x))) 

(W  ( x )  ( (  <  k  >  (-  (bird  x)))  (-  (bird  x)))))> 

SOLUTION!  (■  k 

(“  (bird  tueety)  ( -  (ostrich  san) )  (U  (y)  (-  (  (  =  ]  y  tueety)  (^  (bird  y)))) 

(8  (y)  (•  (  (  =  )  y  tueety)  ( -  (ostrich  y))  (bird  y))))) 

TIME.  1.314  CLOSURES.  363  all. 


PROSLEM  Ciaiif.ilk  Nilnaa  2.  (■  k 

( *  (V  (x)  ( ••  (knight  x)  (person  x)))  (W  (x)  ( **  (knave  x)  (person  x)  )  ) 

(V  (x)  («  (knave  x)  (lien  x)))  (3  (x)  ( **  ( -  (liar  x))  ( -  (knave  x))))  (  1  i  ar  nork) 
(knave  bork)  (W  (x)  (■•  (<  k  >  (-  (liar  x)))  (-  (liar  x)))))) 

SOLUTION.  (  0  k 

(-  (knave  bork)  (liar  bork)  (liar  nork)  (person  bork)  (W  (x)  ( »  ( •*  (knight  x))  (person  x))) 

(W  (y)  ( -  (  (  =  )  y  bork)  ( -«  (knave  y) )  (person  y))) 

( 8  (y)  ( "  (  (  =  ]  y  bork)  ( f=]  y  nork )  (liar  y)))) 

(W  (y)  ( ~  (  (  =  ]  y  bork)  (  [  =  ]  y  nork)  (  -»  (knave  y))  (liar  y))) 

(3  (x)  (  -  (-  (knave  x))  (-  (liar  x)))))) 

TIME.  6.94S  CLOSURES.  1168  all. 
Figure 2

4.2  Constructive  Default  Logic 

The fixed point equation for [Reiter]’s default logic, which we call Constructive Default
Logic to distinguish it from other default logics, is defined as:
k=(∩{p:((classical-theorems-of p)⊆p)∧(g⊆p)
       ∧(∧i=1,n((((ai)∈p)∧(∧j=1,mi(¬((¬bij)∈k))))→((ci)∈p)))})

For a finite number of defaults, the meaning of the intersection of the fixed points
entails the meanings of the same sentences as does:
(∃K(K∧(K≡(∃P(P∧([P]G)∧(∧i=1,n((([P]Ai)∧(∧j=1,mi(<K>Bij)))→([P]Ci)))))))))
where G is the meaning of the sentences in g, and Ai, Bij, Ci are respectively the
meanings of the sentences ai, bij, ci in the defaults.
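
The constructive fixed point equation can also be checked by brute force in the finite
propositional case. In the Python sketch below, each candidate k is tested by building
the least set p that contains g and is closed under the defaults whose justifications
are consistent with k, and then comparing p with k. The iteration used to build p is a
standard way of computing the intersection in the equation; it is an assumption of
this sketch, as are all names and the propositional encoding of the
Gelfond-Przymusinska example.

```python
from itertools import combinations, product

ATOMS = ("r", "s")

def models(formulas):
    """Set of truth assignments over ATOMS satisfying every formula."""
    worlds = (dict(zip(ATOMS, v)) for v in product([False, True], repeat=len(ATOMS)))
    return frozenset(tuple(w[a] for a in ATOMS)
                     for w in worlds if all(f(w) for f in formulas))

def entails(k, f):
    """f holds in every model of k."""
    return all(f(dict(zip(ATOMS, v))) for v in k)

def consistent(k, f):
    """f holds in some model of k."""
    return any(f(dict(zip(ATOMS, v))) for v in k)

def reiter_extensions(g, defaults):
    """defaults: list of (a, [b1, ..., bm], c). Returns the model sets of all extensions."""
    found = []
    for size in range(len(defaults) + 1):
        for chosen in combinations(range(len(defaults)), size):
            k = models(list(g) + [defaults[i][2] for i in chosen])
            premises, used, changed = list(g), set(), True
            while changed:  # grow the least p closed under the defaults
                changed = False
                p = models(premises)
                for i, (a, bs, c) in enumerate(defaults):
                    # Prerequisite is tested against p, justifications against k.
                    if i not in used and entails(p, a) and all(consistent(k, b) for b in bs):
                        premises.append(c)
                        used.add(i)
                        changed = True
            if models(premises) == k and k not in found:
                found.append(k)
    return found

TRUE = lambda w: True
R = lambda w: w["r"]
S = lambda w: w["s"]
NOT_R = lambda w: not w["r"]

gp_defaults = [(TRUE, [NOT_R], S),   # if (not r) is possible then s
               (R, [], R)]           # if r is derived then r
extensions = reiter_extensions([], gp_defaults)
```

With this encoding only the theory of s survives, because the prerequisite r of the
second default must be derived in p rather than merely assumed in k.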

Figure  3  gives  examples  in  the  Z  representation  of  [Reiter]’s  Default  Logic.  It  will  be 
noted  that  example  CD.Reiter2.4  has  only  1  fixed  point,  contrary  to  the  claim  in  [Reiter] 
that  it  has  3  fixed  points  in  his  default  logic. 


(Nonmonotonic R.Reiter-Constructive-Defaults)

PNOSLCM  CD.N<lt«rt.1l(a  k 

<3  (J) 

<-  J  <C  J  3  (-  b  (-  (-  a )  ('  c))3)  (■>  (<  k  >  a)  ([  J  I  «)>  (•  (<  k  >  b)  (C  J  }  b) ) 

(♦  <<  k  >  e)  <[  J  ]  e))))) 

•oioriok,  («  (■  k  («  a  c  ('  b>))  (•  k  (-  b  ,)  c)))) 

TIMIt  4.762  •  ••«.,  CLOSURE  Si  >01  «|t. 

PROBLEM  CO.Rcl  t«r2.2i  (  ■  k 

(3  (J)  (-  J  <-  (<  k  >  e)  (C  J  3  <-  d>)>  (-  (<  k  >  d>  (t  J  I  (-  a) ) )  l«  (<  k  >  a)  ( C  J  ]  (-  r>>)>>) 

SOLUTION)  (•  k  <-  (m  d)  ('  f))) 

TIMfi  2.784  >••>.«  CLOSURES*  429  •!«. 

pnoslcm  co.N.lt. , 2.2, (a  k  (3  (J)  <-  J  (-  (<  k  >  a)  U  J  3  b>))  (-  (<  k  >  b)  ([  J  3  ('  a)))))) 

SOLUTION)  (.  (.  k  ('  a))  (■  k  (-  b))) 

TIMC.  1.385  mi.,  closuncsi  153  <li. 

rsosLCM  CO. k, II. )!...(•  k 

(3  (j) 

(-  J  (*  ( <  k  >  a)  ( t  J  3  a))  <-  (-  <t  J  3  b>  (<  k  >  c))  (t  J  3  c ) ) 

(*  (-  <C  J  3  (-  d  a)>  (<  k  >  *>)  (t  J  3  e)) 

(-  (-  ([  J  3  (*  c  a))  ( <  k  >  < -  a))  (<  k  >  (-  d  a) ) >  (C  J  3  f>) 

(C  J  3  (-  b  <-  e  {-  d  a))  (.  (-  a  c)  (k  e))))))) 
solution)  (■k(>abc(--e))) 

Tint)  53 . 955  )««).,  CLOSUNCS)  9806  <11. 

pnoslcm  C0.Ntlt.i2.St  (  ■  k 

(3  (J) 

J  <*  (-  ([  J  3  a)  (<  k  >  (3  (x)  (pr  k)>>)  ([  J  3  (3  (k)  (pr  N)))>  (-  (<  k  >  a)  (t  j  3  a)) 

(■•  (<  k  >  (-  a))  (t  J  3  ('  a)))))) 

SOLUTION.  (-  (a  k  (-  a))  (•  k  (-  a  (3  (»)  (pr  «))))) 

TIMt.  3.090  CLOSUNCS)  186  .If. 

PNOSLCM  CO.N.I).,ll,(  a  k  (3  <J)  ( *  j  ( -  (  <  k  >  a)  (  (  J  3  <  -  a)))))) 

SOLUTION)  nil 

Tint)  .372  tatl.,  CLOSUNCSI  11  til. 

PNOSLCM  C0.Nalt.r4.1i  (  1  k 

(3  (J) 

(-  J  (-  (*  <C  j  3  (-  a  O)  (<  k  >  (-  a  0)1  (  [  j  3  ( ~  a  fill 

(-  (-  (C  J  3  «)  (<  k  )  b)>  (I  J  3  b> )  (*  (•llj)(-i  e) )  (<  k  >  c)>  (t  J  3  c) ) 

(.  (<  k  >  (-  =) )  <[  J  3  (-  «)))  (  [  J  3  ( -  ( •  c  d)  (*  <-  a  b)  c )  (-  c  d)  (.  d 

SOLUTION)  ( *  ( ■  k  ( -  a  d  f  < b)  (-  e)))  (■  k  (~  a  b  c  d  e  f)3) 

TIMC)  16.639  CLOSUNCS)  7311  ill 

PNOSLCM  C0,N.II.)S.<)(  ■  k 

(3  (J) 

(~  J  (■•  (  <  k  >  a)  (  [  J  3  a))  (••  (-  ((  J  3  (3  (it  y)  (bp  k  y)))  (t  k  >  c ) )  ((  J  3  cl) 

(-  (-  ((  J  3  (-  d  a))  (<  k  >  c )  3  ([  J  3  e>) 

<-  (-  (C  J  3  <-  c  cl)  (<  k  >  (3  (k)  (V  (y)  (-  (fp  k  y)  <9  «)>)))) 

((  J  3  (3  (k)  (V  (y)  (-  (fp  k  y)  (9  «>)>>)) 

(  [  J  3  ( -  (3  (f2)  (V  (k)  (bp  k  ( f  2  kill)  (-  (-  c)  d  a)  (-  (-  c)  (-  e)  (-  a))))))) 
solution)  (-  (•  k  <-  a  t  (-  c)  (3  ( f  2)  (V  («)  (bp  k  ( f2  kill))) 

(■  k  (-  a  c  ('  c)  (3  ( f 2)  (V  (k)  (bp  k  ( f 2  k)))))l 

(■  k  (-  c  d  a  (-  a)  (3  (k)  (-  (9  k)  (V  (y)  ( fp  x  y))))  (3  (r2)  (9  (k)  (bp  k  (f2  kill)))) 

Tim,  88.866  ).•■.,  closuncsi  11516  .u. 

PNOSLCM  Tl.lt.  O.h.I.  .f  ■l..kn  (■  k 

(3  (j) 

(-  J  <(  J  3  (  Isblock  bl))  (■•  (<  k  >  (k  (Isblock  bill)  (C  J  3  (-  (  lablock  bl )  1 1 1 

(.  (<  k  >  ('  (Isblock  b2)))  ([  J  3  ('  (Isblock  b2)))> 

(-  (<  k  >  (k  (Isblock  b3) ) )  ([  J  3  (’  (Isblock  b3))))))l 

solution,  (■  k  ("  (Isblock  bl )  (-  (Isblock  b21)  (k  (Isblock  b3)))) 

TIMC)  2.051  t  • .  ■ .  j  CLOSUNCS,  315  III. 

PNOBlCM  S.IT..S-P. tyM.il.ik a  COi  (■  k  (3  (J)  ( k  J  (.  (<  k  >  (*  r))  ((  J  3  s))  (■•  ((  J  3  r)  ((  J  3  r))))) 
solution,  (•  k  s) 

TIMC,  .999  CLOSUNCS,  85  .It. 
Figure 3

Figure 4 gives examples which are not representable in [Reiter]’s default logic but
which can be viewed as an extension of this system to the case where variables are
allowed to cross modal scopes. Such equations are of the form:

(∃K(K∧(K≡(∃P(P∧([P]G)∧(∧i=1,n(∀x1...xmi((([P]Ai)∧(∧j=1,mi(<K>Bij)))→([P]Ci))))))))))

The "Quantifying over defaults" example is interesting because it contradicts the
suggested solution in [Reiter] for an extension of his logic that allows quantified
variables to cross modal scopes.


(Nonmonotonic Constructive-Quantificational-Defaults)


(■  k  (3  (J)  J  <t  J  ]  (-  (3  <x>  (a  »))  <-  (0  bl))>)  (V  (x)  <•  (<  k  >  (p  *>)  (C  J  )  (p  x))>)))> 
solutio.i  (»  k  (»  (-  (p  bl))  (8  (y)  (-  (C-i  y  bl)  (p  y))>  (3  (x)  (a  x))>) 

timci  1.699  •••>.,  cioimiii  225  «u. 


(3  (J) 

(~  J  (t  J  ]  <*  ( Isbleck  bl)  deblock  b2)  (l.bloek  b3))) 

(V  (x)  (-  (<  k  >  <-  ( 1 abloek  x)))  (t  J  3  ('  ((.block  x>>)>>)>) 

SOtUTtO.i 

(e  k  (»  ( 1  .block  bl)  ((.block  b2)  ((.block  b3)  (0  (y)  (-  ( y  bl)  ( [»)  y  b2)  (W  y  b3)  ('  ((.block  y)))>)) 
nuci  1.989  <••>..  cto.u.c.i  991  an. 

fX4IUM  DliJ«*»llv«  0«ti  tlfl*  4  «U»i 

(«  k  (3  (J)  («  J  ([  J  ]  (•  (fly  tuccty)  (fly  chilly)))  (W  (x)  ( -  ( *  k  »  (~  (fly  x)))  ([  J  )  (-.  (fly  x))>)>)>) 
SOlUTIO.i  (»  (•  k  (x  (fly  chilly)  (-  (fly  tuccty))  (V  (y)  (•  (  t*l  y  chilly)  ( [«)  y  tuecty)  (-  (fly  y))>)>) 

(.  k  («  (fly  tuccty)  ('  (fly  chilly))  (V  (y)  {-  ((*)  y  chilly)  ( [«)  y  tuccty)  (-  (fly  y>)))))) 

TIM f.  5.997  ■<•!.,  CLOSURtti  987  <11. 

P.O.I  CM  Ottfltlll  414  1lMfXlll(>  W 

(3  (j) 

C-  J 

(i  J  1  (-  (V  (x)  (x  (-  (bird  x)  (-  (*b  x)))  (fly  x)))  (V  (x)  (*  (ostrich  x)  ( sb  x))))) 

(V  (.)  (.  (<  k  >  (x  (,b  «)))  l(  j  ]  (-  (.b  x)))))))) 

.OlUTIOCi  (•  k 

(-  (W  (x)  («  (cb  x)))  (V  (x)  (*  (-  (ostrich  x))  (.b  x) ) )  (V  (x)  (-  (-  (bird  x>)  (.b  x)  (fly  x>))>) 
TlMCi  .755  Mil.,  Cto.u.c.i  116  all. 

r.o.UM  Sniimik  til  xllii..  u  (c  k 

(3  (J) 

(-  j  (U)  (-  (bird  tuccty)  (x  (ostrich  sen)))) 

<(  J  )  (W  (x)  (•  (ostrich  x)  (bird  x)))) 

( V  (x)  <*  (<  k  >  (-  (bird  x)))  (C  J  )  ('  (bird  x>)))>))) 

SOtUTIOXi  (•  k 

(x  (bird  tuccty)  (-  (ostrich  sen))  (V  (y)  ('  ((*}  y  tuccty)  (x  (bird  y)))) 

(V  (y)  ( -  ( (*)  y  tuccty)  (x  (ostrich  y))  (bird  y))))) 
timci  2.350  cto.u.c.i  909  •!■. 


r.o.UM  Gaaalar 4  It  ac<  Mllltaa  !•  (•  k 

(3  (J) 

(-  J 

((  J  ) 

(-  (V  (x)  (x  (knight  x)  (person  x))>  (0  (x)  (x  (kneve  x)  (person  x))) 

(V  (x)  (a  ( knave  x)  (K.r  x)))  (3  (x)  (-  ('  Olcr  x))  ('  (kn.ve  x>))>  (Her  nork) 
( kneve  bork ) ) ) 

(V  (x)  (*  (<  k  >  (-  (K.r  «)))  ((  J  )  (-  (H»r  x)))))))> 

(OlUTIOCi  (*  k 

(•  (kneve  bork)  (Her  bork)  (liar  nork)  (person  bork)  (V  (x)  (-  (x  (knight  x))  (person  x))) 

(V  (y)  ( X  ([»)  y  bork)  (x  (knsue  y))  (person  y)>) 

(W  (y)  (x  ( («)  y  bork)  (  (a)  y  nork)  (x  (Her  y)))) 

(V  (y)  (x  ((a)  y  bork)  ( C*3  y  nork)  <■  (kneve  y>)  (Her  y))) 

(3  (x)  (-  (-  (kneve  x))  (-  (Her  x)))>)) 

TlMCi  7.099  laal.,  cto.u.c.i  1309  ah. 
Figure 4


4.3  Parallel  Circumscription 

The Parallel Circumscription [McCarthy80] of a theory G of first order logic without
equality by π with ζ variable is defined as follows:

(Circ(G π ζ)π ζ) =df ((G π ζ)∧(∀p,z((G p z)∧(∀i,x((pi x)→(πi x))))→(∀i,x((πi x)→(pi x)))))
where π and ζ are finite sequences of predicates, p and z are sequences of predicate
variables of the same length as π and ζ respectively, the arity of corresponding
predicates and predicate variables is the same, x following a predicate or predicate
variable represents a sequence of variables of that arity, and (G p z) is the
replacement of all unmodalized occurrences of π and ζ in (G π ζ) by p and z. Parallel
Circumscription, with both variable and fixed predicates, is the disjunction of the
solutions of the fixed point equation containing the initial axioms G and defaults
specifying that every proposition formed from a circumscribed predicate is false by
default and every proposition formed from a fixed predicate is both true and false by
default [Brown89a]:

(Circ(G π ζ)π ζ)≡(∃K(K∧(K≡((G π ζ)∧(∀i(∀x((<K>(¬(πi x)))→(¬(πi x)))))
                      ∧(∀i(∀x((<K>(¬(~πζi x)))→(¬(~πζi x)))))
                      ∧(∀i(∀x((<K>(~πζi x))→(~πζi x)))) ))))

where ~πζ is the sequence of all predicates not in π or ζ. Finite parallel
circumscription is circumscription where (∀x(x[=]δ1)∨...∨(x[=]δn)) follows from (G π ζ).
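
In the finite case, circumscribing a single predicate amounts to keeping the models in
which its extension is minimal. The Python sketch below illustrates this for one
circumscribed predicate over a fixed finite domain, with no variable or fixed
predicates; it is a simplification of the definition above, not the fixed point
computation of the deduction system. The function name, the domain, and the
individual "opus" are hypothetical.

```python
from itertools import combinations

def minimal_extensions(domain, axiom):
    """Extensions of one predicate that satisfy axiom and have no satisfying proper subset."""
    candidates = [frozenset(c)
                  for n in range(len(domain) + 1)
                  for c in combinations(domain, n)
                  if axiom(frozenset(c))]
    return [e for e in candidates if not any(other < e for other in candidates)]

# Disjunctive axiom: tweety flies or chilly flies; the predicate fly is circumscribed.
domain = ("tweety", "chilly", "opus")
flies_axiom = lambda fly: "tweety" in fly or "chilly" in fly
solutions = minimal_extensions(domain, flies_axiom)
```

The two minimal extensions, one in which only tweety flies and one in which only
chilly flies, correspond to a disjunction of two solutions.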


Figure  5  gives  examples  of  Circumscription  in  the  Z  representation. 


(Nonmonotonic Circumscription)

problem  a««mifyi*9  «v«r  o«r««itt  i(3  (k)  (~  k  (■  k  (~  (3  (x)  (q  k>)  (•*  (p  bl))  (W  (x)  (■•  (<  k  >  (p  x))  (p  «))))))) 
solutiomi  (~  (-  (p  bl))  (V  (y)  (~  ( (=)  y  bl)  (o  y)))  (3  (x)  (a  x))) 

TIMf i  1.206  iaa«.,  CLOSURES.  266  cl*. 

PROBLEM  MaCartfcy'l  Blatkti  ...  .  ,  uu,s,. 

(3  (k)  (-  k  (m  k  (-  (  1  sb  lock  bl )  (isblock  b2)  (isblock  b3)  (V  (x)  ( •*  ( <  k  >  (-  (Isblock  x)))  (-  (isblock  *))))).)) 

SOLUTION!  (~  (isblock  bl)  (isblock  b2)  (isblock  b3) 

<V  (y)  (~  (C=]  y  bl)  (CO  y  b2)  ( CO  y  b3)  (-  (isblock  y))))) 

TlMCi  2.023  •  CLOSURES!  507  alt. 

PROBLEM  Dlijxaa  ll  OvaatiriMlUii  ... 

(3  (k)  (~  k  (■  k  (~  (-  (fly  tueety)  (fly  chilly))  (Y  (x)  (-  (<  k  >  (-  (fly  *>))(-*  ir\y  «))  )  )) 

SOLUTION:  (~  <~  (fly  tueety)  (-  (fly  chilly))  <U  (y)  (-  (  C=)  y  chilly)  (  [  =  )  y  tueety  ( -*  {fly  y  >) )) 

<~  (fly  chilly)  (-  (fl.»  tueety))  (*  (y)  (~  ( C  =  3  y  chilly)  ( C=1  V  tueety)  (-  (fly  y)))))) 

TIME!  4.791  CLOSURES:  1055  all. 

PROBLEM  Ottrlakas  «ra  Ak«»ra«ll(3  (k) 

(“  k 
(  ■  |( 

(-  (V  (*)  (-  (-  (bird  ")  (-  ( eb  x))>  (fly  x) ) )  (V  (x)  l*  (ostrich  k)  ( sb  x))) 

(V  <x)  (-  (<  k  >  (-  (eb  x>>)  (-  (eb  *))))>)>> 

solutiomi  (-  (W  <x>  (-  (eb  »)))  IV  <»)  (-  (-  (ostrich  x>)  (eb  *>>)  (W  («)  (*  <-  (bird  x>)  ( eb  «)  l'l»  »>>)> 

TIMfi  .609  ft.,  CLOSURCSi  141  all. 

PROBLEM  Gaaaiarath  *«4  Niliiaa  1i  (3  (k) 

(-  k 
( ■  k 

(*  (bird  tueety)  (-  (ostrich  sen))  (V  (x)  (  *  (ostrich  x)  (bird  x))) 

(V  (x)  <-  (<  k  >  <-  (bird  x)>)  (-  (bird  x))))>))) 

SOLUTIOMI  (~  (bird  tueety)  ( **  (ostrich  sanj)  (V  (y)  (~  (  [  =  )  y  tueety)  ( -»  (bird  y)))) 

(W  (y)  ( •  (  [  =  ]  y  tueety)  (-»  (ostrich  y))  (bird  y)))) 

TIME!  1.772  1  a  •  t.(  CLOSURES!  474  all. 

PROBLEM  CiMiaraik  Nlliiai  *i  (3  (k) 

(-  k 
(  ■  k 

(-  (V  ( x )  («  (knight  x)  (person  x)))  (V  (x)  (knave  x)  (person  x)); 

(V  (x)  (*  (knave  x)  (liar  x)))  (3  (x)  (-  ( -  (Kar  x))  (-  knave  x))))  (U*r  nork) 
(knave  bork)  (V  (x)  ( ■*  ( <  k  >  (-  (liar  x)>)  (-  (liar  «)>)))>)) 

SOLUTION.  ( -  (knave  bork)  (liar  bork)  (liar  nork)  (person  bork)  (V  (x)  ( ~  ( **  (knight  x)  )  (person  x)  f  ) 

(V  (y)  ( *■'  ( [=0  y  bork)  (-  (knave  y) )  (person  y))) 

(W  (y)  (w  ( C"1  y  bork)  (  (=)  y  nork)  (-  (liar  y) ) ) )  ... 

(W  (y)  (-  ((=J  y  bork)  (C=J  y  nork!  ('  (kneve  y))  (Her  y))>  13  (x)  (-  <~  (kneuc  x)>  ('  (Her  *)>)> 

) 

TIMKl  8.523  !«(■.,  CLOSUREBl  1599  •»«. 
Figure 5

Figure 6 gives examples which are not representable in Circumscription but which can be
viewed as extensions of this system to the case where more complex defaults are used,
such as in Autoepistemic Logic or in [Reiter]’s default logic.


(Nonmonotonic Extensions-to-Circumscription)

MOIUU  TW.riUit  (3  (k)  (~  k  (»  k  (-  (bird  tweety)  (V  (x)  (*•  (bird  x>  { <  k  >  (fly  *)))  (fly  x))))))) 

solution,  (*  (bird  tueety)  (fly  tueety)  (V  (y)  (v  (  C2)  y  tueety)  (-*  (bird  y))  (fly  y)))) 

time,  2.835  mi.,  CLOtURtli  634  *(i. 

Moslem  cw.-nust 

(3  (k)  (-  k  (»  k  (•*  (bird  tueety)  (bird  chilly)  (-*  (fly  chilly))  (V  (x)  (■*  (-  (bird  x)  (<  k  >  (fly  x)))  (fly  «))))))) 

solution!  (•*  (bird  chilly)  (bird  tueety)  (fly  tueety)  (-  (fly  chilly)) 

(V  (y)  (*  (  C=0  y  chilly)  (  [*]  y  tueety)  (-  (bird  y))  (fly  y)))) 
timIi  6.349  (•«(.,  closures,  1787  •!«. 

MOSLEM  C«aflUlUf  (M«0«ri»»tt*i  PhD.  Cx««fU)i  (3  (k) 

(-  k 
(■  K 

(~  (prof  fr)  ( nd  fr) 

(W  (x)  (*  (-  (prof  x)  (<  k  >  (phd  x)))  (phd  x))) 

(V  (x)  (-  (-  (nd  x)  (<  k  >  (-  (phd  x))))  (-  (phd  x)))))))) 

SOLUTION!  (- 

(~  (nd  fr)  (prof  fr)  (-  (phd  fr))  (W  (y)  (-  ( [s]  y  fr)  (-  (prof  y))  (phd  y))) 

(W  (y)  (~  ( («3  y  fr)  (-  (nd  y) )  (-  (phd  y))))) 

(*  (nd  fr)  (phd  fr)  (prof  fr)  (V  (y)  (~  (  03  y  fr)  (-  (prof  y))  (phd  y))) 

(V  (y)  (v  ((*]  y  fP)  (-  (nd  y) )  (-  (phd  y)))))) 

TIME!  13. 009  mt.,  closures,  3865  •■». 
Figure 6

4.4  The  Closed  World  Assumption 

The closed world assumption on a theory g is the union of g and the set of negations of
the simple sentences, formed from predicates πi and sequences of variable free
terms δ of the appropriate arity, which are not deducible from g:

(g∪{(list '¬(list πi δ)): (¬((list πi δ)∈(cl-th g)))})

The closed world assumption on a set of sentences g with respect to π is the meaning
of g anded to the defaults which state that if the negation of a sentence formed from
πi and a sequence of variable free terms δ=δ1...δni is possible then that negation is
the case: (G∧(∀i(∀δ((<G>(¬(πi δ)))→(¬(πi δ)))))) where G is the meaning of g. The
Generalized Closed World Assumption is represented by the expression:

(CWA* G π) =df G∧(∀i(∀x((<G>(¬(πi x)))→(¬(πi x)))))
where x=x1...xni is a sequence of variables whose length is the arity of πi.
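
For ground atoms, the closed world assumption can be sketched directly from its
definition: every atom that is not deducible from g is negated. The Python fragment
below does this by enumerating truth assignments. The function name closed_world and
the atom names are hypothetical, and the sketch covers only the propositional case,
not the generalized form with variables.

```python
from itertools import product

def closed_world(atoms, g):
    """Return the worlds of g that also satisfy the negation of every non-deducible atom."""
    worlds = [dict(zip(atoms, v)) for v in product([False, True], repeat=len(atoms))]
    g_worlds = [w for w in worlds if g(w)]
    # An atom is deducible from g when it is true in every world of g.
    not_deducible = [a for a in atoms if not all(w[a] for w in g_worlds)]
    return [w for w in g_worlds if not any(w[a] for a in not_deducible)]

# Axioms a and b over atoms a, b, c, d: c and d become false.
cwa1 = closed_world(["a", "b", "c", "d"], lambda w: w["a"] and w["b"])

# A disjunction with neither disjunct deducible: both are negated, leaving no world.
disjunctive = closed_world(["fly_tweety", "fly_chilly"],
                           lambda w: w["fly_tweety"] or w["fly_chilly"])
```

The empty result in the second call reflects the well-known inconsistency of the
closed world assumption on disjunctive information.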

Figure 7 gives examples in the Z representation of the closed world assumption.


(Nonmonotonic Closed-World-Assumption)

fAoDiiM  cwa  it  (a  a  p  (<  a  b)  >  <-  «))  <-  *))  <-  (<  (-3b)  >  (-  b))  {-  b>)  {-  (<  (-  a  b)  >  (-  c))  (-  c ) ) 
(*•  (<  (-  a  b)  >  (-  d))  ( ^  d))) 
lOiuTtOMi  (a  a  b  (-  c)  (-  d)) 

time.  .262  closures.  16  •««. 

Moslem  c  wai  FUIt«  0«m«u  lt««kii(A  (Isblock  bl )  ( (<  (Isblock  bl )  >  (-  (Isblock  bl ) ) )  (-  (Isblock  bl))) 

(••  (<  (Isblock  bl)  >  (-  (Isblock  b2)))  (-  visblock  b2))) 

(~  (<  (isblock  bl )  >  <-  (Isblock  b3)>)  {-  (Isblock  b3)))) 
solution,  (a  (Isblock  bl)  (-  (Isblock  b2))  <-  ( Isblock  b3))) 

TlMCi  .123  *«•«.,  CLOSURES,  21  •!». 
Figure 7

Figure 8 gives examples which are not representable in CWA but which can be viewed as
extensions of this system to the case where variables are allowed to cross modal
scopes.


(Nonmonotonic Extensions-to-CWA)

••«•«•»»!•»  •  »«.  i(*  <3  (x)  (q  x))  (->  (p  bl))  (W  (y)  (•  (<  (-  (3  (x)  (q  x))  (-  (p  bl))>  >  (p  y))  (p  y )))) 

solution.  <«  <~  <p  bl)>  {8  (y>  <-  <[«J  y  bl)  (o  y>)>  <3  <x)  <q  x>>)  "  IP  vn  IP  yllll 

timci  .364  ......  detunes.  36  .... 

pnosliu  u.c. si..t..(-  (Isblock  bl)  (Isblock  b2)  {Isblock  b3) 

(W  (x)  («  (<  (-  (Isblock  bl)  (Isblock  b2)  (Isblock  b3))  >  (->  (Isblock  x)))  (-  (Isblock  x))))) 
solution.  <-  (Isblock  bl)  (Isblock  b2)  (Isblock  b3)  '  1  " 

(W  (x)  (-  <  t*3  x  bl)  ([•]  x  b2)  (C*3  x  b3)  (-  (Isblock  x))))) 

T.Mfi  .369  ......  cLosunts.  138  .... 

mosLLu  tw.ni.ii  (-  (bird  tueety)  (8  (x)  <«  <-  (bird  x)  (<  (bird  tucety)  >  (fly  x)))  (riy  x)))) 
solution.  (-  (bird  tucety)  (fly  tucety)  (8  (y)  (-  (  [*)  y  tucety)  (-  (bird  y) )  (fly  y)))) 

TIDE.  .183  >••■..  CLOSURES.  43  all. 

problem  cw...Hi»  (-  (bird  tucety)  (bird  chilly)  (-.  (fly  chilly)) 

(U  (x)  (■*  («  (bird  x)  (<  (*  (bird  tueety)  (bird  chilly)  (~  (fly  chilly)))  >  (fly  x)))  (fly  x)))) 
solution.  (-  (bird  chilly)  (bird  tueety)  (fly  tueety)  ('  (fly  chilly)) 

(V  (y)  (-  ([*)  y  chilly)  ( M  y  tueety)  (~  (bird  y))  (fly  y)))) 

TIMT.  .64?  I.ai.,  CLOSUNCS.  131  alt. 

PROBLEM  Ol.jaaa.lya  Oaail.TIa . tla.i  (»  (*  (fly  tueety)  (fly  chilly)) 

(8  (x)  (.  (<  (-  (fly  tueety)  (fly  chilly))  >  (-  (fly  x)))  (x  (fly  x))))) 

SOLUTION.  nil 

TIME.  .631  >aai.,  CLOSURES.  164  al>. 

PROBLEM  C«aflU«l*f  0«fa«lii  (M«0«raqtt*s  PkD.  Cx«iapl«)i 

(-  (prof  fr)  (nd  fr)  (8  (x)  (*  (-  (prof  x)  (<  {-  (prof  fr)  (nd  Tr >>  >  (phd  x) ) )  (phd  x))) 

(8  (x)  («  (-  (nd  x)  (<  (*  (pro  f  fr)  (nd  fr))  >  ( -  (phd  x))))  ( -  (phd  x))))) 

solution.  nil 

I  71  ME.  .48?  .aa...  CLOSUNCS.  9?  all. 

problem  o. .......  ...  (8  (x)  (-  (-  (bird  x)  (-  <eb  x))>  (My  x)))  (8  (x)  (.  (ostrich  x)  (ob  x))) 

(8  (y) 

(- 

(<  (-  (8  (x)  (.  (-  (bird  x)  (-  (eb  x))>  (fly  x)))  (8  (x)  (-  (ostrich  x)  (sb  x>>>) 

> 

Ub  y))) 

<-  («b  y))))) 

s  olutiomi  (~  (V  (y)  (-  (ab  y)))  (V  (x)  (-  (-  (ostrich  x) )  ( ab  *)))  (W  (x)  (-  (-  (bird  *))  ( ab  x)  (fly  *)))) 

TlMti  1.81$  «€«>.,  ClOSVACSf  250  elf. 

rnOfiCM  StMitrtik  «•<  Mllitaa  1.  (-  (bird  tueety)  (ostrich  sen))  (V  (x)  (-  (ostrich  x)  (bird  x))> 

(V  (y) 

<- 

(<  (**  (bird  tueety)  (-  (ostrich  sen))  (0  (x)  ( *  (ostrich  x)  (bird  x)))) 

> 

(-  (bird  y))) 

<-  (bird  y) > ) > ) 

Sot UTiOMi  (-  (bird  tueety)  (-  (ostrich  sen))  (V  <h)  (~  (^  (ostrich  *))  (bird  *))) 

(W  (y)  (~  <  (*]  y  tueety)  (-  (bird  y))))) 

TIME*  1.217  ClOSUREti  309  «li. 

problem  GtvaiiMll  Niim*  2i  (-  (V  (x)  ( *  (knight  k)  (person  x)))  (V  (x)  ( *•  (knave  x)  (person  x))) 

(W  (x)  (4  (knave  x)  (liar  x)))  (3  (x)  (»  ( -  (liar  «))  ( ->  (knave  x))))  (liar  nork) 

(knave  bork) 

(V  (y) 

(- 

(<  (*  (0  (x)  ( *  (knight  x)  (person  x)))  (0  (k)  (4  (knave  x)  (person  x) ) ) 

(0  (x.  ( (knave  x)  (liar  x)))  (3  (x)  (~  ( **  (liar  x) )  (-  (knave  x))))  (Mar  nork) 
(knave  bork)) 

> 

<-  (Her  y))) 

(**  (Usr  y))))) 

fotUTiOMi  (knave  bork)  (liar  bork)  (liar  nork)  (person  bork)  (0  (x)  (-  (-  (knight  *))  (person  x))) 

(0  (y)  y  bork)  (-*  (knave  y))  (liar  y)))  (V  (y)  ( -  (  [■)  y  bork)  (*»  (knave  y))  (person  y))) 

(0  (y)  <~  ([•)  y  bork)  <C*)  y  nork)  (-  (  1  Ur  y))>>  (3  (x)  <~  (-  (knave  x))  (-  (liar  x))))> 

TlWti  $.237  •«•«.,  CLOSUfltft  1194  »i«. 
Figure 8

5.  Implementation 

The derived rules of inference of Z used for nonmonotonic reasoning are implemented
as definitions in the programming language Schemata, which was specifically designed
to express arbitrary deductive operations. Listed below are the Schemata definitions
which implement derived rules of inference needed to manipulate the disjunction symbol: ∨.


(define    ∨ _∨)
(defaxiom! ∨0        (∨) $f)
(defaxiom  ∨-Sort    (∨ . r) (if (null? (set! r (sort* r ∨order))) (cut! (∨ . r)) (cut! (∨ . r))))
(defaxiom! ∨[]pp     (∨ •••1 (_[] p) •••2 p •••3) (∨ •••1 •••2 p •••3))
(defaxiom! ∨∧Subsump (∨ •••1 (_∧ . r1) •••2 (!test (_∧ . r2) (subsume? r1 r2)) •••5)
                     (∨ •••1 (_∧ . r1) •••2 •••5))
(defaxiom! ∨∧absId   (∨ •••1 (_∧ •••2 x •••3) •••4 x •••5) (∨ •••1 •••4 x •••5))
(defaxiom! ∨abs¬     (∨ •••1 (_¬ x) •••2 (!both (π x) (!test e (unmodalized-in? x e))) •••3)
                     (∨ •••1 (_¬ x) •••2 (π $t) •••3))
(defaxiom! ∨abs      (∨ •••1 x •••2 (!both (π x) (!test e (unmodalized-in? x e))) •••3)
                     (∨ •••1 x •••2 (π $f) •••3))
(defaxiom! ∨assoc    (∨ •••1 (_∨ •••2) •••3) (∨ •••1 •••2 •••3))
(defaxiom! ∨f        (∨ •••1 $f •••2) (∨ •••1 •••2))
(defaxiom! ∨t        (∨ •••1 $t •••2) $t)
(defaxiom! ∨1        (∨ x) x)


Defaxiom defines a derived rule of inference which causes the subexpressions which
match its second argument to be replaced by the corresponding binding of its third
argument. Defaxiom! is like defaxiom except that it cuts alternative applications
from the search space. The underscore symbol _ in the second argument indicates
that the succeeding symbol is a constant to the matcher and is not a local variable
that may be bound by matching. The •••n variables are segment variables which match
0 or more expressions, and π is a schemator variable [Morse].


6.  Conclusion 

Since, in general, each of these 4 nonmonotonic theories computes different things,
it  might  at  first  appear  that  their  computational  properties  cannot  be  compared. 
However,  this  is  not  entirely  the  case,  as  there  are  a  number  of  metatheorems  which 
show  that  various  nonmonotonic  theories  give  identical  results  when  applied  to  axiom 
sets  of  certain  forms. 


There  are  two  results  comparing  Autoepistemic  Logic  and  Nonconstructive  Default  Logic  to 
Reiter’s  Default  Logic.  The  first  result  [Brown89a]  states  that  these  logics  give 
essentially  the  same  answer  if  no  default  in  the  axiom  set  has  a  necessary  hypothesis. 

This condition is not necessary, as examples NCD.Reiter2.4 in figure 1 and
CD.Reiter2.4 in figure 3 show. In fact, all the analogous examples except the
Gelfond-Przymusinska example in these two figures have the same result. Since the
deductions for Nonconstructive Default Logic run more than twice as fast as the analogous
examples for Reiter's Default Logic, there is some evidence in favor of the
nonconstructive default logic. The difference in speed is caused by the more complex
nature of Reiter's fixed point equation, which involves an extra existential quantifier inside.

The second result relating these systems [Konolige] says essentially that the solutions
of Reiter's Default Logic are a subset of those of Nonconstructive Default Logic. Given
the relative deduction speeds obtained, the possibility arises of computing the fixed
points of Reiter's Default Logic by solving for the Z solutions of Nonconstructive
Default Logic, then plugging each such solution back into the Z equation for Reiter's
Default Logic and eliminating those which do not simplify to true.

One  fairly  general  result  [Brown89a]  relating  Parallel  Circumscription  to  the 
quantificationally  generalized  closed  world  assumption  states  that  if  the  generalized 
closed  world  assumption  of  any  set  of  predicates  is  logically  possible  then  it  is  the 
same  as  the  circumscription  of  that  set  with  those  predicates  regardless  of  which  other 
predicates  are  fixed  or  variable.  Since  the  generalized  closed  world  assumption  of  any 
set  of  predicates  of  a  set  of  Horn  clauses  is  logically  possible  (i.e.  because  the 
intersection  of  all  the  models  of  such  theories  is  itself  a  model)  this  is  a  fairly 
useful relationship. The disjunctive quantification example (figures 5 and 8) shows
that it cannot be extended to sets of non-Horn clauses. For Horn clause theories,
deducing  the  generalized  closed  world  assumption  always  took  less  time  than  deducing  the 
corresponding  circumscription.  Thus  this  provides  some  evidence  in  favor  of  the  former 
theory. 

Since  all  these  theories  have  been  represented  in  the  modal  quantificational  logic  Z  and 
since  it  can  represent  extensions  to  all  these  theories  (see  figures  2,  4,  6,  8),  and 
since  all  these  problems  have  been  solved  with  a  deduction  system  consisting  of  derived 
rules  of  inference  of  Z,  it  may  be  concluded  that  Z,  even  though  it  is  a  monotonic 
logic,  is  an  interesting  theory  of  nonmonotonic  reasoning. 
A Logic Programming Approach to Network Flow Algorithms
Andrew W. Harrell

U.S. Army Engineer Waterways Experiment Station,
Vicksburg, Mississippi
Abstract 

The  well-known  Ford-Fulkerson  algorithm  and  most  of  the  more 
recent  approaches  to  solving  the  network  maximal  flow  and  minimal- 
cost  flow  problems  use  a  labeling  procedure.  Labeling  involves 
using  nodes  in  the  network  which  have  values  and  are  updated  by 
adding  a  series  of  augmenting  flows  or  edges  until  the  optimal 
solution  is  reached.  In  this  paper,  an  alternative  approach  is 
examined  using  the  full  ordered  list  of  flow  paths  without  cycles. 
This  list  is  generated  by  a  Prolog-based  depth-first  search  with 
backtracking  of  the  type  described  by  Winston  [13], [14].  This 
approach  keeps  track  of  the  full  queue  of  partial  search  paths  and 
is  easier  to  use  to  examine  the  solution  for  weak-links  or  critical 
nodes.  If  a  modeler  is  creating  a  network  to  represent  a  real 
situation  it  is  reasonable  to  assume  that  the  number  of  ingoing  and 
outgoing  edges  to  a  vertex  are  limited.  Time  bounds  are  presented 
to  demonstrate  that  the  above  approach  is  under  these  conditions 
as  efficient  as  the  n-cubed  algorithms  explained  in  Tarjan  [12] 
which  use  a  method  of  labeled  preflows. 

Key  words  -  network  algorithm,  depth-first  search,  maximal 
flow,  min-cost  flow,  logic  programming,  backtracking. 


1.  Introduction. 

With the development of computer graphics techniques for
displaying digital map information, new ways of representing unit
movement and aircraft or ship routing have been developed. However,
the use of digital map data presents problems, for example in
representing the effects of various types of on- and off-road
obstacles, underwater mines, and bridge interdiction on movement
rates and routing possibilities. Programs must be written to
define and store route movement networks and arrays of obstacles.
For this paper, we will assume that this capability already exists, along with
avenues of approach or movement corridors and their corresponding
traverse speeds across the map. Harrell [7], [8], [9] gives a partial
description of some current techniques for doing this. The following
short glossary defines some of the basic terms that will be used:

Backtracking  -  An  algorithmic  search  scheme  which  in  order  to 
compute  all  the  ways  to  satisfy  a  given  goal  computes  one  solution 
through  following  a  series  of  branching  points  and  then  retraces 
its  steps  to  the  last  previous  decision  in  order  to  compute  another 
possible  solution. 

Cycle  -  A  path  with  the  same  starting  and  ending  node. 

Dead  end  -  A  node  in  a  network  from  which  no  edges  proceed. 

Edge  -  The  line  connecting  two  nodes  in  a  network.  Each  edge  in  a 
network  usually  has  associated  with  it  a  traverse  time,  vehicular 
speed,  or  a  flow-rate,  and  represents  a  given  portion  of  the 
overall  map. 

Flow - An assignment of flow-rates to some or all of the edges
in a given network. Each flow-rate has to be less than or equal to
the flow-capacity of its edge.

Flow-capacity-  The  largest  allowable  flow-rate  for  a  particular 
edge. 

Flow-rate - The number of vehicles per hour that can pass over a
given edge in the network. As explained in the text, this can be
calculated as [1/(time it takes a group of vehicles to traverse the
edge)] * (number of vehicles in the group).

Maximal  flow  problem  -  The  problem  of  determining  what  is  the 
greatest  number  of  vehicles/hour  that  can  travel  through  a  network 
at  a  given  time.  It  is  computed  by  designing  an  algorithm  to 
optimize  the  assignments  of  flows  to  edges  in  the  network. 

Maximal flow value - The value which is a solution to a maximal
flow problem. Note that it is possible for there to be several
different network flows which realize a given maximal flow value.

Min-cost flow network - A flow network with costs (times to
traverse) as well as flows associated with its edges. In this paper
the cost associated with a flow is determined as follows: the flow
rate on each flow path solution through the network is multiplied by
its time of traversal, and the results are summed over all paths to
obtain a total cost associated with a given maximum flow solution.
This is the measure of effectiveness which determines the optimality
of the solution. The total cost of the flow can then be divided by
the total maximum flow to obtain an average cost per vehicle to
travel through the network.

Min-cost  flow  problem-  The  problem  of  determining  from  all  the 
possible  flows  which  realize  a  given  network  maximal  flow  value, 
those  which  do  it  with  minimal  cost. 

Network  -  A  collection  of  nodes  and  edges  that  represent  movement 
possibilities  over  a  given  terrain  area. 

Node - A point of reference in a network at which edges begin and
end.

Path - An ordered list of edges, each of which has the same
starting node as the preceding edge's ending node.
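
The arithmetic in the Flow-rate and Min-cost flow network entries above can be sketched as follows. This is only an illustration: the function names and the sample figures (a group of 30 vehicles, two flow paths) are hypothetical and do not come from the paper.

```python
def flow_rate(vehicles_in_group, traverse_hours):
    """Vehicles per hour over an edge: (1 / traversal time) * group size."""
    return vehicles_in_group / traverse_hours


def total_cost(path_flows):
    """Sum of (flow rate * traversal time) over all flow paths.

    path_flows is a list of (flow_rate, traversal_time) pairs, one per
    flow path in a given maximum flow solution.
    """
    return sum(rate * time for rate, time in path_flows)


def average_cost(path_flows):
    """Total cost divided by total flow: average cost per vehicle."""
    total_flow = sum(rate for rate, _ in path_flows)
    return total_cost(path_flows) / total_flow


# Hypothetical data: 30 vehicles crossing an edge in half an hour,
# and a solution made of two flow paths.
edge_rate = flow_rate(30, 0.5)
solution = [(60.0, 0.5), (40.0, 1.0)]
print(edge_rate, total_cost(solution), average_cost(solution))
```

Here the average cost is expressed in the same time unit as the traversal times, since the flow rates cancel in the division.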

An artificial intelligence network algorithm is a methodology
based on searches for paths from start nodes through a network to
ending goal nodes using the methods of logic programming. The
search  mechanism  proceeds  in  an  orderly  fashion  unifying  the 
variables  in  the  search  predicates  from  one  level  of  search  in  the 
network  to  another.  The  algorithm  used  must  save  the  partial 
solutions  in  an  environment  list  so  it  can  backtrack  its  way 
through  the  previous  variable  bindings  in  order  to  generate  all 
possible  ways  of  reaching  the  goal  state.  This  differs  from  many 
network  algorithms  that  use  labels  (instead  of  a  list  of  partial 
search  paths)  at  the  nodes  to  store  information  as  the  steps  in  the 
algorithm  proceed.  Thus,  after  the  labeling  algorithms  are  through 
generating  solutions,  information  is  not  kept  on  "how"  the 
solutions  were  reached. 

The  search  algorithm  discussed  in  Section  2  below  will  print 
out  ordered  lists  of  shortest  paths  with  and  without  the  presence 
of  obstacles.  These  lists  reveal  the  critical  nodes  or  weak  links 
that  most  affect  the  optimal  paths  in  the  network.  In  order  to  do 
this  and  compute  movement  possibilities  across  cross-corridors  an 
algorithm has to keep track of more information than can be stored
on  just  a  single  label  per  node  in  the  network  or  on  a  single 
search  tree.  One  needs  to  store  the  same  kind  of  list  of  partial 
solutions  that  a  logic  programming  unification  algorithm  does  when 
it  tries  to  satisfy  goal  predicates. 

Similarly, in developing programs to compute network flow
rates which identify the critical nodes in the solution, it is
important to compute the maximal or min-cost flows in terms of an
ordered list of paths from the start node (or set of nodes) to the
goal node (or set of nodes). The solution can be displayed just as
a  logic  programming  interpreter  displays  in  turn  the  list  of 
predicate  variable  identifications  which  satisfy  the  specified 
goal.  The  question  then  becomes  whether  this  approach  is  feasible 
in  terms  of  search  time  bounds  and  how  it  is  implemented. 

These questions are answered in the remainder of this paper, which
contains five sections. Section 2 presents the main search algorithm
used to compute shortest paths or maximal flow paths and derives the
number of search steps required to generate all these solutions.
Section 3 explains how this algorithm can be used to solve the
maximum flow problems associated with certain types of networks.
Section 4 presents simple modifications to this algorithm that can be
used to solve the corresponding min-cost flow problem. Section 5
contains a short discussion of the appropriateness of these
algorithms for the transportation problem and the assignment
problem, and the final section contains the conclusions.
2. The Main Search Algorithm

As  mentioned  above,  outputting  the  full  search  path  allows  the 
user  to  determine  the  effect  of  weak  links  or  choke  points  on  the 
solution. For example, in on- or off-road movement networks based
upon digitized maps, it is important to know the effects of
minefields, anti-tank ditches, abatis, and road craters on the
overall vehicular flow rate that can be achieved across the
terrain. Network path-generating algorithms based upon dynamic
programming, dynamic tree structures, or node labeling do not save
the information on the movement possibilities through
cross-corridors in the terrain. This increases computational speed in
many  cases,  but  important  information  about  the  vulnerability  or 
sensitivity  of  the  solution  to  degrading  factors  is  lost.  The  best 
algorithm  for  these  purposes  is  one  that  provides  a  way  to  measure 
the  effect  of  changing  flow  rates  and  times  in  certain  parts  of  the 
network  on  the  overall  solution. 

An  example  of  such  an  algorithm  is  given  below.  The  search 
procedure  presented  keeps  track  of  the  next  best  choices  in  a 
sorted  priority  queue.  This  is  necessary  so  the  algorithm  can 
backtrack  quickly  to  find  another  solution  after  it  has 
determined  the  shortest  path  or  failed  to  reach  a  goal  in  a 
given  direction.  In  order  to  do  this,  it  was  convenient  to  write 
the program in Prolog. An algorithm that does this is
described in the book by Winston [13]. The description of the
algorithm is as follows:


Step 1   Form a queue of partial paths. Let the initial
         queue consist of the zero-length, zero-step path from the
         start node to nowhere.

Step 2   Until the queue is empty or the goal has been reached,
         determine if the first path in the queue reaches the goal
         node.

    Step 2a   If the first path reaches the goal node,
              do nothing.

    Step 2b   If the first path does not reach the goal
              node:

        Step 2b1   Remove the first path from the queue.

        Step 2b2   Form new paths from the removed path
                   by extending it one step.

        Step 2b3   Add the new paths to the queue.

        Step 2b4   Sort the queue by cost accumulated so
                   far, with least cost paths placed in front.

Step 3   If the goal node has been found, announce success;
         otherwise, announce failure.
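
The steps above can be sketched in a few lines. The sketch below is written in Python rather than the Prolog of the original implementation, and it makes two assumptions that are not in the paper: it keeps the queue ordered with a binary heap instead of re-sorting it at Step 2b4, and it continues past the first success so that every cycle-free path is reported in order of cost. The network EDGES is a hypothetical four-node example.

```python
import heapq


def ordered_paths(edges, start, goal):
    """Yield (cost, path) for every cycle-free path from start to goal,
    least-cost path first.

    edges maps a node to a list of (next_node, edge_cost) pairs.
    """
    # Step 1: the queue initially holds only the zero-length path.
    queue = [(0.0, [start])]
    while queue:  # Step 2: loop until the queue is empty
        # The heap keeps the least-cost partial path in front (Step 2b4).
        cost, path = heapq.heappop(queue)  # Step 2b1
        node = path[-1]
        if node == goal:
            # Step 2a / Step 3: a complete path; report it and go on.
            yield cost, path
            continue
        # Steps 2b2 and 2b3: extend the removed path by one step,
        # skipping any extension that would form a cycle.
        for next_node, edge_cost in edges.get(node, []):
            if next_node not in path:
                heapq.heappush(queue, (cost + edge_cost, path + [next_node]))


# Hypothetical network, used only to exercise the sketch.
EDGES = {
    "A": [("B", 2.0), ("C", 5.0)],
    "B": [("C", 1.0), ("Z", 6.0)],
    "C": [("Z", 2.0)],
}

for cost, path in ordered_paths(EDGES, "A", "Z"):
    print("".join(path), cost)
```

Stopping after the first yielded path gives the behavior of Steps 1 through 3 as stated; exhausting the generator gives the full ordered list of paths that the later examples tabulate.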


The  algorithm  as  given  terminates  when  the  shortest 
incomplete  path  is  longer  than  the  shortest  complete  path.  In 
this  situation  there  are  no  further  paths  needing  to  be 
investigated  for  optimality.  Since  the  paths  which  could  never  be 
optimal  have  been  pruned  out  at  an  earlier  stage,  the  queue 
remaining  (which  has  been  sorted  at  each  stage)  contains  at  its 
head  the  optimal  path. 

The Prolog and Pascal source code for one
particular implementation of the algorithm are given in Harrell's
report [9], and the algorithm can be implemented in the C language
using essentially the same code. There is a way to implement the
algorithm using a dynamic tree structure to keep the environment
of partial solutions through which it is able to backtrack (see the
book by Bratko). However, as mentioned above, a tree can store
information about only one partial path from its root to each leaf
or subtree node.

The question then becomes whether the list of all partial
paths accumulated using the search algorithm becomes so large that
it is impractical to manipulate. The theorems and lemmas listed
below prove that under certain restrictions, namely 1) no dead
ends in the network, 2) an upper bound on the number of edges
entering and leaving any vertex, and 3) a bound on the number of
nodes which are critical in the sense defined below, the time
it takes to finish this type of algorithm is no longer than for
the algorithms which compute shortest paths to create maximal and
min-cost flows according to the approaches of Edmonds and Karp [4].
Let:

ni = the number of nodes with i edges proceeding from them,

nji = the number of nodes with j edges entering them and i
edges proceeding from them,

maxe = the maximum number of edges proceeding from any node
in the network,

emax = the maximum number of edges entering any node in the
network.


Call a node a critical backtracking node if it has more than
1 edge proceeding from it and more than 1 edge entering it.

Let ncrit = the number of critical backtracking nodes in a
network.

Call a node of the network a q stage lth critical path
backtracking node if it is a critical path backtracking node and
it is preceded in the network by q levels of backtracking nodes,
each having more than 1 edge entering them. Moreover, there must
be l of these backtracking nodes with more than 1 edge at the
search level preceding the given node.

Let n^{ql}_{ji} = the number of q stage lth critical path
backtracking nodes with j edges proceeding into them and i edges
leaving them.

Let n1crit = the number of critical path backtracking nodes which
are not q level lth critical for q or l > 1.

Examples of these definitions will be given in the course
of the following discussion.


Theorem 1  Suppose we are given a connected directed graph with a
starting node and a goal node and no dead ends other than the goal
node. If, moreover, there are at most ncrit critical path backtracking
nodes with at most a q level instance of prior influence, then the
number of different paths (containing no cycles) from the starting
node to the ending node is bounded by the expression

1 + (emax - 1)*(n - n1 - ncrit) + (maxe*emax - 1)*n1crit
  + ((maxe*emax)^q - 1)*(ncrit - n1crit)
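
As a quick numerical aid, the bound of Theorem 1 can be evaluated directly. The sketch below assumes that the exponent on (maxe*emax) in the last term is q, by analogy with the corresponding term of Theorem 2; the function name and the sample values are hypothetical.

```python
def theorem1_path_bound(n, n1, ncrit, n1crit, emax, maxe, q):
    """Upper bound on the number of cycle-free paths, as read from Theorem 1.

    n      : number of nodes
    n1     : number of nodes with exactly one edge proceeding from them
    ncrit  : number of critical backtracking nodes
    n1crit : critical nodes that are not q level lth critical for q or l > 1
    emax   : maximum number of edges entering any node
    maxe   : maximum number of edges proceeding from any node
    q      : maximum level of prior influence (assumed exponent)
    """
    return (
        1
        + (emax - 1) * (n - n1 - ncrit)
        + (maxe * emax - 1) * n1crit
        + ((maxe * emax) ** q - 1) * (ncrit - n1crit)
    )


# Hypothetical network with no critical backtracking nodes.
print(theorem1_path_bound(n=14, n1=6, ncrit=0, n1crit=0, emax=3, maxe=3, q=1))
# Hypothetical network with two critical nodes, one of them at level q = 2.
print(theorem1_path_bound(n=10, n1=4, ncrit=2, n1crit=1, emax=2, maxe=2, q=2))
```

With no critical nodes the last two terms vanish and the expression reduces to the simpler bound (emax - 1)*(n - n1) + 1 used in the proof.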


Proof: This theorem is proved by following through the steps
of the above algorithm and counting the number of ways new paths
are generated. Step 2b4, which ensures the solutions will be
generated in order of shortest length, is not necessary if the
algorithm is only being used to generate all possible paths. At
Step 2b, new paths are added to the queue of partial paths each
time the search predicate finds a node following the current node
which does not form a cyclic path. Since 1) the graph is finite,
2) there are no dead ends, 3) the graph is connected, and 4) no
cycles are permitted, each new path will eventually reach the goal
node.


During  the  generation  of  the  list  of  partial  paths,  nodes  with 
only  one  edge  proceeding  into  them  and  one  edge  leaving  them  expand 
the  current  path  but  do  not  add  any  additional  combinatorial  search 
possibilities  to  keep  track  of  during  the  backtracking  process. 

Then, the number of paths which the non-critical backtracking
nodes enter into is:

1 + 1*n2 + 2*n3 + 3*n4 + ... + (j - 1)*nj + ... + (emax - 1)*n_emax      (1)

Using the fact that n = n1 + n2 + ... + n_emax, we note that the
above number is bounded by:

(emax - 1)*(n - n1) + 1

The example below (Figure 1) illustrates how equation (1) counts
paths in a network without any critical backtracking nodes.


Figure 1. Example 1

There are 14 nodes, not counting Z.

n1 = 6, n2 = 7, n3 = 1, emax = 3

number of paths = (7 + 1)*1 + 2*(1) = 10

path       search steps used to generate path (see footnote 1)

ABDIZ      4
ABDGZ      1
ABDGFZ     1
ABEFZ      2
ABENZ      1
ACHMZ      3
ACJLMZ     3
ACJLZ      0
ACJLKZ     1
ACJKZ      1
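
The count for Example 1 can be checked mechanically. The short sketch below evaluates equation (1) and the bound that follows it, using the values quoted for Example 1; the function names are hypothetical.

```python
def path_count(out_degree_counts):
    """Equation (1): 1 + sum over i of (i - 1) * ni.

    out_degree_counts maps i to ni, the number of nodes with i edges
    proceeding from them.
    """
    return 1 + sum((i - 1) * n_i for i, n_i in out_degree_counts.items())


def path_count_bound(n, n1, emax):
    """The bound (emax - 1)*(n - n1) + 1 derived from equation (1)."""
    return (emax - 1) * (n - n1) + 1


# Example 1: n1 = 6, n2 = 7, n3 = 1, with 14 nodes and emax = 3.
example1 = {1: 6, 2: 7, 3: 1}
print(path_count(example1))                   # number of paths
print(path_count_bound(n=14, n1=6, emax=3))   # upper bound
```

The count agrees with the ten paths listed for Example 1, and the bound is the looser figure obtained by replacing every coefficient (i - 1) with (emax - 1).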


If the network we are considering has q stage lth critical
path backtracking nodes, but none with q or l greater than 1, then the
search algorithm will generate an additional:

\sum_{j=2}^{emax} \sum_{i=2}^{maxe} (j*i - 1) * n^{11}_{ji}
    = 3*n^{11}_{22} + 5*n^{11}_{23} + ... + 5*n^{11}_{32} + ...
      + (maxe*emax - 1)*n^{11}_{emax,maxe}   paths.

This number is bounded by:

(maxe*emax - 1)*(n1crit).

1  A search step is defined to be one cycle of search through
the database of edges to determine which nodes are connected to a
given edge. It is assumed the network information is stored in a
vector structure in which each edge along with its starting and
ending node and value are kept. Since in generating the queue of
search paths a new path uses the nodes from the prior search paths,
it is not necessary to search through the database for all the
prior nodes in creating the new paths.

Example 2 - Consider the following movement network, having two
starting nodes A1 and A2 and three goal nodes E1, E2, and E3:

Solution:


To  compute  all  the  paths  for  this  network  break  it  up  into 
six  connected  components  corresponding  to  each  possible  combination 
of  starting  node  and  ending  node.  Figures  3  through  7  show  this. 


1) A1 - E1

ABDZE
ABCFGE
ABCDZE
ABCFLGE
ABDZGE
ABCFZE
ABCDZGE
ABCFZGE

8 solutions. Z is a 1-stage 1st critical path backtracking node.

n2 = 2, n3 = 1, n^{11}_{22} = 1

8 = 1 + n2*1 + n3*2 + n^{11}_{22}*(2*2 - 1)
  = 1 + 2 + 2 + 3


2) A1 - E2

ABCFGME
ABCFLME
ABCFLGME
ABDZGME
ABCDZGME
ABCFZGME

6 solutions, n2 = 3, n3 = 1
6 = 1 + 1*n2 + 2*n3 = 1 + 3 + 2


3)  A1  -  E3  no  solutions 


4) A2 - E1

AHLGE
AIWHLGE
AIKJHLGE

3 solutions, n2 = 2
3 = 1 + 1*n2 = 1 + 2


5) A2 - E2

AHME
AIWHME
AIKJHME
AHLME
AHLGME
AIWHLME
AIWHLGME
AIKJHLME
AIKJHLGME

9 solutions, n2 = 3, n^{11}_{32} = 1
9 = 1 + n2*1 + n^{11}_{32}*(3*2 - 1)
  = 1 + 3 + 5


6) A2 - E3

AHE
AIWHE
AIKJHE

3 solutions, n2 = 2
3 = 1 + n2*1 = 1 + 2
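
The six component counts of Example 2 can be reproduced with a small helper that adds the critical-node term (j*i - 1) for each 1-stage 1st critical backtracking node to the count of equation (1). The sketch below uses only the node counts quoted in the six cases; the function and variable names are hypothetical.

```python
def component_path_count(out_degree_counts, critical_counts=None):
    """1 + sum (i - 1)*ni + sum (j*i - 1)*(number of 1-stage 1st critical
    nodes with j entering edges and i leaving edges).

    out_degree_counts : {i: ni} for the non-critical nodes
    critical_counts   : {(j, i): count} for the critical nodes
    """
    critical_counts = critical_counts or {}
    plain = sum((i - 1) * n_i for i, n_i in out_degree_counts.items())
    critical = sum((j * i - 1) * c for (j, i), c in critical_counts.items())
    return 1 + plain + critical


# Node counts as quoted in cases 1) through 6) of Example 2.
cases = {
    "A1-E1": component_path_count({2: 2, 3: 1}, {(2, 2): 1}),
    "A1-E2": component_path_count({2: 3, 3: 1}),
    "A1-E3": 0,  # stated to have no solutions
    "A2-E1": component_path_count({2: 2}),
    "A2-E2": component_path_count({2: 3}, {(3, 2): 1}),
    "A2-E3": component_path_count({2: 2}),
}
print(cases, sum(cases.values()))
```

The six counts sum to 29 paths for the whole network of Example 2.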
Figure 5. A2 - E1
If vehicular speeds are added to the edges in the network of
Example 2 as shown in Figure 8, the shortest path search algorithm
may be used to produce the full list of non-cyclic paths ordered
by their length in time (minutes to traverse). Figure 9 shows the
single shortest path, and the full ordered list is shown below.

SHORTEST PATHS

path         time (min)

AHE           48.7
AIWHE         56.8
AHME          63.0
AIKJHE        66.2
ABDZE         67.2
ABCFGE        67.2
AIWHME        71.0
ABCFGME       74.1
ABCDZE        74.9
ABCFLGE       76.2
ABCFLME       78.9
AHLGE         79.8
AIKJHME       80.5
ABDZGE        80.6
ABCFZE        81.4
AHLME         82.5
ABCFLGME      83.0
AHLGME        86.7
ABDZGME       87.4
AIWHLGE       87.9
ABCDZGE       88.4
AIWHLME       90.6
AIWHLGME      94.7
ABCFZGE       94.8
ABCDZGME      95.2
AIKJHLGE      97.4
AIKJHLME     100.0
ABCFZGME     101.7
AIKJHLGME    104.2


If the network contains q level lth critical path backtracking
nodes with q or l greater than 1, then the computation of the
number of possible paths becomes more complex. Given n^{ql}_{ji} q level
lth critical backtracking nodes, consider the sth such node. For
the mth among the q levels of that sth node which are preceded by
backtracking nodes with more than 1 edge entering them, and the rth
among the l edges entering the node, let PRIOR_{m,r,s} be the number
of edges entering that prior critical backtracking node. Then, the
number of new paths generated by these n^{ql}_{ji} q level lth critical
backtracking nodes is bounded by:

\sum_{s=1}^{n^{ql}_{ji}} \left( \prod_{m=1}^{q} \sum_{r=1}^{l} PRIOR_{m,r,s} \right) * i_s - 1

where i_s = number of paths leaving the sth critical backtracking node
(which is bounded by maxe).

So, the total number of such additional paths will be bounded by
(ncrit - n1crit)*((emax*maxe)^q - 1). Note that in this computation a
critical backtracking node may precede (by occurring closer to the
start node in the network) more than one other such node a certain
number of times. In this case this formula will overcount the number
of paths by that same factor. See the example below for an
illustration of how this can occur. But whatever the case, these
additional paths exhaust all the ways in which the algorithm can
possibly backtrack to produce solutions. Thus, adding this bound to
the previous one, we have the expression given in the statement of
the theorem.
Q.E.D.

Example 3 - In the network below the node B is a 2 level 2nd
critical backtracking node.

ARPMBCG      ARPMBDG      ARPMBEG
ARPQMBCG     ARPQMBDG     ARPQMBEG
ARPNBCG      ARPNBDG      ARPNBEG
ARPTNBCG     ARPTNBDG     ARPTNBEG
ASPMBCG      ASPMBDG      ASPMBEG
ASPQMBCG     ASPQMBDG     ASPQMBEG
ASPNBCG      ASPNBDG      ASPNBEG
ASPTNBCG     ASPTNBDG     ASPTNBEG

24 solutions

s = 1, q = 2, l = 2, i_s = 3

PRIOR_{m,r,s} = P_{m,r,s}

48 = 2*(2*2 + 2*2)*3 > 24

In this example B is a two stage 2nd critical backtracking
node. This is because the 2 backtracking nodes M and N which
precede B in the network each have more than 1 edge entering them,
and there are 2 levels of critical backtracking nodes, P and then
M, N, which precede B. Because the node P precedes the nodes M and
N, the formula overcounts by a factor of 2.


In the applications in which these theorems are used we have some
freedom in how many nodes and edges are included in the network.
The network will be a representation or model of some physical
situation or process. In practice it becomes more and more unlikely
that a random physical situation will give rise to planar graphs
containing many q level lth critical backtracking nodes with q or
l > 1. In movement networks based upon digitized maps such nodes
correspond to critical choke points at which the routes diverge to
go around an obstacle and then reconverge.

Theorem 2  Considering the same type of graph as in Theorem 1,
now allow paths to travel in directions opposed to the way the
edges are directed. Then the bound for the number of possible paths
is increased to:

1 + emax*(n - n1 - ncrit) + ((emax + maxe)*maxe - 1)*n1crit
  + (((emax + maxe)*maxe)^q - 1)*(ncrit - n1crit)

Proof: Count the number of paths using the algorithm as in Theorem
1. For those nodes with only one edge entering them, the new number
of possible edges which proceed outward has been increased by one.
These edges will create:

1 + 2*n2 + 3*n3 + ... + i*ni + ... + emax*n_emax paths.

This number is bounded by 1 + emax*(n - n1 - ncrit). For the nodes
with j > 1 edges entering them, the new maximum number of edges
leaving the node is emax + maxe. The rest of the formula follows
from the previous calculations.


Theorem 3  The number of search steps required to compute all
the paths of the type of graphs mentioned in Theorems 1 and 2 is
bounded by:

1 + emax*(n - ncrit) + [(emax)^(q+1)]*ncrit

where q = the maximum level of any q level lth critical path
backtracking node in the network.


Proof: This can be verified by tracing through the algorithm
and adding a marker to a node's count each time a search step is
performed. In following through the algorithm, note that since the
network is connected, each node is encountered during the searching
and backtracking a number of times equal to the sum, over the
different edges proceeding into it, of the number of times the node
preceding that edge has already been encountered. For the nodes
which are not critical path backtracking nodes we have:

1*(1n + 1) + 2*(2n) + ... + emax*(emaxn) search steps,

where in (n preceded by the index i) = the number of nodes with
exactly i edges entering them. This number is bounded by
1 + emax*(n - ncrit). For a node which is a q level lth critical
backtracking node, the sum over the different edges proceeding into
it of the number of times the node preceding that edge has already
been encountered is bounded by (emax)^(q+1).
Q.E.D.

Example 1 (continued)

In this graph, which has no critical path backtracking nodes,
we have 1n = 10 and 2n = 3, so the number of search steps should be
(10 + 1) + 2*3 = 17. This can be verified by tracing through the
algorithm and keeping a count as below:

node   search encounters      node   search encounters

A      I                      H      I
B      I                      M      II
D      I                      L      I
C      I                      J      I
I      I                      K      II
G      I                      N      I
F      II
E      I                      total number of encounters = 17
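
The same total can be obtained two ways: from the in-degree counts used in the proof of Theorem 3, and by adding up the tally marks in the table above. The sketch below does both; the function name is hypothetical, and the dictionary simply transcribes the table.

```python
def search_steps(in_degree_counts):
    """1*(1n + 1) + 2*(2n) + ..., that is, 1 + sum over j of j * (jn).

    in_degree_counts maps j to the number of non-critical nodes with
    exactly j edges entering them; the extra 1 accounts for the start node.
    """
    return 1 + sum(j * count for j, count in in_degree_counts.items())


# Counts quoted in the text: ten nodes with one entering edge,
# three nodes with two entering edges.
from_formula = search_steps({1: 10, 2: 3})

# Tally marks transcribed from the table, one entry per node.
encounters = {
    "A": 1, "B": 1, "D": 1, "C": 1, "I": 1, "G": 1, "F": 2, "E": 1,
    "H": 1, "M": 2, "L": 1, "J": 1, "K": 2, "N": 1,
}
from_table = sum(encounters.values())

print(from_formula, from_table)
```

Both totals also agree with the sum of the per-path search steps listed with the ten paths of Example 1 (4 + 1 + 1 + 2 + 1 + 3 + 3 + 0 + 1 + 1).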


Example 3 (continued)

This graph has a two level 2nd critical backtracking node B.
The number of search encounters for the various nodes is as below:

node   search encounters      node   search encounters

P      II                     B      IIIIIIII
Q      II                     C      IIIIIIII
T      II                     D      IIIIIIII
M      IIII                   E      IIIIIIII
N      IIII

total number of encounters = 46

The following theorem holds for the same reasons as the
preceding discussions.


Theorem 4 The number of search steps required to compute all
the paths in the case where the paths are allowed to travel in
opposite directions is bounded by:

1 + (emax + maxe)*(n - ncrit) +

(emax + maxe)^(q+1) * (ncrit - nlcrit)


Methodology for Solving Maximal Flow Problems

Given  that  there  is  a  method  of  generating  all  the  shortest 
paths  through  the  network,  it  is  easy  to  modify  the  predicates  to 
generate  a  list  of  maximal  flow  paths  through  the  network.  We 
now  assume  that  each  route  segment  has  an  associated  maximal  flow 
capacity.  At  each  stage,  simply  choose  the  direction  of  maximal 
flow  to  expand  the  paths,  and  perform  a  sort  on  the  maximal  flow 
of the route segments instead of minimum length. Then, write a
predicate that, each time we reach the goal node with a route, goes
back and subtracts that route's flow values from the network
capacities. When the Prolog backtracking search does not generate
any more solutions, all directed paths through the network have at
least one edge which is already filled to capacity. One other way
exists to increase the flow in the network. The paths which
contain edges that point backward along the allowed route segments
can be considered. Then, when the goal node is reached, proceed
back to the source and modify the flow capacities, adding flow
capacity along those segments instead of subtracting it. As
explained by Sedgewick [10], when the above procedure reaches a
situation in which all paths have either full forward edges or
empty back edges, the Ford-Fulkerson theorem says the
maximal flow of the network has been reached. Many of the
algorithms presently in use (Goldberg [6] and Tarjan [12]) for
solving  the  maximal  flow  problem  do  not  save  lists  of  partial  paths 
but  instead  use  a  labeling  process  to  update  information  at  each 
node.  This  increases  computational  speed,  but  makes  it  difficult 
to  pick  in  order  the  main  routes  that  contribute  to  the  optimal 
solution.  Since  it  may  be  desirable  to  do  sensitivity  analyses 
which  locate  the  points  in  the  network  which  most  affect  all  the 
possible  solutions,  the  above  approach  gives  more  information  after 
the  process  is  completed.  Thus,  it  is  possible  to  print  out  in 
order  of  flow  the  paths  which  contributed  to  the  max-min  cut 
situation.  This  then  can  be  used  to  plan  barriers  for  the  defense 
or  attack  routes  for  the  offense.  The  description  of  the  algorithm 
is as given below:

Step 1. Search for the best (maximal flow) route
from the starting to the ending node. Only search
in directions in which there is either a forward
edge with positive unused flow capacity, or a backward
edge with positive existing flow. If no route exists,
terminate the algorithm.

Step 2. Subtract the value of that flow from the
capacity slots in the route's definition and add
the flows (or subtract the flows if headed
in a backward direction along an edge). Go to Step
1.
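
Steps 1 and 2 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the Prolog implementation from the technical report: the sorted priority queue search is replaced here by a standard widest-path (largest bottleneck) search over a heap, and the names `widest_route` and `max_flow`, as well as the dictionary-of-capacities input format, are hypothetical choices made for the example.

```python
import heapq
from collections import defaultdict


def widest_route(residual, source, sink):
    """Step 1: find the route whose smallest residual capacity is largest.

    Returns (bottleneck, path), or (0, None) when no route remains.
    """
    best = {source: float("inf")}
    parent = {source: None}
    heap = [(-float("inf"), source)]
    while heap:
        neg_width, node = heapq.heappop(heap)
        width = -neg_width
        if width < best.get(node, 0):
            continue  # stale heap entry
        if node == sink:
            break
        for nxt, cap in residual[node].items():
            w = min(width, cap)
            # Only follow edges with unused forward capacity or
            # existing flow that can be pushed back.
            if cap > 0 and w > best.get(nxt, 0):
                best[nxt] = w
                parent[nxt] = node
                heapq.heappush(heap, (-w, nxt))
    if sink not in parent:
        return 0, None
    path, node = [], sink
    while node is not None:
        path.append(node)
        node = parent[node]
    return best[sink], path[::-1]


def max_flow(capacity, source, sink):
    """Repeat Steps 1 and 2 until no augmenting route exists."""
    residual = defaultdict(dict)
    for (u, v), cap in capacity.items():
        residual[u][v] = residual[u].get(v, 0) + cap
        residual[v].setdefault(u, 0)  # backward edge starts empty
    total, routes = 0, []
    while True:
        width, path = widest_route(residual, source, sink)
        if path is None:
            break
        # Step 2: update capacities along the route.
        for u, v in zip(path, path[1:]):
            residual[u][v] -= width
            residual[v][u] += width
        total += width
        routes.append((path, width))
    return total, routes


# Hypothetical five-edge network, not one of the paper's examples.
capacity = {("A", "B"): 3, ("A", "C"): 2, ("B", "C"): 1,
            ("B", "D"): 2, ("C", "D"): 3}
total, routes = max_flow(capacity, "A", "D")
print(total, routes)
```

Because each augmenting route is recorded together with its flow, the list `routes` gives the contributing paths in the order they were found, which is the extra information the text says a labeling method does not preserve.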

Further  explanation  of  the  algorithm  and  examples  of  its  use 
are  given  in  Harrell's  paper  [6].  The  source  code  of  its 
implementation  is  in  the  technical  report  [9].  If  flow  rates  are 
added  to  the  edges  in  Example  2  the  above  procedure  will  generate 
the  maximum  vehicular  flow  across  the  network  and  compute  the 
sensitivity  of  the  solution  to  changes  in  flow  rates  at  critical 
nodes. 

Off-road  vehicle  flow  capacities  may  be  estimated  from 
vehicular  movement  formations  and  speeds  in  the  terrain  corridor 
that  each  edge  corresponds  with.  Suppose  that  the  terrain  will 
support  a  certain  number  of  units  as  shown  in  Figure  11  and  the 
movement  speeds  are  as  in  Figure  8.  Figure  12  shows  the  number  of 
vehicles  per  square  kilometer  that  correspond  to  a  particular 
movement formation. Multiplying [1/(time it takes a group of
vehicles to traverse the edge)] * (number of vehicles in the group)
determines the flow rate associated with an edge. Then Figure 13
shows a maximum flow solution and Figure 14 shows the changes in
the solution caused by changing the speeds on three edges.

Figure 12. Standard cross country movement unit (effective
density of 5 vehicles/sq. km)

The flow paths and their maximum capacities that are
associated with the solutions of Figures 13 and 14 are:

Maximal flows without obstacles

MAXIMAL FLOWS

AIWHE   flow veh/hr = 85.5
ABCFZE  flow veh/hr = 81.0
AHE     flow veh/hr = 78.5
AIKJHE  flow veh/hr = 27.5

total flow 272.5 veh/hr


If:

a. Four anti-tank ditches with corresponding parallel tank
   bumps,
b. One standard (conventional) rectangular minefield,
c. Four scatterable minefields,

are emplaced, the maximal flows are reduced.

Maximal flows with obstacles

MAXIMAL FLOWS

ABCFZE  flow veh/hr = 81.0
AHE     flow veh/hr = 78.5
AIWHE   flow veh/hr = 15.5
AIKJHE  flow veh/hr = 6.5

total flow 181.5 veh/hr


Theorems  1  through  4  solve  the  problem  of  determining  how  many 
steps  it  takes  this  algorithm  to  generate  all  the  paths  that  create 
a  max-min  cut.  As  outlined  in  Example  3,  after  each  search  step  the 
new  edges  must  be  placed  in  a  sorted  priority  queue  of  current 
partial  search  paths.  The  maximum  number  of  insertions  required 
after  each  search  step  is  emax  +  maxe.  Each  insertion  requires  the 
checking  of  the  lengths  of  at  most: 

1 + emax*(n - nl - ncrit) + ((emax + maxe)*maxe - 1)*nlcrit
+ (((emax + maxe)*maxe)^q - 1)*(ncrit - nlcrit)

partial paths (according to Theorem 2) against the lengths of the
new paths created by adding an edge onto the active search path.
Let e* = maxe + emax, and assume e* <= some constant C1; then the
above expression is less than or equal to:

1 + ncrit*C1^(2q) + n*2*C1^2

The total number of steps is then bounded by:

{1 + (emax + maxe)*(n - ncrit) +

(emax + maxe)^(q+1) * (ncrit - nlcrit)} *

(emax + maxe) * {1 + ncrit*C1^(2q) + n*2*C1^2}.

This is less than or equal to:

{C1 + n*C1^2 + ncrit*C1^(q+2)}{1 + ncrit*C1^(2q) + n*2*C1^2}.

And this expression is of the order:

{n^2}*2*C1^4 + ncrit*C1^(3q+2)


Theorem 5 If, in designing the networks which represent the
movement possibilities, we limit ourselves to the case:

(emax + maxe)^4 <= n and (emax + maxe)^(3q+2) <= n^2, q <= 2

then the number of steps needed to solve the maximal flow problem
is of the order of n^3, the number of vertices cubed.

Also,  if  we  are  not  interested  in  generating  the  saturating 
flow  paths  in  priority  order  of  increasing  flow,  then  the  partial 
paths  do  not  need  to  be  sorted  after  each  search  step.  By  examining 
the  above  expressions  we  see  that  this  reduces  significantly  the 
time  the  algorithm  takes  to  compute  a  maximal  flow. 


4. Methodology for Solving the Minimal Cost Network Flow Problem

As  a  further  benefit  of  the  above  approach,  the  procedures 
developed  can  be  used  to  solve  the  minimal  cost  network  flow 
problem.  The  minimal  cost  network  flow  problem  is  a  generalization 
of  the  transportation  network  problem  in  operations  research.  In 
its  formulation  each  edge  is  assumed  to  have  a  cost  as  well  as  a 
flow capacity associated. In this paper, in order to determine the
cost associated with a flow, the following procedure is used: The
flow rate on each flow path solution through the network is
multiplied by its time of traversal and the result summed over all
paths in order to obtain a total cost associated with a given
maximum flow solution. This is the measure of effectiveness which
determines the optimality of the solution. The total cost of the
flow can then be divided by the total maximum flow to obtain an
average cost per vehicle to travel through the network. The
min-cost flow problem is then the problem of determining, from all
the possible flows which realize a given network maximal flow
value, those which do it with minimal cost. This measure of
effectiveness is important in wargaming because it represents the
amount of target exposure which is required for an offensive force
to reach its objectives. As in the shortest path algorithm
presented earlier, the algorithm that will be used to compute total
cost of the flow expands the paths at each stage in the direction
of shortest time. The resulting expanded pathlist will be sorted
on time of the routes (in the maximal flow algorithm, the sort was
on the maximal flow possibilities). Only those directions in which
the maximum flow algorithm says there is either a forward edge
with positive unused flow capacity or a backward edge with
positive existing flow should be chosen. As in the maximum flow
algorithm, when the goal node is reached we proceed back to the
source and subtract the maximum flow that the least cost incremental
flow route can handle (in the maximal flow algorithm, the route is
not necessarily the least cost incrementing flow).

The following theorem from Ford and Fulkerson [5], which is also
noted in Deo [3], ensures that this process will generate the
optimum solution.

Theorem 6 Let f be the minimal cost flow pattern of value w from
start to finish. The flow pattern f' obtained by adding delta >=
0 to the flow in the forward edges of a minimal cost unsaturated
path, and subtracting delta from the flow in the backward edges
of the path, is a minimal cost flow of value w + delta for the
original network.

The  same  source  code  predicates  can  be  used  to  implement  this 
algorithm: 


Step 1: Search for the best (minimal cost) route from
the  starting  to  the  ending  node.  Only  search  in  directions  in 
which  there  is  either  a  forward  edge  with  positive  unused  flow 
capacity,  or  a  backward  edge  with  positive  existing  flow.  If  no 
such  route  exists,  then  terminate  the  algorithm. 

Step  2:  Subtract  the  value  of  that  flow  from  the 
capacity  slots  in  the  route's  definition  and  add  the  flows  (or 
subtract  the  flows  if  going  in  a  backward  direction)  along  the 
edges.  Go  to  Step  1. 
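
The two steps can be sketched in Python as a successive least-cost augmentation. This is an illustration under stated assumptions rather than the paper's Prolog predicates: the least-cost route is found here with a Bellman-Ford relaxation (chosen because backward edges carry negative cost), and the function name `min_cost_max_flow` and the `(from, to, capacity, cost)` edge tuples are hypothetical.

```python
def min_cost_max_flow(edges, source, sink):
    """edges: iterable of (u, v, capacity, cost_per_unit).

    Returns (total_flow, total_cost, routes), where each route is
    (path, flow_on_path, cost_per_unit_on_path).
    """
    # Each residual arc is [to, remaining capacity, cost, index of reverse arc].
    graph = {}

    def add_arc(u, v, cap, cost):
        graph.setdefault(u, [])
        graph.setdefault(v, [])
        graph[u].append([v, cap, cost, len(graph[v])])
        graph[v].append([u, 0, -cost, len(graph[u]) - 1])

    for u, v, cap, cost in edges:
        add_arc(u, v, cap, cost)

    total_flow, total_cost, routes = 0, 0, []
    while True:
        # Step 1: least-cost route over arcs with positive residual capacity.
        dist = {node: float("inf") for node in graph}
        dist[source] = 0
        prev = {}  # node -> (previous node, arc index)
        for _ in range(len(graph) - 1):
            changed = False
            for u in graph:
                if dist[u] == float("inf"):
                    continue
                for i, (v, cap, cost, _rev) in enumerate(graph[u]):
                    if cap > 0 and dist[u] + cost < dist[v]:
                        dist[v] = dist[u] + cost
                        prev[v] = (u, i)
                        changed = True
            if not changed:
                break
        if dist.get(sink, float("inf")) == float("inf"):
            break  # no such route exists: terminate
        # Largest flow the route can carry.
        delta, v = float("inf"), sink
        while v != source:
            u, i = prev[v]
            delta = min(delta, graph[u][i][1])
            v = u
        # Step 2: update capacities along the route.
        path, v = [sink], sink
        while v != source:
            u, i = prev[v]
            graph[u][i][1] -= delta
            graph[v][graph[u][i][3]][1] += delta
            path.append(u)
            v = u
        total_flow += delta
        total_cost += delta * dist[sink]
        routes.append((path[::-1], delta, dist[sink]))
    return total_flow, total_cost, routes


# Hypothetical network: (from, to, capacity, traversal cost).
edges = [("A", "B", 2, 1), ("A", "C", 2, 2), ("B", "D", 1, 1),
         ("B", "C", 1, 1), ("C", "D", 3, 1)]
flow, cost, routes = min_cost_max_flow(edges, "A", "D")
print(flow, cost, routes)
```

The only change from a maximal flow search is the criterion used to pick the next route, which mirrors the remark that the same predicates can serve both algorithms.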


If  we  again  consider  the  network  in  Example  2,  it  is  now 
possible  to  solve  the  problem  of  determining  which  of  the  several 
maximum  flow  solutions  costs  less  in  the  above  sense. 

Minimal cost flows for the same network and same vehicle/weather
conditions are shown below in Figure 15. The maximum throughput
for the minimal cost flows is the same as that which results from
the maximal flow algorithm. The paths followed to achieve this
throughput are, however, different in the minimal cost flows from
those which result from running the maximal flow algorithm. This
is to be expected since in the minimal cost case the algorithm
chooses the direction of shortest time to expand the search path.
In the maximal flow case, the algorithm chooses the direction of
maximum flow to expand the search paths. In the list of the
minimum cost flow paths, we have included an average cost for each
flow. This is defined to be the sum, over the flow paths, of the
amount of each flow path (vehicles) times its cost (minutes),
divided by the total flow. Note
that this average does not change much with and without the
presence of obstacles. This is because what the obstacles affect
(being emplaced over only a part of the network) is primarily the
maximum throughput, and not the time through the network.


MIN_COST FLOWS

Minimum cost flows without obstacles

AHE     flow veh/hr = 78.5   cost time (min) = 48.7
AIWHE   flow veh/hr = 85.5   cost time (min) = 56.8
AIKJHE  flow veh/hr = 27.5   cost time (min) = 66.2
ABDZE   flow veh/hr = 66.5   cost time (min) = 67.2
ABCFGE  flow veh/hr = 14.5   cost time (min) = 67.2

total flow = 272.5 veh/hr. total cost = 78.5*48.7 + 85.5*56.8 +
27.5*66.2 + 66.5*67.2 + 14.5*67.2. average cost = 58.5 minutes per
vehicle
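
The average cost quoted above can be checked directly from the listed paths. The short Python sketch below only restates the arithmetic of the table; the variable names are chosen for this illustration.

```python
# (path, flow in veh/hr, traversal time in minutes) from the table above.
min_cost_routes = [
    ("AHE", 78.5, 48.7),
    ("AIWHE", 85.5, 56.8),
    ("AIKJHE", 27.5, 66.2),
    ("ABDZE", 66.5, 67.2),
    ("ABCFGE", 14.5, 67.2),
]

total_flow = sum(flow for _, flow, _ in min_cost_routes)
total_cost = sum(flow * minutes for _, flow, minutes in min_cost_routes)
average_cost = total_cost / total_flow  # minutes per vehicle

print(round(total_flow, 1), round(average_cost, 1))  # prints 272.5 58.5
```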


Explanation  of  How  this  Algorithm  Can  Be  Used  To  Solve  the 
Transportation  Network  Problem 

By  the  transportation  network  problem  the  following  is  meant: 
Consider  n  points  located  on  a  map  as  origins  of  logistical 
material.  Each  point  has  associated  with  it  a  supply  of  a[i]  units 
of  the  material.  In  addition,  there  are  m  destination  points,  with 
each destination point requiring b[i] units of the material.
Associated  with  each  link  in  a  network  between  the  sources  and  the 
destinations  there  is  a  unit  cost  of  transportation  and  a  flow 
capacity.  The  problem  is  to  determine  the  shipping  pattern  from 
origins  to  destinations  that  minimizes  the  total  cost  under  the 
constraints  imposed  by  the  flow  capacities  on  each  link.  By 
defining n paths, each with a flow capacity equal to a[i], from a
notional starting point to the origins, and m paths, each with a
flow capacity equal to b[i], from the destinations to a notional
ending point,
this problem can be considered as a special case of the minimal
cost network flow problem discussed in the previous section. The
algorithm given to solve that problem will, in the process of
computing the maximal flow in the network just defined, produce
the minimal shipping cost solution which satisfies most of the
total requirements at the destinations. With simple modifications
to the starting requirements for the search routines, the algorithm
will produce solutions which satisfy the list of destinations in
any given prioritized sequence.
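
The reduction described above can be sketched in Python as follows. This is an illustration only: the function name, the node labels "START" and "END", and the sample numbers are hypothetical, and giving the notional arcs zero cost is an assumption, since the text specifies only their capacities.

```python
def transportation_to_flow_network(supplies, demands, links,
                                   start="START", end="END"):
    """Build a min-cost flow edge list from a transportation problem.

    supplies: origin -> a[i] units available
    demands:  destination -> b[i] units required
    links:    list of (from, to, capacity, unit_cost)
    Returns a list of (from, to, capacity, unit_cost) edges.
    """
    edges = []
    # Notional starting point feeds each origin up to its supply.
    for origin, supply in supplies.items():
        edges.append((start, origin, supply, 0))
    edges.extend(links)
    # Each destination drains into the notional ending point.
    for destination, demand in demands.items():
        edges.append((destination, end, demand, 0))
    return edges


supplies = {"O1": 20, "O2": 30}
demands = {"D1": 25, "D2": 15}
links = [("O1", "D1", 15, 4), ("O1", "D2", 10, 6),
         ("O2", "D1", 20, 5), ("O2", "D2", 10, 3)]
network = transportation_to_flow_network(supplies, demands, links)
print(len(network), network[0], network[-1])
```

The resulting edge list can be handed to any minimal cost flow routine that runs from the notional starting point to the notional ending point.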


6. Conclusions

We have given definitions and examples of some artificial
intelligence network terminology and discussed several ways in
which sorted priority queue depth-first searches can be used to
solve shortest path, network maximal flow, and min-cost network flow
problems. We have shown that the time bounds for these algorithms
depend on the number of critical path backtracking nodes in the
following sense: If there are no critical backtracking nodes, and
the network is directed, then there are at most (emax - 1)*(n - nl)
+ 1 paths in the whole search space. If there are critical path
backtracking nodes then the number of paths is bounded by:

1 + (emax - 1)*(n - nl - ncrit) + (maxe*emax - 1)*nlcrit
+ ((maxe*emax)^q - 1)*(ncrit - nlcrit)

We obtained similar expressions for the case in which the paths
can go either forward or backward along edges in the network. We
used these expressions to obtain time bounds for the total number
of steps to solve the maximal flow and min-cost flow problems.

ABSTRACT. Computer Algebra Systems (CAS) are powerful and efficient tools that can help
with  problem  solving  in  research  environments.  In  this  paper,  we  outline  some  of  the 
capabilities  of  one  particular  CAS,  Derive,  which  runs  on  IBM  personal  computers  and 
compatibles.  The  discussion  includes  analysis  of  symbolic  and  numeric  computation  and 
graphic  displays.  Emphasis  is  placed  on  the  special  functions  and  computational  modes  most 
beneficial  to  researchers.  We  present  some  of  the  general  algorithms  used  by  this  software 
package  and  discuss  the  role  of  CAS  in  solving  research  problems.  The  limitations  of  Derive  are 
presented. 

INTRODUCTION.  A  Computer  Algebra  System,  capable  of  symbolic  manipulation,  is  a 
powerful  and  efficient  tool  in  a  research  environment.  At  the  United  States  Military  Academy 
(USMA),  every  student  is  required  to  purchase  Derive  (distributed  by  Soft  Warehouse,  Inc)  for 
their  use  in  all  mathematics  courses.  Also,  all  the  mathematics,  science  and  engineering 
faculty  have  Derive.  USMA  is  an  undergraduate  institution  with  a  student  population  of 
approximately  4400  cadets,  each  of  whom  possesses  an  IBM-compatible  personal  computer. 
Every  graduate  fulfills  the  requirements  for  a  bachelor  of  science  degree,  with  about  50% 
obtaining  a  major  or  field  of  study  in  a  math,  science,  or  engineering  discipline.  USMA  is 
certainly  not  a  research  institution,  but  a  number  of  both  students  and  faculty  are  involved  in 
a  myriad  of  research  areas,  often  involving  interaction  with  other  Department  of  Defense 
agencies. Derive is proving to be an extremely useful tool for both the student and the faculty
researcher.

In  a  research  environment,  the  most  popular  uses  of  a  computer  algebra  system  are  in  the 
following  areas: 

•  Solving  differential  equations 

•  Fourier  transforms 

•  Group theory

•  Laplace transforms

•  Number  theory 

•  Calculus  (including  multivariable  and  vector) 

•  Linear  algebra 

CAPABILITIES  AND  EXAMPLES.  Let  us  examine  a  few  of  these  areas  to  see  how  Derive 
can  assist  the  researcher.  Once  the  program  is  executed,  it  is  possible  to  load  one  or  more 
utility  files  into  the  Derive  window.  These  utility  files  can  contain  user  built  functions,  which 
can  then  be  used  for  repetitive  calculations.  These  functions  supplement  the  functions 
executed  through  the  Derive  menu  or  the  standard  in-line  functions  always  available.  Derive 
itself  comes  with  several  of  these  files,  including  files  for  first-  and  second-order  differential 
equations,  recurrence  equations,  probability  functions  and  special  functions  such  as  Bessel  and 
Airy  functions.  Also  included  is  a  file  containing  many  common  physical  constants  and  unit 
conversion  factors.  Users  can  create  their  own  files  containing  functions  from  the  above  files  or 
their own unique functions. For example, the two tables below provide a partial listing of
predefined functions available to solve first-order differential equations symbolically.


Function Name                           Purpose/Form of Equation to be Solved

FUN_LIN_CCF(q,a,b,c,x,y,x0,y0)          solves y' = q(x, y)(ax + by + c), y(x0) = y0

LIN_FRAC(r,a,b,c,s,t,d,x,y,x0,y0)       solves y' = r(x,y)((ax + by + c)/(sx + ty + d)),
                                        y(x0) = y0, sa-eb ≠ 0, cd ≠ 0

INTEG_FCTR_FREE_OF_X(r,p,q,x,y,x0,y0)   solves p(x, y) + q(x, y)y' = 0, when the
                                        integration test is free of x and gives r

INTEG_FCTR_FREE_OF_Y(r,p,q,x,y,x0,y0)   solves p(x, y) + q(x, y)y' = 0, when the
                                        integration test is free of y
                                        (similar to above)

GEN_HOM(r,k,x,y,x0,y0)                  solves general homogeneous equation
                                        y' = r(x,y) = h(y x^k) y/x

ALMOST_LIN(r,h,p,q,x,y,x0,y0)           solves r(x, y)y' + p(x)h(y) = q(x), y(x0) = y0

CLAIRAUT                                helps solve the Clairaut equation

TAY_ODE1(r,x,y,x0,y0)                   finds 4th degree Taylor-series solution to
                                        y' = r(x, y), y(x0) = y0

PICARD(r,yprev,x,y,x0,y0)               given approximate solution yprev to y' = r(x,y),
                                        y(x0) = y0, finds an improved iterate

The basic commands for solving first-order differential equations
available in the utility file ODE1 and their purpose.


Function Name                           Purpose

SEPARABLE(p,q,x,y,x0,y0)                solves a separable differential equation,
                                        y' = p(x)q(y), y(x0) = y0

EXACT_IF_0(p,q,x,y)                     checks if equation p + qy' = 0 is exact

EXACT(p,q,x,y,x0,y0)                    solves the exact equation (above),
                                        with y(x0) = y0

USE_INTEG_FCTR(m,p,q,x,y,x0,y0)         solves equation with known integrating
                                        factor m

LINEAR1(p,q,x,y,x0,y0)                  solves linear equation y' + py = q,
                                        with y(x0) = y0

BERNOULLI(p,q,k,x,y,x0,y0)              solves the Bernoulli equation
                                        y' + py = qy^k, with y(x0) = y0

HOMOGENEOUS_IF_FREE_OF_X(r,x,y)         if this returns a 0, the equation
                                        y' = r(x, y) is homogeneous

HOMOGENEOUS(r,x,y,x0,y0)                solves the homogeneous equation
                                        (shown above)

Other commands available in ODE1 and the form of the equation they solve.


A few practical examples should illustrate how these kinds of
functions can be used. Suppose we wished to solve the initial-value problem

y'' - 2y' + y = 10 e^(-2x) cos(x), subject to y(0) = 1 and y'(0) = 2.

This is recognized as a second-order, constant coefficient, nonhomogeneous differential
equation. After loading the ODE2 file, we proceed as follows:

First, classify the nature of the homogeneous solution using the command

LIN2_RED_CCF_DISC(-2,1)

The discriminant will be either positive (real, distinct roots), negative (complex roots), or
zero (repeated roots). In our example, the discriminant is zero, indicating repeated roots. Our
next command would then be


LIN2_RED_CCF_0(-2,x)

This would return the homogeneous solution, complete with arbitrary constants, as shown
below


@1 e^x + @2 x e^x.

The particular solution can then be found using the command

LIN2_COMPLETE(e^x, x e^x, 10 e^(-2x) cos x, x)

The particular and homogeneous solutions can then be added together, and initial conditions
applied by using the command

IMPOSE_IC2(x,f(x),0,1,2)

which would then yield the solution

y = (1/5) e^x + 4x e^x + ((4/5) cos x - (3/5) sin x) e^(-2x).
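
The stated solution can be checked independently of Derive. The Python sketch below is only a numerical sanity check, not the method Derive uses: it approximates y' and y'' by central differences (the step size h and the sample points are arbitrary choices for this illustration) and confirms that the differential equation and both initial conditions are satisfied to within the approximation error.

```python
import math


def y(x):
    """Closed-form solution quoted above."""
    return ((0.2 + 4.0 * x) * math.exp(x)
            + (0.8 * math.cos(x) - 0.6 * math.sin(x)) * math.exp(-2.0 * x))


def first_derivative(x, h=1e-3):
    return (y(x + h) - y(x - h)) / (2.0 * h)


def second_derivative(x, h=1e-3):
    return (y(x + h) - 2.0 * y(x) + y(x - h)) / h ** 2


def residual(x):
    """Left side minus right side of y'' - 2y' + y = 10 e^(-2x) cos(x)."""
    forcing = 10.0 * math.exp(-2.0 * x) * math.cos(x)
    return second_derivative(x) - 2.0 * first_derivative(x) + y(x) - forcing


sample_points = [0.0, 0.5, 1.0, 1.5, 2.0]
worst = max(abs(residual(x)) for x in sample_points)
print(y(0.0), first_derivative(0.0), worst)
```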

Derive  can  also  plot  this  solution  using  either  a  full  or  split  screen  window. 

Some  other  quick  examples  of  the  capabilities  of  Derive  include  the  following: 

The  command 

LAPLACE(  f(t),  t,s) 

yields the Laplace transform of f(t). A piecewise continuous function can be constructed using
built-in functions of Derive, and then its Fourier series found by using the command

FOURIER(f(x), x, a, b, n)

where a and b specify the interval and n is the number of terms desired. Derive can evaluate
multiple integrals (or derivatives of any order) symbolically, and can approximate definite
integrals with an adaptive quadrature routine. Derive also has built-in commands for such
vector operations as finding the Laplacian, divergence, curl, and potential of a given vector.
Many matrix algebra functions are also included in Derive, such as computing determinants or
inverses, finding eigenvalues, or row-reducing a matrix. In short, the capabilities of Derive can
deliver the researcher from much of the tedium of “number crunching,” thereby allowing
greater time for actual research.


LIMITATIONS.  Lest  we  come  to  think  of  Derive  as  being  capable  of  performing  all  of  our 
mathematical  manipulations,  it  is  important  to  realize  its  limitations.  First  and  foremost,  it 
has  no  programming  language  to  do  branching  or  iteration.  There  is  no  way  to  input 
superscripted or subscripted variables. Derive is relatively slow at plotting graphs, especially
when plotting “out-of-range.” Its three-dimensional plotting is adequate, but would be much
enhanced  if  contour  plots  could  be  generated.  Also,  there  is  no  way  to  edit  graphs  to  add  such 
amenities  as  axis  labels.  Perhaps  the  most  significant  shortcoming  occurs  when  an  impossible 
operation  is  attempted.  Sometimes,  instead  of  providing  some  type  of  warning  or  error 
message,  Derive  simply  “does  nothing,”  leaving  the  user  wondering  why  the  command  was  not 
executed. 

CONCLUSION.  Despite  Derive's  limitations,  its  user-friendly  interface  and  relatively  low  cost 
make  it  an  excellent  aid  to  any  researcher.  The  researcher  with  limited  computation  hardware 
can especially appreciate its small size: the entire program, including utility files, will fit on
a standard 5 1/4 inch floppy disk and executes on standard IBM PCs and compatibles. This is
extremely convenient for those researchers who have computers without a hard disk drive.
Finally, for students and faculty here at USMA, the availability of Derive makes it an
extremely effective tool for conducting group research projects.

ABSTRACT. The United States Military Academy (USMA) is
fortunate  to  have  a  computer-rich  educational  environment. 
Every  student  purchases  a  personal  computer,  every  faculty 
member  has  one  at  his  or  her  desk,  and  all  the  departments 
have  mobile  computers  equipped  with  an  overhead  projection 
device  to  bring  the  computer  experience  into  the  classroom. 
Additionally,  each  student  purchases  standard  software 
consisting  of  word  processing,  spreadsheet,  and  computer 
algebra systems (CAS). The challenge to the faculty is to
effectively  use  the  available  resources  to  enrich  the 
students  and  increase  their  understanding  of  the  concepts 
presented.  The  Department  of  Mathematical  Sciences  (D/MS) 
relies  heavily  on  CAS  to  take  the  drudgery  out  of  the 
computations  and  to  put  excitement  into  its  courses. 

1.  INTRODUCTION.  Prior  to  its  foray  into  using  CAS,  D/MS 
had  already  decided  on  some  of  its  software  requirements. 
Minitab  and  Quattro  would  be  used  as  statistical  and 
spreadsheet  packages  respectively,  and  some  other  software 
packages,  like  Calculus  Toolkit  and  the  Mathematics  Plotting 
Program  (MPP)  would  be  available  as  demonstration  packages. 
While  these  last  two  programs  performed  well,  they  were 
limited  in  their  scope. 

In  the  spring  of  1989,  D/MS  began  a  search  for  a  reliable 
CAS.  It  wanted  something  that  was  easy  to  use  and  could 
integrate  numerical,  graphical,  and  analytical  procedures. 
Since all the cadets would be buying a copy for their own
computer, cost was important.

Derive, a commercial product from The Soft Warehouse, was
selected as the software that best met the requirements. The
D/MS  faculty  began  the  following  program  to  integrate  CAS 
into  the  classwork. 

The  first  100  copies  arrived  in  May  of  1989  and  were 
distributed  to  the  math  and  science  faculty.  In  June, 
members  of  the  D/MS  presented  a  demonstration  to  the  math, 
science,  and  engineering  faculty  to  let  them  know  the  state 
of CAS and to prepare them to integrate it into their
courses. As the first purchase was going to the
freshman  and  sophomore  classes,  most  of  the  engineering 
faculty  would  have  one  or  two  years  to  validate  our 
experience  and  integrate  it  into  their  own  disciplines. 

The  students  received  their  copies  of  Derive  in  October,  two 
months  into  the  academic  year.  Unfortunately,  this  was  too 
late  in  the  semester  to  properly  integrate  it  into  the 
instruction,  but  plans  were  made  to  do  so  in  the  second 
semester.  During  that  fall  semester,  the  instructors  were 
strongly  urged  to  utilize  Derive  in  their  classroom  sessions 
and  many  did  so. 

During  the  second  semester,  students  in  the  Calculus  I 
course  used  Derive  extensively  in  the  classroom  and  on 
homework.  This  was  accomplished  by  utilizing  the  school’s 
computer  labs  in  conjunction  with  pre-planned  laboratory 
worksheets.  Other  math  courses  used  Derive  on  a  weekly 
basis  and  included  Derive  worksheets  in  some  of  their  lesson 
material.  The  Calculus  I  students,  while  not  unanimous  in 
their  opinion  of  CAS,  gave  the  following  comments:  "Often 
in the past, I would give up on homework. Derive is another
alternative." "I finally figured out that Derive could help
me explain things, not just give me results." "I feel with
time I will be able to integrate Derive into my homework
work schedule."

A  new  core  math  curriculum  was  implemented  in  the  fall  of 
1990.  This  caused  the  department  to  change  many  course 
priorities  and  objectives  to  meet  the  new  requirements  of 
the  curriculum.  The  summer  of  1990  was  used  to  restructure 
courses  not  only  so  that  they  would  complement  the  new 
curriculum,  but  with  the  specific  purpose  of  integrating  the 
CAS  into  daily  lessons. 

2. OUR APPROACH. Currently, nine math courses at USMA
utilize  CAS  as  a  part  of  their  instruction:  Precalculus, 
Discrete  Dynamical  Systems,  Calculus  I  &  II,  Differential 
Equations,  Probability  and  Statistics,  Multi-Variable 
Calculus,  Mathematical  Modeling,  and  Numerical  Analysis. 
Additionally,  Derive  is  powerful  enough  to  be  used  by  the 
students  in  a  variety  of  other  mathematics,  science,  and 
engineering courses. Its ability to evaluate integrals and
combinations makes it ideal for checking solutions to
probability  problems.  Multiple  integration  is  a  challenging 
problem  for  our  students,  but  Derive  enables  students  to 
solve  these  problems  without  becoming  mired  in  their 
intricacies.  It  is  also  useful  for  checking  any  kind  of 
numerical  work  that  involves  complex  calculations.  The 
students  can  do  their  homework  assignment  and  then  rapidly 
check  their  work. 

In  the  classroom,  Derive  is  used  to  demonstrate  ideas,  solve 
problems,  conduct  experiments,  and  explain  results.  Many 
times,  this  is  done  as  an  introduction  to  the  next  lesson, 
whereby  the  instructor  takes  the  current  subject  and 
conducts  a  "What  if?"  experiment  to  introduce  the  next 
topic. 

In  the  larger  picture,  CAS  can  help  cover  the  entire 
spectrum of mathematics: discrete-continuous, linear-nonlinear,
and deterministic-stochastic. In this way, the
CAS  allows  the  student  to  do  mathematical  modeling  and  work 
on  more  realistic  problems  instead  of  spending  his  time 
memorizing  algorithms.  Thus,  the  student  is  better  able  to 
internalize  the  algorithm  through  repeated  use,  instead  of 
memorizing  it  for  a  test  and  then  forgetting  it  through  lack 
of  use.  The  students  are  encouraged  to  conduct  their  own 
numerical  experiments.  Changing  a  parameter,  introducing 
more  terms,  and  exploring  limits  are  some  of  the  things  that 
a  student  can  readily  do  to  experience  the  excitement  of 
mathematics  instead  of  the  drudgery  of  computational 
exercises.

Derive's graphing capability helps the student accomplish
these things and many others. Changing an expression and
seeing a different table of numbers is nice, but watching
this change occur on a graph lets the student actually see
how the changes influence the behavior of the result. Of
equal importance is Derive's capability to use multiple
windows. This allows the student to explore different areas
while keeping the same screen setup.

The following figure shows a screen display of three
different windows: algebra, 2-D graph, and 3-D graph. If a
student wished to explore the cosine function, he or she
would do so merely by entering a new expression in Window 1
and having Derive plot the new expression. The original
cos(x) can be deleted or left on the graph for easy
comparison with the new expression.

Derive is a very capable teaching tool. Students
historically have problems dealing with functions in two
variables. Using Derive, it can be shown that these
functions are accessible. The following graph illustrates
the function f(x,y) = y^2 - x^2 in the upper right and after
cutting it with the plane z = 0 using the MAX function in
Window 3.
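
For readers without access to Derive, the same cut can be sketched in Python. This is only an illustration under stated assumptions: the function names `saddle` and `cut_at_zero` are hypothetical, and Python's built-in `max` stands in for Derive's MAX function. Taking the pointwise maximum of f(x,y) = y^2 - x^2 and 0 replaces every part of the surface lying below the plane z = 0 by the plane itself.

```python
def saddle(x, y):
    """f(x, y) = y^2 - x^2, the surface discussed in the text."""
    return y**2 - x**2

def cut_at_zero(x, y):
    """Pointwise max(f, 0): the surface with everything below z = 0 flattened."""
    return max(saddle(x, y), 0.0)

# Sample the cut surface on a small grid (the grid is an arbitrary choice).
samples = [(i / 2, j / 2) for i in range(-4, 5) for j in range(-4, 5)]
heights = [cut_at_zero(x, y) for x, y in samples]
```

Plotting `heights` over `samples` with any 3-D plotting tool would give a picture comparable to the one described for Window 3.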


Similarly,  the  CHI  function  is  used  to  introduce  piecewise 
functions  and  demonstrate  discontinuities.  Derive  allows 
the  student  to  visualize  the  effects  of  discontinuities. 
This  allows  the  student  to  see  what  is  taking  place  when  an 
algorithm  indicates  that  a  function  has  a  discontinuity. 
Derive's expression and plot for a piecewise continuous
function  is  shown  in  the  following  figure. 


Calculus functions, like most of Derive's functions, are
reached  through  the  menu  system.  Besides  finding 
derivatives  and  integrals,  Derive  gives  the  student  ready 
access  to  limits,  summations,  and  Taylor  Series.  These  are 
useful,  not  only  to  determine  numerical  results,  but  also  to 
explore  the  functions  and  to  discover  the  various  attributes 
of  each. 

Vector  and  matrix  operations  are  other  types  of  operations 
that  Derive  will  do  for  the  student.  Scalar  multiples,  dot 
products  and  cross  products  are  all  at  the  student's 
fingertips.  Matrix  inverses,  addition,  multiplication,  and 
finding  eigenvalues  are  all  standard  operations  in  Derive. 

The  student  is  encouraged  to  use  Derive  in  two  fashions. 
First,  the  student  can  perform  many  operations  sequentially. 
This  helps  in  understanding  the  algorithms.  Then,  the 
student  can  simply  use  the  appropriate  Derive  function  to 
determine  if  the  process  was  correct.  In  this  way,  the 
students  can  do  larger,  more  realistic,  problems  without 
worrying  about  getting  lost  in  the  quagmire  of  the 
individual  calculations.  Students  are  also  encouraged  to 
use  these  functions  any  time  that  they  have  homework 
assignments.

The following figure shows the process of finding the
eigenvalues of a 4x4 matrix.

No software package is perfect, and we would be remiss if we
did not point out some of Derive's limitations. First and
foremost  is  its  lack  of  programming.  However,  this  may 
actually  benefit  students  who  need  to  perform  step-by-step 
operations  to  understand  the  concept  of  the  algorithm. 
Another  area  in  which  it  falls  short  is  its  lack  of  a  mouse 
interface.  While  most  of  the  commands  are  easy  to  get  to, 
editing  a  previously  entered  command  or  moving  the  cursor 
around  the  2-D  graph  is  tedious  when  compared  to  the  rest  of 
the  program's  user-friendly  implementation.  Another 
shortcoming  is  its  limited  text  formatting.  All  keyboard 
entries,  unless  enclosed  by  quotes,  are  treated  as  variables 
and  therefore,  entering  and  printing  technical  notation  is 
difficult.

3. IMPACT AND LESSONS LEARNED: D/MS is in the process of
changing  from  text  and  algorithm  oriented  courses  to 
laboratory  and  problem  solving  courses.  As  these  changes 
are  made,  there  is  no  longer  an  emphasis  on  the  techniques 
of  mathematics,  but  on  its  concepts  and  applications.  Just 
as  the  advent  of  hand-held  calculators  removed  interpolation 
and  finding  square  roots  from  our  curriculum,  we  see  the 
rise  of  CAS  replacing  the  classroom  hours  spent  mastering 
algorithms  with  time  spent  on  the  applications  and  concepts 
of  mathematics. 

From  the  students'  point  of  view,  55%  of  those  who  used  it 
during  the  second  semester  MA101  course  felt  that  Derive  was 
beneficial  to  their  understanding  of  the  curriculum. 
Interestingly,  this  group  of  students  did  better  than  the 
previous  year's  students  on  a  standard  calculus  test 
administered  towards  the  end  of  the  semester.  It  takes  a 
conscious  effort  on  the  instructor's  part  to  bring  a  CAS 
into  the  classroom  and,  more  importantly,  to  have  their 
students  use  it  outside  the  classroom.  Failure  to  do  so 
magnifies  any  misgivings  that  the  student  has  about  using 
the  computer  as  a  learning  tool.  The  solution  to  this 
problem  is  to  include  Derive  in  the  testing  and  evaluation 
phase  of  the  course. 


4.  CONCLUSIONS:  The  D/MS  has  made  significant  progress 
integrating  CAS  in  the  classroom,  but  there  is  still  much  to 
be  done.  The  new  course,  Discrete  Dynamical  Systems,  which 
now  begins  our  core  math  program,  was  designed  specifically 
to  employ  CAS  to  the  fullest  possible  extent.  The  large 
number  of  students  taking  this  course,  about  1150,  provides 
a  large  student  population  to  test  our  new  teaching 
philosophy and the use of CAS in undergraduate mathematics.


Domain Decomposition Methods for Problems
with  Uniform  Local  Refinement  in  Two  Dimensions 

James  H.  Bramble,  Richard  E.  Ewing,  Rossen  R.  Parashkevov, 

and  Joseph  E.  Pasciak 

Abstract. In this talk, we first present a flexible mesh refinement strategy for the approximation
of solutions of elliptic boundary value problems in two dimensional domains. Coupled
with this approximation scheme, we shall describe preconditioners for the resulting discrete
system of algebraic equations. These techniques lead to efficient computational procedures in
serial as well as parallel computing environments. The preconditioners are based on overlapping
domain decomposition and involve solving (or preconditioning) subproblems on regular
subregions. These techniques are analyzed in a forthcoming paper [2]. We present the results
of numerical experiments illustrating the preconditioning algorithms.


INTRODUCTION 

To provide the required accuracy in many applications involving large scale scientific
computation, it becomes necessary to use local mesh refinement techniques. These techniques
allow the use of finer meshes in regions of the computational domain where the
solution exhibits large gradients. This remains practical only if efficient techniques for the
solution of the resulting discrete systems are available. In this talk, we will give a flexible
scheme for refinement as well as develop effective iterative methods for the solution of the
resulting systems of discrete equations. This was also presented at the Fourth International
Symposium on Domain Decomposition Methods, Moscow, USSR, May, 1990. The analysis
for the methods discussed in this talk is given in [2].

We  shall  be  interested  in  techniques  for  problems  with  refinements  which  are  not  quite 
local.  As  an  example,  one  might  consider  a  front  passing  through  a  two  dimensional 
domain.  In  this  case,  it  might  be  necessary  to  refine  in  the  neighborhood  of  the  front. 

There are a number of ways of developing preconditioned iterative schemes for the discrete
systems resulting from local mesh refinement in the literature. Techniques based on
nested multilevel spaces are given in [1],[10],[11]. Techniques based on domain decomposition
are given in [3],[14],[15]. The analysis presented there implicitly depends on the shape
of the refinement domain, and hence the resulting algorithms may not be as effective
with irregularly shaped refinement regions. These algorithms also require the solution of
a subproblem or preconditioner on the refinement regions. This talk will provide alternative
preconditioned iterative techniques for these problems based on overlapping domain
decomposition. Our algorithms are simpler and possibly more effective when implemented
since they often lead to preconditioning subproblems defined on either regular subregions
or topologically 'nice' meshes. The refinement region is the union of the subregions.

ment of Energy. Accordingly, the U.S. Government retains a non-exclusive, royalty-free license to publish or
reproduce the published form of this contribution, or allow others to do so, for U.S. Government purposes.
and by the U.S. Army Research Office through the Mathematical Sciences Institute, Cornell University.
Additional supporters of this work include the Office of Naval Research under contract No. 0014-88-K-0370
and the Institute for Scientific Computation at the University of Wyoming through National Science
Foundation grant No. RII-8610680.

The  proposed  mesh  refinement  strategy  is  important  in  that  it  provides  a  basic  approach 
for  implementing  dynamic  local  grid  refinement.  An  example  of  a  refinement  strategy 
involves  starting  with  a  uniform  coarse-grid  and  refining  in  small  subregions  associated 
with  a  selected  set  of  coarse-grid  vertices.  These  subregions  are  allowed  to  overlap  and 
there  are  no  theoretical  restrictions  on  the  resulting  refinement  region  (the  union  of  the 
subregions).  Dynamic  refinement  is  achieved  by  simply  dynamically  changing  the  selected 
set  of  coarse-grid  vertices. 

In  addition,  the  technique  can  be  integrated  into  existing  large  scale  simulators  without 
a  complete  redesign  of  the  code.  This  is  because  most  of  the  computation  involves  tasks 
on either the global coarse grid or the refinement grids associated with the refinement subregions.
Choosing the coarse and refinement grid structure to be that already used in the
code  saves  considerable  development  costs.  For  example,  if  one  uses  regularly  structured 
meshes  in  the  coarse  and  refinement  grids,  a  substantial  part  of  the  resulting  algorithm 
only  requires  operations  on  regular  grids  even  though  the  resulting  final  approximation 
space  is  not  regular. 

The  outline  of  the  remainder  of  the  talk  is  as  follows.  In  Section  2,  we  define  some 
preliminaries  and  describe  the  second-order  elliptic  problems  which  will  be  considered. 
The overlapping domain decomposition algorithms for grids with partial refinement are
defined in Section 3. The theoretical estimates for the resulting preconditioned systems
(from  [2])  are  also  given  there.  Finally,  computational  aspects  and  the  results  of  numerical 
experiments  using  these  preconditioning  techniques  are  discussed  in  Section  4. 

2.  The  elliptic  problem  and  preliminaries 

We shall be concerned with the efficient solution of discrete equations resulting from
approximation of second-order elliptic boundary value problems in a polygonal domain $\Omega$
contained in two dimensional Euclidean space $R^2$. We consider the problem of approximating
the solution $u$ of

$$
\begin{aligned}
Lu &= f \quad \text{in } \Omega, \\
u &= 0 \quad \text{on } \partial\Omega.
\end{aligned}
\tag{2.1}
$$

Here $L$ is given by

$$
Lv = -\sum_{i,j=1}^{2} \frac{\partial}{\partial x_i}\left( a_{ij} \frac{\partial v}{\partial x_j} \right),
$$

and $\{a_{ij}(x)\}$ is a uniformly positive definite, bounded, piecewise smooth coefficient matrix
on $\Omega$. The corresponding bilinear form is denoted by $A(\cdot,\cdot)$ and is given by

$$
A(v,w) = \sum_{i,j=1}^{2} \int_\Omega a_{ij} \frac{\partial v}{\partial x_i} \frac{\partial w}{\partial x_j} \, dx,
\tag{2.2}
$$

and is defined for functions $v, w \in H^1(\Omega)$. Here $H^1(\Omega)$ is the Sobolev space of order one
on $\Omega$. We denote the $L^2(\Omega)$ inner product by $(\cdot,\cdot)$. The weak solution $u$ of (2.1) is the
function $u \in H_0^1(\Omega)$ satisfying

$$
A(u,\varphi) = (f,\varphi) \quad \text{for all } \varphi \in H_0^1(\Omega).
$$

Here, $H_0^1(\Omega)$ is the subspace of functions in $H^1(\Omega)$ whose traces vanish on $\partial\Omega$.
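
The step from (2.1) to the weak form is the usual integration by parts. The sketch below is not taken from the talk: it assumes the sign convention $L = -\sum_{i,j} \partial_{x_i}(a_{ij}\,\partial_{x_j})$, a symmetric coefficient matrix ($a_{ij} = a_{ji}$), and enough smoothness for the manipulations to be justified; $n = (n_1, n_2)$ denotes the outward unit normal on $\partial\Omega$.

```latex
\begin{align*}
(f,\varphi) = (Lu,\varphi)
  &= -\sum_{i,j=1}^{2} \int_\Omega
       \frac{\partial}{\partial x_i}\Bigl( a_{ij} \frac{\partial u}{\partial x_j} \Bigr)\,\varphi \, dx \\
  &= \sum_{i,j=1}^{2} \int_\Omega
       a_{ij} \frac{\partial u}{\partial x_j} \frac{\partial \varphi}{\partial x_i} \, dx
     \;-\; \int_{\partial\Omega}
       \Bigl( \sum_{i,j=1}^{2} a_{ij} \frac{\partial u}{\partial x_j}\, n_i \Bigr) \varphi \, ds \\
  &= A(u,\varphi) \qquad \text{for all } \varphi \in H_0^1(\Omega).
\end{align*}
```

The boundary integral drops out because $\varphi$ vanishes on $\partial\Omega$, and the remaining volume term coincides with $A(u,\varphi)$ in (2.2) after exchanging the roles of $i$ and $j$ and using $a_{ij} = a_{ji}$.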

We consider the above model problem for convenience. Many extensions of the techniques
to be presented are possible; for example, one could consider equations with lower-order
terms and different boundary conditions.

In  this  talk,  we  shall  deal  with  various  domains.  These  domains  will  always  be  open. 

3.  The  overlapping  algorithms 

In this section, we shall define iterative methods for problems with partial refinement
based on overlapping domain decomposition. We start with a coarse mesh $\cup_i \tau_H^i$ consisting
of triangles of quasi-uniform size $H$. The associated finite element space $M_0$ is defined
to be the set of continuous piecewise linear functions on the coarse mesh which vanish on
$\partial\Omega$. The interior nodes of this mesh will be denoted $\{x_i\}$, for $i = 1, \dots, N_c$. The mesh
refinement is defined in terms of a number of coarse grid subdomains $\{\Omega_i\}$ for $i = 1, \dots, K$.
By convention, $\Omega_i$ is defined to be the interior of the union of the closures of the coarse
grid triangles. The refinement regions will also be referred to as "the subdomains." We
assume that they have limited overlap in that any point of $\Omega$ is contained in at most a fixed
number (not depending on $H$) of the subdomains. We define the domain of refinement $\Omega^r$
to be the union of the subdomains, $\Omega^r = \cup_{i=1}^{K} \Omega_i$. There are no theoretical restrictions
concerning the definition of the refinement subregions except that they are defined in terms
of the coarse grid triangles and satisfy the overlap property as described above.

We provide two examples of this construction. For both examples, the subregions are
associated with coarse grid nodes. For the first example, we define the region associated
with a coarse-grid node $x_i$ as the subdomain $\Omega_i$ which contains the coarse-grid triangles
having $x_i$ as a vertex. For the second example, we consider a mesh which is topologically
equivalent to a regular rectangular mesh (see Figure 3.1). In this case, we define $\Omega_i$ to be
the four quadrilaterals which share the vertex $x_i$. Some reasons for such a choice will be
explained later. In either case, an index set $I \subset \{1, \dots, N_c\}$ is selected and the domains $\{\Omega_i\}$
with $i \in I$ are used to define the refinement region. By possibly changing the numbering of
the coarse grid nodes, we assume, without loss of generality, that $I = \{1, 2, \dots, K\}$. There
are no additional restrictions concerning this set $I$ and hence rather complex refinement
regions are possible.

The composite space is defined in terms of a quasi-uniform mesh $\{\tau_h^i\}$ on $\Omega$ of size $h < H$
which satisfies

$$
\cup_i \partial\tau_H^i \subset \cup_i \partial\tau_h^i.
$$

The space of continuous piecewise linear functions with respect to this triangulation (which
vanish on $\partial\Omega$) will be denoted by $\tilde M$. Note that this space is introduced for the construction
and analysis of the composite grid space. It is not used in actual computation since it has
too many degrees of freedom in $\Omega \setminus \Omega^r$. The subspace $M_i$ associated with the subdomain
$\Omega_i$ is defined by

$$
M_i = \{\phi \in \tilde M \mid \operatorname{supp} \phi \subset \bar\Omega_i\}. \tag{3.1}
$$

Figure 3.1. A distorted rectangular mesh.

The composite finite element space is then defined to be

$$
M = \sum_{i=0}^{K} M_i.
$$

Note that the space $M$ provides finer grid approximation in the refinement region $\Omega^r$.
An illustrative example of a mesh so generated is given in Figure 3.2. The nodes on the
boundary of the refinement region which are not coarse-grid nodes are slave nodes since,
by continuity, the values of functions in $M$ on these points are completely determined by
their values on neighboring coarse-grid nodes. The operator $A_i : M_i \to M_i$ is defined for
$v \in M_i$ by

$$
(A_i v, \phi) = A(v, \phi) \quad \text{for all } \phi \in M_i.
$$

Our goal is to efficiently solve the composite grid problem: Given a function $f \in L^2(\Omega)$,
find $U \in M$ satisfying

$$
A(U, \phi) = (f, \phi) \quad \text{for all } \phi \in M. \tag{3.2}
$$

As above, we define $A : M \to M$ by

$$
(Av, \phi) = A(v, \phi) \quad \text{for all } \phi \in M.
$$

Problem (3.2) can then be rewritten as

$$
AU = F \tag{3.3}
$$

for appropriate $F \in M$. We will develop preconditioners for (3.3) by using overlapping
domain decomposition.

Figure 3.2. A composite grid (X: selected coarse grid nodes).

There are basically two classes of these preconditioners, the additive and multiplicative.
The additive version defines the preconditioner $B_a$ for $A$ of (3.3) by

$$
B_a = \sum_{i=0}^{K} R_i Q_i.
$$

Here, $Q_i$ denotes the $L^2(\Omega)$ projection operator onto $M_i$ and $R_i$ is a symmetric positive
definite operator on $M_i$. Explicit choices for $R_i$ will be discussed later; however, we note
that it suffices to take $R_i$ to be a preconditioner for $A_i$.

The multiplicative version is defined by applying the $R_i$ consecutively. The multiplicative
preconditioner $B_m$ applied to a function $W \in M$ is defined as follows:

(1) Set $Y_0 = 0$.

(2) For $i = 1, \dots, K+1$, define $Y_i$ by

$$
Y_i = Y_{i-1} + R_{i-1} Q_{i-1} (W - A Y_{i-1}). \tag{3.4}
$$

(3) For $i = K+2, \dots, 2K+2$, define $Y_i$ by

$$
Y_i = Y_{i-1} + R_{2K+2-i} Q_{2K+2-i} (W - A Y_{i-1}). \tag{3.5}
$$

(4) Set $B_m W = Y_{2K+2}$.

It is not difficult to see that $B_m$ is a symmetric linear operator on $M$.

The operators $B_a$ and $B_m$ defined above will be effective as preconditioners for $A$ if they
satisfy the following:

(1) They are relatively inexpensive to evaluate.

(2) They lead to well conditioned linear systems.

The first criterion involves implementation issues and will be discussed later in more detail.
The second criterion requires that the condition numbers $K(B_a A)$ and $K(B_m A)$ be
small. In the case of the additive algorithms, this is equivalent to the existence of positive
constants $c_0$, $c_1$ satisfying

$$
c_0 A(v,v) \le A(B_a A v, v) \le c_1 A(v,v) \quad \text{for all } v \in M, \tag{3.6}
$$

with $c_1/c_0$ small. A similar statement holds for the product algorithm.
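
The additive and multiplicative preconditioners and the condition numbers in (3.6) can be tried out on a small one-dimensional analogue. The following is a minimal sketch under several assumptions that are not part of the talk: the problem is $-u'' = f$ on $(0,1)$ with linear elements, the whole domain is refined (so the composite space coincides with the fine space), the subdomains are the intervals $(x_i - H, x_i + H)$ around the interior coarse nodes, and the subspace solves are exact ($R_i = A_i^{-1}$). All function names are hypothetical, and NumPy is assumed to be available.

```python
import numpy as np

def stiffness_1d(n, h):
    """Stiffness matrix of -u'' on (0,1) with n interior nodes, linear elements."""
    return (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / h

def build_subspaces(m, l):
    """Basis matrices P_0..P_K for coarse size H = 1/m and fine size h = H/l."""
    n = m * l - 1                          # number of fine interior nodes
    x = np.arange(1, n + 1) / (m * l)      # fine node coordinates
    H = 1.0 / m
    coarse_nodes = np.arange(1, m) * H
    # P_0: coarse hat functions sampled at the fine nodes (the space M_0).
    P0 = np.maximum(0.0, 1.0 - np.abs(x[:, None] - coarse_nodes[None, :]) / H)
    Ps = [P0]
    # P_i: fine nodes strictly inside Omega_i = (x_i - H, x_i + H).
    for xc in coarse_nodes:
        inside = np.flatnonzero(np.abs(x - xc) < H - 1e-12)
        Ps.append(np.eye(n)[:, inside])
    return Ps

def subspace_solver(A, P):
    """Matrix form of R_i Q_i with an exact solve: P (P^T A P)^{-1} P^T."""
    return P @ np.linalg.solve(P.T @ A @ P, P.T)

def additive(A, Ps):
    """B_a: the sum of the subspace solvers."""
    return sum(subspace_solver(A, P) for P in Ps)

def multiplicative(A, Ps):
    """B_m from the symmetric sweep (3.4)-(3.5), applied to every unit vector."""
    n = A.shape[0]
    T = [subspace_solver(A, P) for P in Ps]
    order = list(range(len(Ps))) + list(range(len(Ps) - 1, -1, -1))
    W = np.eye(n)                # each column plays the role of W
    Y = np.zeros((n, n))         # Y_0 = 0
    for i in order:
        Y = Y + T[i] @ (W - A @ Y)
    return Y

def spectrum(B, A):
    """Eigenvalues of B A, computed from the similar symmetric matrix L^T B L."""
    L = np.linalg.cholesky(A)
    return np.linalg.eigvalsh(L.T @ B @ L)

m, l = 4, 8
A = stiffness_1d(m * l - 1, 1.0 / (m * l))
Ps = build_subspaces(m, l)
Ba, Bm = additive(A, Ps), multiplicative(A, Ps)
eigs_a, eigs_m = spectrum(Ba, A), spectrum(Bm, A)
kappa_a = eigs_a[-1] / eigs_a[0]
kappa_m = eigs_m[-1] / eigs_m[0]
```

Here `kappa_a` and `kappa_m` play the roles of $K(B_a A)$ and $K(B_m A)$, and the matrix `Bm` comes out symmetric up to rounding, in line with the remark above. Replacing `np.linalg.solve` by an approximate solver would correspond to choosing $R_i$ as a preconditioner for $A_i$ instead of an exact inverse.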

The analysis presented in [2] requires the following hypotheses. It is assumed that there
are positive constants $C_0$ and $\omega$ which do not depend on $h$, $H$ or the subdomains and
satisfy

$$
C_0 A(w,w) \le A(R_i A_i w, w) \le \omega A(w,w) \quad \text{for all } w \in M_i. \tag{3.7}
$$

This means that the operators $R_i$ are spectrally good preconditioners for $A_i$. For the
product algorithm, we also assume that $0 < \omega < 2$. The following theorem is proved in
[2].

THEOREM 3.1. Assume that there are no isolated points on the boundary of $\Omega^r$. Then
the condition numbers $K(B_a A)$ and $K(B_m A)$ remain bounded independently of $h$, $H$ and
the choice of subdomains $\{\Omega_i\}$.

REMARK 3.1: The analysis given in [2] uses techniques from both the theory of overlapping
domain decomposition [12],[13] as well as the standard domain decomposition theory
[5]-[8] to provide the result for the additive algorithms. The result for the multiplicative
version  follows  from  that  for  the  additive  and  the  application  of  a  general  theory  given  in 

[«1- 

REMARK 3.2: The hypothesis concerning isolated points on the boundary of $\Omega^r$ is included
to provide a uniform estimate for the preconditioned systems. If $\partial\Omega^r$ contains
isolated points then it is possible to show (cf. Remark 4.2 of [2]) that the condition number
grows at most on the order of $\ln^2(H/h)$. This sort of decay is actually seen in the last
numerical example in Section 4.

REMARK 3.3: There is very little restriction concerning the way that the domains $\Omega_i$
are defined. Note that if only one refinement domain is used, then Theorem 3.1 provides
a result for the imbedded space case proposed in [3]. Alternatively, one can consider
the case where $\Omega^r$ is all of $\Omega$ and hence $M = \tilde M$. In this case, Theorem 3.1 guarantees
uniform bounds for the condition numbers without putting restrictions on the shapes of
the subdomains $\{\Omega_i\}$. Thus, for example, the subdomains can be taken to be strips as
long as the coarse problem is included.

4. Computational aspects and Numerical Examples

In  this  section,  we  discuss  some  of  the  computational  properties  associated  with  the 
method.  In  particular,  we  shall  consider  its  feasibility  for  use  in  dynamic  refinement 
strategies.  We  shall  also  see  that  with  this  type  of  method,  it  is  possible  to  develop  highly 
vectorizable  and  parallelizable  code.  Finally,  we  provide  the  results  of  numerical  examples 
illustrating  the  condition  numbers  for  the  preconditioned  systems  described  earlier. 

We  consider  the  earlier  discussed  examples  where  the  domain  of  refinement  is  defined  by 
simply  selecting  coarse-grid  nodes  and  a  rule  for  defining  the  refinement  region  associated 
with  a  coarse  node.  Specifically,  we  consider  the  example  where  the  coarse  mesh  is  defined 
from  quadrilaterals  and  the  refinement  region  associated  with  a  coarse-grid  vertex  is  defined 
to  be  the  four  quadrilaterals  which  share  the  vertex.  An  easy  way  to  implement  this 
refinement  involves  using  vectors  of  unknowns  with  some  redundancy.  Associated  with 
each  quadrilateral,  we  associate  a  vector  which  contains  the  fine-grid  unknowns  in  the 
quadrilateral  and  its  boundary.  The  program  is  designed  to  operate  on  a  data  structure 
which  contains  a  coarse-grid  vector  and  a  list  of  fine-grid  vectors  corresponding  to  the 
quadrilaterals  appearing  in  the  refinement  regions.  This  process  is  controlled  by  a  list  of 
pointers  which  connect  the  location  of  quadrilateral  fine-grid  vectors  in  this  data  structure 
to  the  coarse  grid  node  refinement  regions  in  which  they  appear.  A  simple  control  structure 
is  also  developed  to  handle  the  redundancy  in  the  data  vectors.  These  control  structures 
can  be  easily  derived  from  the  list  of  coarse-grid  refinement  nodes  and  the  coarse  mesh 
geometry.  Thus,  a  dynamic  change  in  the  refinement  region  only  requires  a  simple  (and  of 
negligible cost) computation of some control pointers associated with the coarse grid.
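
One way to picture the data structure just described is sketched below. It is an illustration only, not the authors' code: the class name `CompositeGridData`, its fields, and the choice of an m x m coarse mesh with l x l fine cells per refined quadrilateral are all assumptions. The coarse-grid vector, the per-quadrilateral fine-grid vectors (each including its boundary unknowns, hence the redundancy), and the pointers from refinement nodes to quadrilaterals are kept separately, so changing the refinement region only rebuilds the pointers and allocates vectors for newly refined quadrilaterals.

```python
from dataclasses import dataclass, field

@dataclass
class CompositeGridData:
    m: int                                         # coarse mesh: m x m quadrilaterals
    l: int                                         # l x l fine cells per refined quadrilateral
    coarse: list = field(default_factory=list)     # coarse-grid vector
    fine: dict = field(default_factory=dict)       # quadrilateral -> fine-grid vector
    pointers: dict = field(default_factory=dict)   # refinement node -> its quadrilaterals

    def __post_init__(self):
        if not self.coarse:
            self.coarse = [0.0] * (self.m + 1) ** 2

    def set_refinement(self, nodes):
        """Rebuild the control pointers from the selected coarse-grid nodes."""
        self.pointers = {
            (i, j): [(qi, qj) for qi in (i - 1, i) for qj in (j - 1, j)
                     if 0 <= qi < self.m and 0 <= qj < self.m]
            for (i, j) in nodes
        }
        needed = sorted({q for quads in self.pointers.values() for q in quads})
        size = (self.l + 1) ** 2   # unknowns in a quadrilateral and on its boundary
        # Keep the vectors of quadrilaterals that stay refined; allocate new ones.
        self.fine = {q: self.fine.get(q, [0.0] * size) for q in needed}

grid = CompositeGridData(m=4, l=2)
grid.set_refinement([(1, 1), (2, 2), (3, 3)])   # refine along the diagonal
n_first = len(grid.fine)
grid.set_refinement([(2, 2)])                   # dynamic change of the region
n_second = len(grid.fine)
```

Quadrilaterals are indexed by their lower-left coarse vertex, so the four quadrilaterals sharing vertex (i, j) are (i-1, j-1), (i-1, j), (i, j-1), and (i, j).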

An advantage of the proposed approach is that it can be used to invoke refinement without
the use of the general data structures associated with meshes which are not regular.
One assigns a regular mesh topology to the coarse mesh and to the meshes in the refinement
subregions. This means that even though the composite mesh is highly irregular,
all of the problems (on $M_i$, $i \in I_0$) which need to be solved or preconditioned will be on
regular rectangular meshes. Similarly, it is possible to decompose the evaluation of the
composite grid operator into pieces which involve operator evaluation on the topologically
rectangular mesh parts. For these topologically rectangular meshes, highly efficient modules
for preconditioning and operator evaluation are available for both vector and parallel
computing architectures.

We shall consider the model problem

$$
\begin{aligned}
-\Delta u &= f \quad \text{in } \Omega, \\
u &= 0 \quad \text{on } \partial\Omega,
\end{aligned}
\tag{4.1}
$$

where $\Delta$ denotes the Laplacian and $\Omega$ is the unit square $[0,1] \times [0,1]$. To define the coarse
mesh, the domain $\Omega$ is first partitioned into $m \times m$ square subdomains of side length
$H = 1/m$. Each smaller square is then divided into two triangles by one of the diagonals
(e.g. the diagonal which goes from the bottom left to the upper right hand corner of the
square). The coarse-grid approximation space $M_0$ is defined to be the set of functions
which are continuous on $\Omega$, are piecewise linear with respect to the triangulation, and
vanish on $\partial\Omega$. The space $\tilde M$ is defined from a similar finer mesh of size $h = H/l$ for some
integer $l > 1$.

For our first two examples, we consider an application where it is required to refine
along the diagonal connecting the origin with the point (1,1). Such a refinement might
be necessary if the function $f$ has large gradients near this diagonal but is well behaved
in the remainder of $\Omega$. Accordingly, we select the coarse-grid nodes on the diagonal for
refinement. We define the refinement region associated with a refinement node to be the
four coarse mesh squares which have that node as a corner. Note that the refinement
region is highly irregular even though the coarse problem and the refinement subproblems
involve regular rectangular meshes.
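
The coarse triangulation and the diagonal refinement subdomains can be generated in a few lines. The sketch below is an illustration with hypothetical function names; it assumes that only the interior diagonal nodes are selected, since the corner nodes (0,0) and (1,1) lie on the boundary where the functions vanish. Exact fractions are used so that the subdomain corners print in the same form as in the text.

```python
from fractions import Fraction

def coarse_triangulation(m):
    """Nodes and triangles of the m x m coarse mesh on the unit square.

    Each square is split by the diagonal running from its bottom-left
    to its upper-right corner.
    """
    H = Fraction(1, m)
    nodes = [(i * H, j * H) for j in range(m + 1) for i in range(m + 1)]

    def idx(i, j):
        return j * (m + 1) + i

    triangles = []
    for j in range(m):
        for i in range(m):
            bl, br = idx(i, j), idx(i + 1, j)
            tl, tr = idx(i, j + 1), idx(i + 1, j + 1)
            triangles += [(bl, br, tr), (bl, tr, tl)]
    return nodes, triangles

def diagonal_subdomains(m):
    """Open squares ((k-1)H, (k+1)H)^2 around the interior diagonal nodes (kH, kH)."""
    H = Fraction(1, m)
    return [((k - 1) * H, (k + 1) * H) for k in range(1, m)]

nodes, triangles = coarse_triangulation(4)
for lo, hi in diagonal_subdomains(4):
    print(f"({lo}, {hi}) x ({lo}, {hi})")
```

With m = 4 this yields three subdomains and with m = 8 it yields seven, matching the counts used for Tables 4.1 and 4.2.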

We will illustrate the rate of convergence of preconditioned algorithms for solving (3.2)
where $A(\cdot,\cdot)$ is given by the Dirichlet form. To do this, we shall numerically compute the
largest and smallest eigenvalue ($\lambda_1$ and $\lambda_0$ respectively) of the preconditioned operator
$B_a A$. As is well known, the rate of convergence of the resulting preconditioned algorithms
can be bounded in terms of the condition number $K(B_a A) = \lambda_1/\lambda_0$. We shall not report
results for preconditioning with the product operator $B_m$, although our previous experience
[9] suggests that the product version will converge somewhat faster than the additive.

Table 4.1 gives the largest and smallest eigenvalue and the condition number of the
system $B_a A$ as a function of $h$. In this example, we took $R_i = A_i^{-1}$; i.e., we solved
exactly on the subspaces $\{M_i\}$. For Table 4.1, $m = 4$ and there are three refinement
subdomains $(0,1/2) \times (0,1/2)$, $(1/4,3/4) \times (1/4,3/4)$, and $(1/2,1) \times (1/2,1)$. Note that
both the upper and lower eigenvalues appear to be tending to a limit as the ratio $h/H \to 0$.
Similar behavior is seen in Table 4.2, which corresponds to $m = 8$ and uses seven smaller
refinement subregions.


Table 4.1
Condition numbers for 3 overlapping subregions

h        $\lambda_1$    $\lambda_0$    $K(B_a A)$
1/8      2.44           0.50           4.9
1/16     2.50           0.41           6.1
1/32     2.51           0.38           6.6
1/64     2.52           0.36           6.9
1/128    2.52           0.35           7.1
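
To see what these condition numbers mean for an iteration, one can evaluate the classical conjugate gradient bound, in which the energy-norm error is reduced per step by at most a factor of $(\sqrt{K} - 1)/(\sqrt{K} + 1)$. The sketch below assumes that preconditioned conjugate gradients is the iteration in question, which the text does not state explicitly, and reads the columns of Table 4.1 as $h$, $\lambda_1$, $\lambda_0$; the function name is hypothetical.

```python
import math

# (h, lambda_1, lambda_0) as reported in Table 4.1.
table_4_1 = [
    ("1/8", 2.44, 0.50),
    ("1/16", 2.50, 0.41),
    ("1/32", 2.51, 0.38),
    ("1/64", 2.52, 0.36),
    ("1/128", 2.52, 0.35),
]

def cg_reduction_factor(kappa):
    """Classical bound on the per-iteration error reduction of conjugate gradients."""
    root = math.sqrt(kappa)
    return (root - 1.0) / (root + 1.0)

for h, lam1, lam0 in table_4_1:
    kappa = lam1 / lam0
    print(f"h = {h:>5}: K = {kappa:.1f}, reduction factor <= {cg_reduction_factor(kappa):.2f}")
```

Because the condition number levels off as $h$ decreases, the bound on the reduction factor levels off as well, so the number of iterations needed for a fixed tolerance should stay roughly constant under mesh refinement.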

In almost all realistic applications, the direct solution of subproblems is much more
expensive than the evaluation of a suitable preconditioner. To illustrate the effect on the
convergence rate of the preconditioned iteration, we next consider the previous example but
with the direct solves on the subspaces replaced by multigrid preconditioners. Specifically,
we employ the V-cycle multigrid algorithm (cf. [4]) using one pre- and post-smoothing
Jacobi iteration on each grid level. This leads to a preconditioning operator $R_i : M_i \to M_i$
which satisfies

$$
0.4\,A(v,v) \le A(R_i A_i v, v) \le A(v,v) \quad \text{for all } v \in M_i. \tag{4.2}
$$

Table 4.2
Condition numbers for 7 overlapping subregions

h        $\lambda_1$    $\lambda_0$      $K(B_a A)$
1/16     2.46           [illegible]      5.2
1/32     2.52           [illegible]      6.5
1/64     2.54           [illegible]      7.2
1/128    2.54           [illegible]      7.5

The constant 0.4 above was computed numerically and holds for all of the subspace problems
which are required for this application, including $M_0$.

Tables  4.3  and  4.4  provide  the  eigenvalues  and  condition  numbers  for  the  above  examples 
when  direct  solves  were  replaced  by  multigrid  preconditioners.  Note  that  in  all  of  the 
reported  runs,  the  condition  number  with  multigrid  preconditioners  was  at  most  5/4  times 
as  large  as  that  corresponding  to  exact  solves.  Such  an  increase  in  condition  number  is 
negligible  in  a  preconditioned  iteration.  In  contrast,  the  computational  time  required  for 
the multigrid sweep is considerably less than that needed for a direct solve (especially in
more  general  problems  with  variable  coefficients). 
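The claim that such an increase is negligible can be made concrete with a rough iteration count. The sketch below assumes that the preconditioned iteration is conjugate gradients (the text above does not say so explicitly) and uses the classical error bound $2\,((\sqrt{\kappa}-1)/(\sqrt{\kappa}+1))^k$. The function name `cg_iterations` and the tolerance are illustrative choices, not part of the paper.

```python
import math

def cg_iterations(kappa, tol=1e-6):
    """Smallest k with 2 * rho**k <= tol, rho = (sqrt(kappa) - 1) / (sqrt(kappa) + 1).

    This is only the classical worst-case bound for conjugate gradients,
    used here as an assumption about the preconditioned iteration.
    """
    root = math.sqrt(kappa)
    rho = (root - 1.0) / (root + 1.0)
    return math.ceil(math.log(tol / 2.0) / math.log(rho))

# Condition numbers at h = 1/128 from Tables 4.1-4.4: (exact solves, multigrid).
cases = {"3 subregions": (7.1, 8.4), "7 subregions": (7.5, 9.4)}
for name, (exact, multigrid) in cases.items():
    print(name, cg_iterations(exact), cg_iterations(multigrid))
```

Under this bound the multigrid variant costs only a few additional iterations, each of which is much cheaper than an iteration with direct subproblem solves.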


Table 4.3

Preconditioned subproblems, 3 overlapping subregions

    h        $\lambda_1$    $\lambda_0$    K(BaA)
    1/8      2.37           0.53           4.5
    1/16     2.12           0.33           6.4
    1/32     2.07           0.27           7.6
    1/64     2.04           0.25           8.2
    1/128    2.02           0.24           8.4

Table 4.4

Preconditioned subproblems, 7 overlapping subregions

    h        $\lambda_1$     $\lambda_0$    K(BaA)
    1/16     2.36            0.40           5.9
    1/32     2.11            0.28           7.5
    1/64     [illegible]     0.24           8.8
    1/128    2.03            0.22           9.4

As a final example, we consider a case where the isolated point hypothesis of Theorem
3.1 is not satisfied. Specifically, we consider a coarse mesh of size H = 1/4 and select the
four nodes with (x, y) values (1/4, 1/2), (3/4, 1/2), (1/2, 1/4), and (1/2, 3/4). The refinement
region is everything but the subsquares [0, 1/4] x [0, 1/4], [0, 1/4] x [3/4, 1], [3/4, 1] x [0, 1/4],
and [3/4, 1] x [3/4, 1]. Note that, to satisfy the hypotheses of the theorem, it would be
necessary to include a refinement region centered at the coarse-grid node (1/2, 1/2). Table
4.5 gives the smallest eigenvalue for the operator BaA as a function of h. The function
$(.32 + .36\log_2(h^{-1}))^{-2}$ is also provided for comparison. These results suggest that the
smallest eigenvalue $\lambda_0$ decays as predicted by the theoretical bound $C/\ln(H/h)^2$ (see Remark 3.2).


Table 4.5

A "bad" example in two dimensions.

    h        $\lambda_0$    $(.32 + .36\log_2(h^{-1}))^{-2}$
    1/8      .50            .51
    1/16     .32            .32
    1/32     .22            .22
    1/64     .16            .16
    1/128    .12            .12
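The comparison column of Table 4.5 can be recomputed directly from the stated formula. The short sketch below does this; the function name `fit` and the dictionary of reported values are illustrative names, and the reported eigenvalues are copied from the table.

```python
import math

def fit(h):
    """Comparison function (0.32 + 0.36 * log2(1/h))**(-2) from Table 4.5."""
    return (0.32 + 0.36 * math.log2(1.0 / h)) ** -2

# Smallest eigenvalues reported in Table 4.5, keyed by 1/h.
reported = {8: 0.50, 16: 0.32, 32: 0.22, 64: 0.16, 128: 0.12}

for n, lam0 in reported.items():
    print(f"h = 1/{n}: reported {lam0:.2f}, fit {fit(1.0 / n):.2f}")
```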
Abstract. The book by Tarski and Givant, A Formalization of Set Theory Without Variables, describes
a  powerful  system  of  equational  logic  based  on  proper  binary  relations.  This  simple  system  is  rich  enough 
to  serve  either  as  a  specification  language  for  describing  interacting  systems,  a  concise  and  flexible  query 
language,  or  a  notation  for  program  synthesis. 

We consider the problem of program synthesis and introduce an operator to specify single linear
recursions. Equations about such recursions are chained together to give a method of query transformation that
collapses  loops  by  rewriting.  The  equations  hold  under  all  interpretations  and  are  appropriate  support  for 
a  distributed  database  in  which  each  node  has  its  own  snapshot  of  the  data.  These  results  are  part  of  an 
effort  to  find  a  mechanism  for  quickly  expressing  and  efficiently  solving  complex  logical  queries. 


1. Introduction. One of the most attractive program development methods is transformational
programming. This report describes the synthesis and transformation of logical
queries. The assumptions are that

1. Queries are to be initially written as straightforward, high level specifications. More
efficient, but probably less readable, queries are derived through correctness-preserving
transformations.

2. Functional and relational operators provide an appropriate context for specifications.
They have the algebraic properties required for the transformations.

The central contribution is the application of the relation algebras and Q-systems of
Tarski [1] to query optimization. The notation is attractive, as it is a convenient combination
of logic programming and functional programming. This middle ground has the expressibility
of the former and the manipulability of the latter. Specifically, a key contribution
is an operator for linear recursions over regularly structured objects along with equations
for merging loops and propagating constraints. The objects are terms of a single binary
function symbol.

2. Motivation. Relational queries are typically straightforward, declarative expressions.
The nonprocedural quality of such queries is intuitive and leads to easy optimization.
However, query languages are limited in their ability to compute results. The usual repair
is to combine the query language with a procedural language, resulting in a large increase
in complexity and error-prone programming.

Because  of  their  declarative  nature,  relational  systems  are  appropriate  for  quickly  and 
naively  formulating  queries.  This  is  important  for  time  critical  activities  in  which  machine 
efficiency  is  secondary  to  overall  completion  time.  On  the  other  hand,  the  limited  power  of 
relational  query  languages  has  meant  an  inability  to  handle  unpredictable  situations  or  plan 
for  the  unexpected.  In  other  words,  all  the  planning  must  have  been  done  in  the  laboratory 
instead  of  in  the  field. 

In  order  to  allow  the  customer  some  flexible  planning  ability,  such  a  system  needs 
some  extensibility  and  much  safety.  This  extensibility  is  independent  of  concerns  about 
space  and  time  requirements  or  accounts  of  storage  allocation  and  reclamation.  Freedom 
of  control  over  storage  is  a  primary  characteristic  of  declarative  programming.  Declarative 
programming  is  a  problem  oriented  style  of  programming. 

Finally, software is often developed in an ad hoc fashion. The design of computational
tools usually begins with the hardware, followed by the construction of a generally useful
collection of operations, ending with (often inadequate) attempts at optimization. The
alternative approach taken here is to consider optimization first, that is, to choose the
notation so that optimization is easy. This suggests that program operations will have
many algebraic properties.

    Symbol                             Meaning
    A-Z                                variables
    a-z                                constants or function symbols
    T                                  set of terms
    S                                  set of ground terms
    B(S)                               set of binary relations on S
    $\wedge$                           logical and
    $\vee$                             logical or
    $\neg$                             logical negation
    $\exists$                          existential quantifier
    $\forall$                          universal quantifier
    $\leftrightarrow$                  if and only if
    $\equiv$                           equivalent to
    $\stackrel{\mathrm{def}}{=}$       defined as
    $(\cdot|\cdot)$                    ordered couple

Table 1

Summary of notation

3. Preliminaries. The notation is to be interpreted both mathematically and as program
constructs with the clear understanding that the two are distinct. Within definitions
and arguments for correctness we follow an extended predicate logic $\mathcal{L}^{+}$ [1]. This system
has two important levels: a logic for program specification and an algebra for program
optimization.

Every program is specified as a relation between inputs and outputs and includes subprograms
specifying predicates on pairs, or as generators of pairs. Program forming operations
are operations on relations and optimization rules are equations between relations. Control
is assumed to proceed from left to right, top to bottom. Efficiency of query solution is heavily
dependent on both data and control. Some operations, such as negations, are delayed
until arguments are available. Transformations collapse redundant computations or reorder
to avoid delaying. The objective is to schedule relations to uniquely and efficiently generate
answers.

The theory of relations is one of the most thoroughly developed branches of logic [1-3].
The algebraic terminology categorizes a portion of the large collection of relation equations.
It also helps describe algorithms abstractly, independently of a model. The most specific
structures are relation algebras and Q-relation algebras as described in [1]. Table 1 is a
summary of notation.

    Relation symbol        Meaning
    0                      empty relation
    1                      universal relation
    $R + S$                sum of relations
    $R \cdot S$            product of relations
    $R \Box S$             ordered coupler of relations
    $R \odot S$            relative product or composition
    $\overline{R}$         complement of the relation R
    $R^{\smile}$           converse of a relation
    $R^{*}$                arbitrary exponentiation of R
    $[R|S]$                relational constructor of couples
    $pi(D,S,C)$            linear recursion
    id                     identity relation (also written 1')
    di                     diversity relation, 0'
    hd                     first projection, a
    tl                     second projection, b

Table 2

Summary of Relations and Relation Operators

In particular, the calculus of binary relations follows the well understood laws of Boolean
algebra. A Boolean algebra is defined as a structure $(U, +, \overline{\phantom{x}})$ satisfying a set of equations
as axioms. Although there are no implicit assumptions about the underlying universe $U$, we
are most interested in Boolean algebras defined on $B(S)$, the set of binary relations between
terms. The following is a sample axiomatization of a Boolean algebra.

(1)  $(x + y) + z = x + (y + z)$,

(2)  $x + y = y + x$,

(3)  $x = \overline{\overline{x} + \overline{y}} + \overline{\overline{x} + y}$.

A relation algebra (an RA) is a structure $(U, +, \overline{\phantom{x}}, \odot, {}^{\smile}, id)$. The common language
describes these operations as sum, complementation, composition, converse, and the identity.
A summary of the relation operators appears in Table 2. The following is an axiomatization
for an RA [1,4].


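(4)   $(F + G) + H = F + (G + H)$,

(5)   $F + G = G + F$,

(6)   $F = \overline{\overline{F} + \overline{G}} + \overline{\overline{F} + G}$,

(7)   $F \odot (G \odot H) = (F \odot G) \odot H$,

(8)   $(F + G) \odot H = F \odot H + G \odot H$,

(9)   $F \odot id = F$,

(10)  $F^{\smile\smile} = F$,

(11)  $(F + G)^{\smile} = F^{\smile} + G^{\smile}$,

(12)  $(F \odot G)^{\smile} = G^{\smile} \odot F^{\smile}$,

(13)  $F^{\smile} \odot \overline{F \odot G} + \overline{G} = \overline{G}$.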


From these fundamental operations we can abstractly define other relations and relation
operators. Positive, logical definitions of product $F \cdot G$ and the universal relation 1 are
preferred. They are more easily implemented and constructive proofs are more intuitive.
Implementation considerations for negations such as complement and diversity di are discussed
in the next section. The following are further definitions in terms of the fundamental
operations.

(14)  $F \cdot G \stackrel{\mathrm{def}}{=} \overline{\overline{F} + \overline{G}}$,

(15)  $1 \stackrel{\mathrm{def}}{=} id + \overline{id}$,

(16)  $0 \stackrel{\mathrm{def}}{=} \overline{1}$,

(17)  $di \stackrel{\mathrm{def}}{=} \overline{id}$.

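It is easy, but tedious, to check that the following logical definitions for these relation
operators satisfy the properties of an RA.

(18)  $\forall XY\{X\ (F \cdot G)\ Y \leftrightarrow X\ F\ Y \wedge X\ G\ Y\}$,

(19)  $\forall XY\{X\ (F + G)\ Y \leftrightarrow X\ F\ Y \vee X\ G\ Y\}$,

(20)  $\forall XY\{X\ (F \odot G)\ Y \leftrightarrow \exists Z\{X\ F\ Z \wedge Z\ G\ Y\}\}$,

(21)  $\forall XY\{X\ F^{\smile}\ Y \leftrightarrow Y\ F\ X\}$,

(22)  $\forall X\{X\ id\ X\}$,

(23)  $\forall XY\{X\ 1\ Y\}$,

(24)  $\forall XY\{X\ \overline{F}\ Y \leftrightarrow \neg\{X\ F\ Y\}\}$,

(25)  $\forall XY\{X\ di\ Y \leftrightarrow \neg\{X\ id\ Y\}\}$.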
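The "easy, but tedious" check can be delegated to a machine for a small finite universe. The following sketch models relations as sets of pairs over a two-element universe and tests axioms (4)-(13) exhaustively. The universe `U` and all function names are assumptions made for illustration; a finite check of this kind only illustrates the axioms and is not a proof for arbitrary universes.

```python
from itertools import product

U = (0, 1)                                   # small illustrative universe
ONE = frozenset(product(U, U))               # universal relation 1, as in (23)
ID = frozenset((x, x) for x in U)            # identity relation id, as in (22)

def plus(f, g):                              # (19) sum
    return f | g

def comp(f, g):                              # (20) relative product
    return frozenset((x, y) for (x, z) in f for (w, y) in g if z == w)

def conv(f):                                 # (21) converse
    return frozenset((y, x) for (x, y) in f)

def neg(f):                                  # (24) complement
    return ONE - f

def times(f, g):                             # (14) product defined from sum and complement
    return neg(plus(neg(f), neg(g)))

def check_axioms(f, g, h):
    assert plus(plus(f, g), h) == plus(f, plus(g, h))                     # (4)
    assert plus(f, g) == plus(g, f)                                       # (5)
    assert f == plus(neg(plus(neg(f), neg(g))), neg(plus(neg(f), g)))     # (6)
    assert comp(f, comp(g, h)) == comp(comp(f, g), h)                     # (7)
    assert comp(plus(f, g), h) == plus(comp(f, h), comp(g, h))            # (8)
    assert comp(f, ID) == f                                               # (9)
    assert conv(conv(f)) == f                                             # (10)
    assert conv(plus(f, g)) == plus(conv(f), conv(g))                     # (11)
    assert conv(comp(f, g)) == comp(conv(g), conv(f))                     # (12)
    assert plus(comp(conv(f), neg(comp(f, g))), neg(g)) == neg(g)         # (13)
    assert times(f, g) == f & g                                           # (14) agrees with (18)

# Enumerate every relation on U (all subsets of U x U) and test all triples.
pairs = sorted(ONE)
relations = [frozenset(p for p, keep in zip(pairs, bits) if keep)
             for bits in product((0, 1), repeat=len(pairs))]
for f, g, h in product(relations, repeat=3):
    check_axioms(f, g, h)
print("axioms (4)-(13) hold for all", len(relations), "relations on", U)
```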

These operators satisfy many equations. For example, some simple properties of sum
and product are

(26)  $A + 0 = A$,

(27)  $0 + A = A$,

(28)  $(A \cdot B) \cdot C = A \cdot (B \cdot C)$,

(29)  [illegible]

(30)  [illegible]

For any given RA = $(U, \cup, \cap, \overline{\phantom{x}}, 0, 1)$, two elements a, b are called quasiprojections if
$a^{\smile} \odot a \subseteq id$, $b^{\smile} \odot b \subseteq id$, and $a^{\smile} \odot b = 1$. An RA is called a Q-relation algebra if its operators
include quasiprojections.

These  definitions  say  nothing  about  the  intended  realization,  and  their  abstractness 
is  what  makes  them  appropriate  for  describing  broadly  applicable  program  operations.  In 
[1]  the  intended  interpretation  is  encapsulated  into  a  membership  relation  E.  The  laws  of 
an  RA  and  a  Q-relation  algebra  hold  for  any  definition  of  E,  and  E  connects  the  relation 
algebras  with  a  particular  universe  over  which  the  relations  are  to  range.  In  an  application, 
the  query  language  is  the  set  of  abstract  operations  and  the  database  is  E. 

In particular, a Q-relation algebra defined over a nontrivial universe contains ordered
pairs or couples and the quasiprojections suggest that there are operations for selecting
components of these couples. Two such selectors over ordered couples called conjugated
projections (or just projections) are defined and represented by hd and tl.

(31)  $\forall XY\{(X|Y)\ hd\ X\}$,

(32)  $\forall XY\{(X|Y)\ tl\ Y\}$.

Note that they satisfy $hd^{\smile} \odot hd = id$, $tl^{\smile} \odot tl = id$, and $hd^{\smile} \odot tl = 1$. Thus they qualify as
quasiprojections. These projections select components of ordered couples. The following
operators construct or perform related functions on couples.

(33)  $[F|G] \stackrel{\mathrm{def}}{=} (F \odot hd^{\smile}) \cdot (G \odot tl^{\smile})$,

(34)  $F \Box G \stackrel{\mathrm{def}}{=} [hd \odot F \,|\, tl \odot G]$.

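The following properties hold for construction and coupling operators. They all have similar
proofs.

(35)  $[F + G|H] = [F|H] + [G|H]$,

(36)  $[F|G + H] = [F|G] + [F|H]$,

(37)  $[F|G] \cdot [H|K] = [(F \cdot H)|(G \cdot K)]$,

(38)  $(F \Box G) \odot (H \Box K) = (F \odot H) \Box (G \odot K)$,

(39)  $(F \Box (G \odot W)) \odot (R \Box S) = (F \Box G) \odot (R \Box (W \odot S))$,

(40)  $((F \odot W) \Box G) \odot (R \Box S) = (F \Box G) \odot ((W \odot R) \Box S)$,

(41)  $(F \Box G)^{\smile} = (F^{\smile} \Box G^{\smile})$,

(42)  $(F + G) \Box H = (F \Box H + G \Box H)$,

(43)  $F \Box (G + H) = (F \Box G + F \Box H)$,

(44)  $(F \Box G) \cdot (H \Box K) = (F \cdot H) \Box (G \cdot K)$.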

These preliminaries from the theory of relations have been studied elsewhere. Closely
related are the FP systems of Backus [5] as they also depend upon operators, particularly
composition, to form more complex programs out of simpler ones. Berghammer and Zierer
[6] have given a relation algebra semantics for FP-like languages. Mili, Desharnais, and
Mili [7] have given heuristics for the design of deterministic programs from relational equations.
This paper describes a transformation system based on the operators, relations, and
equations of a Q-relation algebra.

4. A facility for definitions. The logical definitions of $+$, $\cdot$, $\odot$, ${}^{\smile}$, $\overline{\phantom{x}}$, hd, tl, $\Box$, [ | ], id,
and 1 (18-25) suggest an implementation with an SLDCNF-resolution [8] logic programming
system in the logic of predicates of three variables. For example, the goal $\exists X, Y\,(X\ R\ Y)$
would be represented as :- p(X, R, Y). Goals are solved in left-to-right, top-to-bottom
order. SLDCNF is briefly discussed in the section on negations.

We  insist  that  no  definition  introduce  variables  into  the  relation  argument.  Thus  every 
literal  in  the  body  of  a  program  clause  of  the  form  p( X,  R,  Y )  will  contain  only  variables  R 
that  were  named  in  the  clause  head.  Therefore  if  a  goal  is  p(X,R,Y)  where  R  is  ground, 
then  every  relation  argument  in  every  subgoal  will  be  ground. 

The expression $R \equiv S$ is to be read as 'R is equivalent to, but should be rewritten
as, S.' This is represented in a logic program as the clause p(X, R, Y) :- p(X, S, Y),
which is the only clause defining R. Conjunctions are solved from left to right. The intent
is to match control with the order in which variables are bound. Therefore we partition
the arguments of a relation into two structured components, an 'input' and an 'output.'
Predicates of single arguments are conceptually extended to two arguments, both of which
are the same value.

4.1. Negations. The identity X id Y unifies terms X and Y. The diversity relation
depends on an antiunification algorithm [9] and is denoted A di B. If A and B are both
ground  constants  then  A  di  B  will  fail  or  succeed  depending  on  whether  they  are  the  same 
or  different  constants.  On  the  other  hand,  if  they  contain  unbound  variables  then  they  are 
delayed  until  arguments  are  known  or  output  as  answers  if  they  are  never  known. 

Terms  with  structure  are  solved  recursively  in  a  manner  similar  to  unification.  If  the 
two  terms  A  and  B  have  either  different  principal  functors  or  different  numbers  of  arguments 
then  antiunification  succeeds  without  new  inequality  constraints.  On  the  other  hand,  with 
the  same  principal  functor  and,  say  N,  arguments,  diversity  in  any  argument  is  enough 
for  antiunification  to  succeed.  Thus  possibly  N  new  choice  points  are  created  by  recursing 
the  algorithm  on  these  arguments.  If  one  of  the  corresponding  pairs  of  arguments  can  be 
determined  to  be  different  then  no  other  alternatives  need  be  considered  as  the  algorithm 
terminates  successfully. 

For example, consider the problem f(X, g(a)) di f(u, g(Y)). The two solutions (inequality
constraints) are $X \neq u$ and $Y \neq a$. On the other hand, the subgoal f(X, a) di f(Y, b)
succeeds with no new inequality constraints.
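The recursive treatment of structured terms described above can be sketched as follows. This is a minimal illustration, not the algorithm of [9]: the term encoding (strings starting with an upper-case letter are variables, other strings are constants, tuples are compound terms), the helper names, and the return convention are all assumptions, and delayed constraints are simply returned rather than re-examined when variables become bound.

```python
def is_var(t):
    """Variables are strings that start with an upper-case letter (Prolog convention)."""
    return isinstance(t, str) and t[:1].isupper()

def functor(t):
    """Return (name, arguments); a constant is treated as a functor with no arguments."""
    return (t[0], t[1:]) if isinstance(t, tuple) else (t, ())

def di(a, b):
    """Solve the goal `a di b`.

    Returns a list of alternative solutions, each a tuple of inequality
    constraints (lhs, rhs). [] means failure; [()] means success with no
    new inequality constraints.
    """
    if is_var(a) or is_var(b):
        if a == b:
            return []                        # X di X can never hold
        if not is_var(a):
            a, b = b, a                      # write the variable first
        return [((a, b),)]                   # delayed inequality constraint
    (fa, args_a), (fb, args_b) = functor(a), functor(b)
    if fa != fb or len(args_a) != len(args_b):
        return [()]                          # different functor or arity
    solutions = []
    for x, y in zip(args_a, args_b):         # one choice point per argument
        sub = di(x, y)
        if () in sub:
            return [()]                      # known to differ: no alternatives needed
        solutions.extend(sub)
    return solutions

print(di(("f", "X", ("g", "a")), ("f", "u", ("g", "Y"))))
print(di(("f", "X", "a"), ("f", "Y", "b")))
```

The first call returns the two alternative constraints of the example and the second returns a single solution with no constraints.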

Negation as finite failure cannot compute the complement of a relation. The goal
$\exists XY,\ X\ \overline{R}\ Y$ is equivalent to $\exists XY,\ \neg(X\ R\ Y)$, but negation by failure instead determines
$\neg(\exists XY,\ X\ R\ Y)$. Although negation as finite failure has the usual logical interpretation for
ground goals [10], in the face of transformation we cannot be assured when a variable will
be ground.

"Constructive negation" is an extension to negation by failure [8]. The elements of R are
not  necessarily  constructed,  but  instead  the  resolution  procedure  is  extended  with  inequality 
constraints  which  are  delayed  until  the  arguments  are  known.  If  arguments  are  never  known 
then  these  constraints  are  returned  as  answers.  Thus,  solutions  to  nonground,  negative 
subgoals  are  constructed  as  a  set  of  equations  and  inequations.  Constructive  negation  has 
both  a  clean  semantics  and  the  advantage  of  speeding  up  some  computations  although  a 
set  of  solutions  must  be  finite  to  be  complemented. 

4.2.  Sequences.  A  convenient  way  of  sharing  an  input  to  more  than  one  function  is 
with  Backus’  constructor  functional[5].  A  similar  constructor  operation  on  relations  can  be 
defined.  The  ordered  coupling  operator  creates  an  ordered  couple  as  an  output  from  a  pair 
of  inputs  in  a  one  to  one  fashion.  Both  operations  are  strict.  While  complex  structures  are 
made  with  constructors,  they  are  taken  apart  with  hd  and  tl.  These  are  called  selectors  by 
Backus.  Selectors  disassemble  what  the  constructors  and  ordered  couplers  build. 

There is a constant available to represent the end of lists. Since terms are finite, hd
cannot always succeed. This implies an indivisible constant. The symbol $\langle\rangle$ will represent
this constant. A sequence of one element A will be represented as $\langle A \rangle$, that is $(A|B)$ where
$B = \langle\rangle$. The function null tests sequences for emptiness and is an identity on $\langle\rangle$. That is,
null is just the single pair $(\langle\rangle, \langle\rangle)$.

An associated relation to deposit this symbol into a list construction can be defined.
This empty sequencer is a constant function that ignores other domain elements, returning
the object $\langle\rangle$. This function is indicated as [ ]. It is defined as $[\ ] \equiv 1 \odot null$. The following
are some simple properties of null and [ ].

(45)  $[A|B] \odot null = 0$,

(46)  $(A \Box B) \odot null = 0$,

(47)  $null \odot (A \Box B) = 0$,

(48)  $[\ ] \odot null = [\ ]$,

(49)  $null \odot [A|B] = [null \odot A \,|\, null \odot B]$.

5. Linear recursion. The exponentiation or closure of a relation R is R repeatedly
composed with itself and is defined as $R^{*} = id + R \odot R^{*}$. As an illustration consider finding
the greatest common divisor of two natural numbers using subtract as defined only on that
set. Thus (3|5) subtract Y would fail but (5|3) subtract 2 succeeds.

Example 1 Greatest common divisor

$gcd \stackrel{\mathrm{def}}{=} ((id + [tl|hd]) \odot [subtract|tl])^{*} \odot (hd \cdot tl)$.
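Example 1 can be exercised with a small executable model. The sketch below is an assumption-laden illustration, not the paper's implementation: a relation is modelled as a Python function from an input to the list of its outputs (so only left-to-right use is supported), couples are 2-tuples, closure is computed as the set of values reachable from the input, and natural numbers are taken to be positive integers so that subtract fails unless the first component is strictly larger.

```python
def is_pair(x):
    return isinstance(x, tuple) and len(x) == 2

def ident(x):
    return [x]

def hd(x):
    return [x[0]] if is_pair(x) else []

def tl(x):
    return [x[1]] if is_pair(x) else []

def plus(f, g):                      # sum F + G: alternatives of both relations
    return lambda x: f(x) + g(x)

def times(f, g):                     # product: outputs common to F and G
    return lambda x: [y for y in f(x) if y in g(x)]

def cons(f, g):                      # constructor [F|G]
    return lambda x: [(a, b) for a in f(x) for b in g(x)]

def comp(*fs):                       # composition, applied left to right
    def rel(x):
        xs = [x]
        for f in fs:
            xs = [y for v in xs for y in f(v)]
        return xs
    return rel

def star(r):
    """Closure R* = id + R . R*, computed as everything reachable from x."""
    def rel(x):
        seen, todo = [x], [x]
        while todo:
            for y in r(todo.pop()):
                if y not in seen:
                    seen.append(y)
                    todo.append(y)
        return seen
    return rel

def subtract(x):
    """(a|b) subtract (a - b); assumed to fail unless a > b."""
    a, b = x
    return [a - b] if a > b else []

gcd = comp(star(comp(plus(ident, cons(tl, hd)), cons(subtract, tl))),
           times(hd, tl))

print(gcd((6, 18)))
```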

A  simple  program  can  be  envisioned  to  start  with  two  numbers  and  continually  subtract 
the  second  number  from  the  first  until  the  numbers  are  the  same.  If  necessary,  it  reorders 
the  numbers  so  that  the  largest  is  first.  A  trace  of  the  computation  on  the  couple  (6|18) 
is  (6|18)  =>  (18|6)  =>  (12|6)  =>  (6|6)  =>  6.  Simple  exponentiation  is  not  expressive  enough 
and  is  naturally  oriented  to  elements  without  structure.  Instead  a  linear  recursion  operator 
is  defined  to  extend  the  effect  of  the  ordered  coupler  to  lists  and  list-like  structures.  This 
operator  divides  the  structure  into  a  couple  with  D,  solves  for  base  cases  with  S,  then 
combines  the  elements  of  a  couple  with  C. 

(50)  $pi(D, S, C) \stackrel{\mathrm{def}}{=} S + D \odot (id \Box pi(D, S, C)) \odot C$.

Arbitrary exponentiation can be included with pi as $R^{*} = pi([1|R], id, tl)$. Also, we can
define map to apply the effect of a relation to each item of a sequence. Thus for example,
the goal $\langle 2, 3 \rangle\ map(id + sub1)\ Y$ has four solutions for Y, namely
$\langle 2, 3 \rangle$, $\langle 2, 2 \rangle$, $\langle 1, 3 \rangle$, and $\langle 1, 2 \rangle$. The definition is

(51)  $map(R) = pi(id, null, R \Box id)$.
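Definitions (50) and (51) can be tried out in a directional model in which a relation is a Python function returning the list of outputs for one input. This is a sketch under stated assumptions: sequences are nested couples ending in the empty tuple, the helper `seq` and the name `map_` are illustrative, and `sub1` is assumed to be defined only on integers greater than or equal to 1.

```python
def is_pair(x):
    return isinstance(x, tuple) and len(x) == 2

def ident(x):
    return [x]

def hd(x):
    return [x[0]] if is_pair(x) else []

def tl(x):
    return [x[1]] if is_pair(x) else []

def null(x):                         # identity on the empty sequence only
    return [()] if x == () else []

def plus(f, g):                      # sum F + G
    return lambda x: f(x) + g(x)

def cons(f, g):                      # constructor [F|G], equation (33)
    return lambda x: [(a, b) for a in f(x) for b in g(x)]

def comp(*fs):                       # composition, applied left to right
    def rel(x):
        xs = [x]
        for f in fs:
            xs = [y for v in xs for y in f(v)]
        return xs
    return rel

def couple(f, g):                    # ordered coupler, equation (34)
    return cons(comp(hd, f), comp(tl, g))

def pi(d, s, c):
    """Linear recursion (50): S + D . (id coupled with pi(D,S,C)) . C."""
    def rel(x):
        return s(x) + comp(d, couple(ident, rel), c)(x)
    return rel

def map_(r):                         # equation (51)
    return pi(ident, null, couple(r, ident))

def sub1(x):                         # assumed: defined only for integers >= 1
    return [x - 1] if isinstance(x, int) and x >= 1 else []

def seq(*items):
    """Nested-couple form of a sequence: seq(2, 3) == (2, (3, ()))."""
    out = ()
    for item in reversed(items):
        out = (item, out)
    return out

solutions = map_(plus(ident, sub1))(seq(2, 3))
print(solutions)
```

The recursion terminates because hd and tl fail on the empty sequence, so the recursive branch contributes nothing once the base case is reached.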

In our relations we collect inputs into a single structured input. For example, the Prolog
goal append(A, B, C) is written as (A, B) append C. On the other hand, predicates of a
single argument become two-argument subrelations of the identity. An example is 3 odd 3.
These extensions are important because they allow us to use higher order operators with
Boolean valued relations to define new predicate operators.

All recursions that follow will be given in terms of pi. They are preliminary to the
example transformations. The next definition is the function that cumulatively concatenates
a sequence of sequences.

(52)  $conc = pi(id, null, append)$.

(53)  $append = pi([hd \odot hd \,|\, tl \Box id], (null \Box id) \odot tl, id)$.

An item can be coupled to each element of a sequence with either distr or distl as in
[5]. These functions can be simply defined in terms of pi and the definitions expose their
inherent symmetry.

(54)  $distr = pi([hd \Box id \,|\, tl \Box id], hd \odot null, id)$.

(55)  $distl \stackrel{\mathrm{def}}{=} pi([id \Box hd \,|\, id \Box tl], tl \odot null, id)$.

To see what is happening here consider, for example, the function distr. It inputs an ordered
couple, a sequence and an item. The result is a sequence of couples each of which has the
given item as a second component. Schematically this is

$(\langle A_1, \ldots, A_N \rangle, C)\ distr\ \langle (A_1, C), (A_2, C), \ldots, (A_N, C) \rangle$.

Many relations can now be concisely defined in a single expression. For example, we can
define member on a couple, an element and a sequence. Thus member is simply a predicate
(an identity) that tests the element for membership in the sequence. For example both
$(3, \langle 2, 3, 4 \rangle)\ member\ (3, \langle 2, 3, 4 \rangle)$ and $(3, \langle 5 \rangle)\ nonmember\ (3, \langle 5 \rangle)$ are true.

(56)  $select \stackrel{\mathrm{def}}{=} pi(id, hd, tl)$.

(57)  $member \stackrel{\mathrm{def}}{=} [hd \cdot (tl \odot select) \,|\, tl]$.

(58)  $nonmember \stackrel{\mathrm{def}}{=} [hd \,|\, distl \odot map((hd \odot di) \cdot tl)]$.

The  following  equations  characterize  not  only  some  forms  of  loop  merging  but  can  also 
propagate  constraints.  Since  control  is  left  to  right,  constraints  on  search  are  best  performed 
as  soon  as  possible.  The  following  equation  enables  further  propagation  of  constraints. 

Proposition 1 Merging recursions. If $(id \Box id) \odot C_1 \odot S_2 = 0$ $\wedge$ $S_1 \odot D_2 \odot (id \Box id) = 0$
$\wedge$ $(C_1 \odot D_2 = id \Box id \vee C_1 \odot D_2 = id)$, then

$pi(D_1, S_1, C_1) \odot pi(D_2, S_2, C_2) = pi(D_1, S_1 \odot S_2, C_2)$.

The proof is by induction over arguments of relations. The arguments are defined on a
well founded set, ordered by $D_1$ and $C_2$. Each pi-term in the expression
$pi(D_1, S_1, C_1) \odot pi(D_2, S_2, C_2)$ is unfolded to arrive at

$(S_1 + D_1 \odot (id \Box pi(D_1, S_1, C_1)) \odot C_1) \odot (S_2 + D_2 \odot (id \Box pi(D_2, S_2, C_2)) \odot C_2)$.

Equations in the hypothesis eliminate the cross terms to give

$S_1 \odot S_2 + D_1 \odot (id \Box pi(D_1, S_1, C_1)) \odot (id \Box pi(D_2, S_2, C_2)) \odot C_2$.

Now we can apply 38 to bring the two pi-terms together.

$S_1 \odot S_2 + D_1 \odot (id \Box (pi(D_1, S_1, C_1) \odot pi(D_2, S_2, C_2))) \odot C_2$.

Applying the induction hypothesis we now have

$S_1 \odot S_2 + D_1 \odot (id \Box pi(D_1, S_1 \odot S_2, C_2)) \odot C_2$.

Folding, we finally conclude that $pi(D_1, S_1, C_1) \odot pi(D_2, S_2, C_2) = pi(D_1, S_1 \odot S_2, C_2)$.
Using equations 46 and 47 we can state a useful corollary as

$pi(D, S \odot null, id) \odot pi(id, null \odot T, C) = pi(D, S \odot null \odot T, C)$.


Proposition 2 Propagation of constraints.

$pi(D, S, (R \Box id) \odot C) = pi(D \odot (R \Box id), S, C)$.

Again D must map a well founded set into lists. By induction and equation 34

$pi(D, S, (R \Box id) \odot C) = S + D \odot (id \Box pi(D, S, (R \Box id) \odot C)) \odot (R \Box id) \odot C$
$= S + D \odot (R \Box id) \odot (id \Box pi(D, S, (R \Box id) \odot C)) \odot C$
$= S + D \odot (R \Box id) \odot (id \Box pi(D \odot (R \Box id), S, C)) \odot C$
$= pi(D \odot (R \Box id), S, C)$.

The relation R has changed positions. Now $R \Box id$ tests on the way down, before any large
structure has been built, instead of on the way back up. A simple consequence is the
equation $map(F) \odot map(G) = map(F \odot G)$. This follows from the previous two propositions
and the fact that $map(R) = pi(id, null, R \Box id)$.

If a concatenation of sequences is required to be empty, we can avoid the concatenation
by requiring that each subsequence of the sequence be empty.

(59)  $conc \odot null = map(null) \odot [\ ]$.

This rule follows by induction from the definitions of conc, null, and map with equations
8, 49, 39, and 40.

6.  Transformations.  The  equations  developed  form  the  foundation  for  a  nontrivial 
program  synthesis  that  is  interesting  for  two  reasons.  Primarily,  it  relies  less  on  heuristics 
than, for example, the Burstall and Darlington fold/unfold technique [11]. Instead it is
directed  by  the  operator  definitions  and  equations  between  relations.  Therefore  the  method 
is  easily  mechanized.  Secondly,  unlike  Hogger’s  techniques  [12],  it  is  carried  out  using 
relation  level  reasoning  without  resorting  to  object  level  variables.  This  saves  symbols  and 
makes  the  derivation  more  concise  and  broadly  applicable. 

The sample problem has two parts. The first part finds the list intersection of two lists.
This might be an example of a library program. Library programs may well be written
efficiently for the immediate application but combinations are often inefficient. If two programs
are written so that constraints are applied as early as possible, before alternatives
are created, the composition may have some constraints that are applied too late.

A programmer should not expect to be penalized with the inefficiencies of library programs.
The second program exhibits this problem. It is a clear, easy to understand program
that simply tests to see if two sequences are disjoint. Using the rules developed earlier, we
remove the inefficiencies in that program.

6.1. Disjointedness as empty intersection. The first part of this program finds
the intersections of two lists by distributing one list over the elements of the other, then
finding those other elements that are members of the list.

(60)  $list\text{-}intersect = distr \odot map(member \odot [hd] + nonmember \odot [\ ]) \odot conc$.

The  program  disjoint  determines  if  two  lists  are  disjoint  by  simply  testing  for  an  empty 
intersection.  If  it  returns  any  answer  at  all  then  the  two  lists  are  disjoint.  As  defined,  this 
program  is  unnecessarily  inefficient  although  it  is  understandable  in  terms  of  its  parts. 


(61)  $disjoint \stackrel{\mathrm{def}}{=} list\text{-}intersect \odot null$.

The transformation strategy applies relation equations outwards in, from left to right.
After each successful rewrite the enclosing expression is attempted before deeper optimization
is done to the subexpressions [13]. The first two recursions are within the definition
of list-intersect. Let buildone represent the expression $member \odot [hd] + nonmember \odot [\ ]$.
Then,

$list\text{-}intersect \odot null = distr \odot map(buildone) \odot conc \odot null$.

Attention is directed to the two recursions in $distr \odot map(buildone)$. Both distr and map
are defined with pi so they are merged and simplified.

$distr \odot map(buildone) = pi([hd \Box id \,|\, tl \Box id], hd \odot null, id) \odot map(buildone)$,

$= pi([hd \Box id \,|\, tl \Box id], hd \odot null, id) \odot pi(id, null, buildone \Box id)$,

$= pi([hd \Box id \,|\, tl \Box id], hd \odot null, buildone \Box id)$,

$= pi([hd \Box id \,|\, tl \Box id] \odot (buildone \Box id), hd \odot null, id)$,

$= pi([(hd \Box id) \odot buildone \,|\, (tl \Box id) \odot id], hd \odot null, id)$,

$= pi([(hd \Box id) \odot buildone \,|\, tl \Box id], hd \odot null, id)$.

Next, $conc \odot null$ is expanded to $map(null) \odot [\ ]$ which is $pi(id, null, null \Box id) \odot [\ ]$. Thus
$list\text{-}intersect \odot null$ is now two recursions followed by [ ]. This is

$pi([(hd \Box id) \odot buildone \,|\, tl \Box id], hd \odot null, id) \odot pi(id, null, null \Box id) \odot [\ ]$.

Once again we merge recursions and propagate constraints to obtain

$list\text{-}intersect \odot null = pi([(hd \Box id) \odot buildone \,|\, tl \Box id], hd \odot null, null \Box id) \odot [\ ]$,

$= pi([(hd \Box id) \odot buildone \,|\, tl \Box id] \odot (null \Box id), hd \odot null, id) \odot [\ ]$,

$= pi([(hd \Box id) \odot buildone \odot null \,|\, (tl \Box id) \odot id], hd \odot null, id) \odot [\ ]$.

Now buildone is a sum, over which we can distribute null to obtain
$buildone \odot null = member \odot [hd] \odot null + nonmember \odot [\ ] \odot null$. In addition, one branch
of the sum goes away as $[hd] \odot null$ always fails. Therefore, we have

$disjoint = pi([(hd \Box id) \odot (nonmember \odot [\ ] \odot null) \,|\, (tl \Box id) \odot id], hd \odot null, id) \odot [\ ]$,

$= pi([(hd \Box id) \odot (nonmember \odot [\ ]) \,|\, tl \Box id], hd \odot null, id) \odot [\ ]$.

This  is  the  order  in  which  the  implemented  transformation  system  rewrites  the  original 
program.  The  mechanically  derived  program  for  disjoint  tests  for  nonmembership  before 
creating  large  structures.  When  the  two  lists  differ  at  their  first  components,  the  remaining 
components  need  not  be  tested.  For  two  lists  of  30  elements,  this  program  is  approximately 
100  times  faster  than  the  original. 
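The equivalence of the specification and the derived program can be spot-checked in a directional model where a relation maps an input to the list of its outputs. Everything below is a sketch under assumptions: sequences are nested couples ending in the empty tuple, `empty` stands in for [ ], `nonmember` is coded as a direct test instead of through di as in (58), and the names `disjoint_spec`, `disjoint_fast`, and `seq` are illustrative. The sketch checks agreement of results only and makes no attempt to reproduce the reported timings.

```python
def is_pair(x): return isinstance(x, tuple) and len(x) == 2
def ident(x):   return [x]
def hd(x):      return [x[0]] if is_pair(x) else []
def tl(x):      return [x[1]] if is_pair(x) else []
def null(x):    return [()] if x == () else []
def empty(x):   return [()]                  # stands in for [ ]

def plus(f, g):  return lambda x: f(x) + g(x)
def times(f, g): return lambda x: [y for y in f(x) if y in g(x)]
def cons(f, g):  return lambda x: [(a, b) for a in f(x) for b in g(x)]

def comp(*fs):                               # composition, left to right
    def rel(x):
        xs = [x]
        for f in fs:
            xs = [y for v in xs for y in f(v)]
        return xs
    return rel

def couple(f, g): return cons(comp(hd, f), comp(tl, g))        # (34)

def pi(d, s, c):                                               # (50)
    def rel(x):
        return s(x) + comp(d, couple(ident, rel), c)(x)
    return rel

def map_(r): return pi(ident, null, couple(r, ident))          # (51)

append = pi(cons(comp(hd, hd), couple(tl, ident)),
            comp(couple(null, ident), tl), ident)              # (53)
conc = pi(ident, null, append)                                 # (52)
distr = pi(cons(couple(hd, ident), couple(tl, ident)),
           comp(hd, null), ident)                              # (54)
select = pi(ident, hd, tl)                                     # (56)
member = cons(times(hd, comp(tl, select)), tl)                 # (57)

def nonmember(x):
    # Assumption: a direct membership test replaces the di-based definition (58).
    return [x] if is_pair(x) and x[0] not in select(x[1]) else []

buildone = plus(comp(member, cons(hd, empty)), comp(nonmember, empty))
list_intersect = comp(distr, map_(buildone), conc)             # (60)
disjoint_spec = comp(list_intersect, null)                     # (61)
disjoint_fast = comp(                                          # derived program
    pi(cons(comp(couple(hd, ident), nonmember, empty), couple(tl, ident)),
       comp(hd, null), ident),
    empty)

def seq(*items):
    """Nested-couple form of a sequence: seq(1, 2) == (1, (2, ()))."""
    out = ()
    for item in reversed(items):
        out = (item, out)
    return out

for a, b in [(seq(1, 2), seq(3, 4)), (seq(1, 2, 3), seq(3, 4)), (seq(), seq(1))]:
    print(disjoint_spec((a, b)), disjoint_fast((a, b)))
```

In this model a non-empty answer list means the two sequences are disjoint, which mirrors the statement that any answer at all signals disjointness.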

This is a significant speedup, but we should note that an arbitrary amount of speedup
is possible with the converse operator. For example with sequential, left-to-right AND, the
cost of solving A’ (parent’)'' V' is much greater than the cost of solving A" (parent") V for
a large family tree.

6.2.  Eliminating  intermediate  lists.  Wadler  describes  a  deforestation  method  for 
avoiding  intermediate  lists  [14].  He  applies  it  to  a  program  to  compute  the  sum  of  squares  of 
numbers between 1 and N. The method uses the unfold-fold method on recursive equations
and  applies  to  deterministic  programs.  Transformations  based  on  relation  algebras  extend 
these  techniques  to  nondeterministic  computations  in  a  verifiable  way. 

The  program  performs  just  three  main  steps.  The  program  constructs  a  sequence  from 
N  to  1,  squares  every  number,  and  sums  the  sequence.  The  first  interesting  observation  is 
that  all  of  these  operations  are  described  by  the  pi  operator. 

pi([id|sub1], eq0 © [ ], id) ©
pi(id, null, sqrOid) ©
pi(id, null © 1 © eq0, plus).

The realization for a Q-relation algebra requires the extra relations eq0, sub1, sqr, and plus.
These have the obvious definitions except that eq0 is just the single pair (0, 0).

The  first  two  recursions  can  be  merged,  with  proposition  1,  and  the  sqr  operation  can 
be  brought  forward,  with  proposition  2,  to  obtain 

pi([sqr|sub1], eq0 © [ ], id) © pi(id, null © 1 © eq0, plus).

Once again the two recursions can be merged to obtain the result pi([sqr|sub1], eq0, plus).
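
As a rough illustration of what the merged recursion buys, the following Python sketch contrasts the three-pass program with a single fused loop. It uses ordinary Python functions rather than the pi operator or any relation-algebra machinery, and the names sum_squares_three_pass and sum_squares_fused are hypothetical. It is only meant to show that the fused form builds no intermediate list; it assumes N is a non-negative integer.

```python
def sum_squares_three_pass(n):
    """Build the list N..1, square every element, then sum the squares."""
    countdown = list(range(n, 0, -1))      # sequence from N down to 1
    squares = [k * k for k in countdown]   # second intermediate list
    total = 0
    for value in squares:                  # sum the sequence
        total += value
    return total


def sum_squares_fused(n):
    """One pass, no intermediate lists (assumes n >= 0).

    Loosely mirrors the shape of pi([sqr|sub1], eq0, plus):
    stop when the counter is 0, otherwise square, accumulate, subtract 1.
    """
    total = 0
    while n != 0:        # eq0 plays the role of the termination test
        total += n * n   # sqr on the current value, plus to accumulate
        n -= 1           # sub1 produces the next value
    return total


if __name__ == "__main__":
    print(sum_squares_three_pass(10), sum_squares_fused(10))
```

Both functions return the same value for every non-negative N; the second one corresponds to the merged recursion in the sense that squaring and summing happen while the sequence is being generated.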

7.  Conclusion.  Programs  can  be  specified  as  proper  binary  relations  with  projections. 
Binary relations have an associated algebraic structure useful for program synthesis, verification,
and optimization. In particular, Tarski’s Q-relation algebras offer a concise notation
and  a  firm  foundation  for  transformational  programming.  The  abstract  relation  operators 
are  appropriate  for  describing  the  generic  constructs  that  often  arise  in  programming. 

This  work  includes  the  definition  of  a  general  operator  pi  that  describes  linear  recursions 
and  gives  two  broadly  applicable  equations  that  merge  recursions  and  propagate  constraints. 
These  equations  provide  an  equational  method  for  reasoning  and  scheduling  specifications 
for  computation. 

The  system  based  on  these  equations  transforms  program  specifications  so  that  the 
result  often  specifies  a  different  algorithm.  This  implementation  shows  that  we  can  in 
some  cases  build  new  programs  on  previously  constructed  ones  without  the  usual  efficiency 
penalty  from  the  combination. 

Acknowledgements.  I  am  grateful  to  Rich  Kaste  for  his  suggestions  and  attention 
to  detail.  I  also  thank  Roger  Maddux,  Jim  Lipton,  and  Anil  Nerode  for  discussions  and 
background  on  relation  algebras  and  operators.  Thanks  also  go  to  Barbara  Broome  for  her 
comments,  to  Morton  Hirschberg  for  his  support,  and  to  Jossie  Kirst  for  her  assistance. 

References. 

[1] A. Tarski & S. Givant, “A formalization of set theory without variables,” in Colloquium
Publications #41, American Mathematical Society, Providence, Rhode Island, 1987.

[2] C. S. Peirce, “On the Algebra of Logic: A Contribution to the Philosophy of Notation,”
American Journal of Mathematics VII (1885).

[3] A. Tarski, “On the calculus of relations,” The Journal of Symbolic Logic 6 (1941), 73-89.

[4] Roger D. Maddux, “Introductory course on relation algebras, finite-dimensional cylindric
algebras, and their interconnections,” 1988, Notes.

[5] John Backus, “Can Programming Be Liberated from the von Neumann Style?; A Functional
Style and Its Algebra of Programs,” Comm. ACM 21 (1978), 613-641.

[6] R. Berghammer & H. Zierer, “Relational Algebraic Semantics of Deterministic and
Nondeterministic Programs,” Theoretical Computer Science 43 (1986), 123-147.

[7] A. Mili, J. Desharnais & F. Mili, “Relational Heuristics for the Design of Deterministic
Programs,” Acta Informatica 24 (1987), 239-276.




[8] D. Chan, “Constructive Negation Based On the Completed Database,” in Logic Programming:
Proceedings of the Fifth International Conference and Symposium, R. A.
Kowalski and K. A. Bowen, ed., MIT Press, Cambridge, MA, 1988, 111-125.

[9] A. Colmerauer, “Equations and inequations on finite and infinite trees,” in FGCS’84
Proceedings, 1984, 85-99.

[10] Joxan Jaffar, Jean-Louis Lassez & John Lloyd, “Completeness of the Negation as Failure
Rule,” in Proceedings of the 1983 IJCAI, 1983, 500-506.

[11] R. M. Burstall & John Darlington, “A Transformation System for Developing Recursive
Programs,” J. Assoc. Comput. Mach. 24 (Jan., 1977), 44-67.

[12] C. J. Hogger, “Derivation of Logic Programs,” J. Assoc. Comput. Mach. 28 (Apr., 1981),
372-392.

[13] Noor Islam, Tom Myers & Paul Broome, “A Simple Optimizer for FP-like Languages,” in
Proc. ACM Conf. on Functional Programming Languages and Computer Architecture,
1981, 33-39.

[14] Philip Wadler, “Deforestation: Transforming programs to eliminate trees,” in LNCS, H.
Ganzinger, ed., #300, Springer-Verlag, Berlin, 1988, 344-358.


ABSTRACT: Evaluate a real polynomial f and its derivatives at a real number r and consider
the signs {-, 0, +} of the results. Herein lie new techniques for obtaining information from
the sequence of signs which arise. One technique assigns a (magic) number to a sign sequence.
Another assigns a path to a sign sequence.

INTRODUCTION: Say f is a degree n polynomial with real coefficients. We have the derivative
sequence D(f) = (f, f', f'', ..., f^(n)) for f. For a real number r, D(f)(r) stands for the
sequence (f(r), f'(r), f''(r), ..., f^(n)(r)) and sign D(f)(r) stands for the sign sequence

(sign f(r), sign f'(r), sign f''(r), ..., sign f^(n)(r)), where sign 0 is 0.

Such a sign sequence is said to be derived from f at r. The following will be discussed in greater detail:

1  Given two real numbers r and s, how to tell from the sign sequences sign D(f)(r) and
   sign D(f)(s) if r < s. APPROXIMATE-ANSWER: Multiply adjacent signs and dot
   product the result with (1, 2, ...) to obtain the Magic Numbers #sign D(f)(r) and
   #sign D(f)(s). Then #sign D(f)(r) < #sign D(f)(s) if and only if r < s.

2  How to form a path through a sign triangle corresponding to a sign sequence.

3  How to tell from the paths of sign D(f)(r) and sign D(f)(s) if r < s.

4  How the path condition relates to the [Coste-Roy] algorithm to determine from
   sign D(f)(r) and sign D(f)(s) if r < s.

5  How paths relate to Magic Numbers.

6  How the property of paths not crossing helps determine when sign sequences arise as
   sign D(f)(r) and sign D(f)(s), i.e. from a single polynomial and its derivatives.

7  Generalizations to non-polynomial functions f.

8  Unrealizability of certain sign sequence patterns.

Proofs,  further  details  and  additional  related  results  will  be  published  later.  Discussions  with 
and between Dexter Kozen and Jim Renegar got me interested in this area. The excellent
article [Coste-Roy] finished getting me hooked.

MAGIC NUMBERS: If S is a finite sequence of -'s, 0's and +'s we calculate the Magic
Number #S for S as follows. Say S = (s_0, s_1, s_2, ..., s_n) where each s_i ∈ {-, 0, +}. As with
counting sign changes in Sturm sequences, the number #S depends on the sign change
transitions from s_{i-1} to s_i as i varies from 1 to n. However, how far along the transition
occurs is also taken into account. #S is defined in two stages. First pS is defined as the
vector (s_0 s_1, s_1 s_2, s_2 s_3, ..., s_{n-1} s_n). For multiplication purposes, - should be considered
-1 and + should be considered +1. The definition of #S is concluded as the dot product:
#S = pS · (1, 2, 3, ..., n). Notice that the dot product weights later transitions more heavily than
early transitions.
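
The two-stage definition can be sketched in a few lines of Python. This is a minimal illustration, not part of the original paper: the function names magic_number and sign_sequence_x2_minus_1 are hypothetical, and sign sequences are represented as strings over the characters '-', '0', '+'.

```python
SIGN_VALUE = {"-": -1, "0": 0, "+": 1}


def magic_number(signs):
    """Magic Number #S of a sign sequence given as a string such as '+-+'.

    Stage 1: form the adjacent products s_{i-1} * s_i for i = 1..n.
    Stage 2: dot that vector with (1, 2, ..., n).
    """
    s = [SIGN_VALUE[c] for c in signs]
    return sum(i * s[i - 1] * s[i] for i in range(1, len(s)))


def sign_sequence_x2_minus_1(r):
    """Sign sequence of (f, f', f'') at r for the polynomial f = X^2 - 1."""
    values = (r * r - 1, 2 * r, 2)
    # (v > 0) - (v < 0) is -1, 0 or +1; shift by one to index into '-0+'.
    return "".join("-0+"[(v > 0) - (v < 0) + 1] for v in values)


if __name__ == "__main__":
    for r in (-2, -1, -0.5, 0, 0.5, 1, 2):
        seq = sign_sequence_x2_minus_1(r)
        print(r, seq, magic_number(seq))
```

Evaluated at one sample point from each range of r for f = X^2 - 1, the Magic Numbers increase from -3 to +3 in steps of one.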

EXAMPLE: Suppose S(r) = sign D(f)(r) where f is the polynomial X^2 - 1. The sign sequences
derived from f at r and their Magic Numbers are given by:


range of r      sign D(f)(r)    #sign D(f)(r)

r < -1          (+, -, +)       -3
r = -1          (0, -, +)       -2
-1 < r < 0      (-, -, +)       -1
r = 0           (-, 0, +)        0
0 < r < +1      (-, +, +)       +1
r = +1          (0, +, +)       +2
+1 < r          (+, +, +)       +3

The reader might consider what sign sequences are derived from f = X^2 and f = X^2 + 1.

EXAMPLE: The last sign. For a polynomial f with positive leading coefficient, all sign sequences
derived from f have + as their last (rightmost) sign. Similarly, all sign sequences
derived from polynomials with negative leading coefficient have - as their last sign.

EXAMPLE: Extreme left and right. For a polynomial f let sign D(f)(-∞) denote the unique
sign sequence sign D(f)(r) for r << 0 and let sign D(f)(+∞) denote the unique sign sequence
sign D(f)(r) for r >> 0. sign D(f)(-∞) consists of alternating - and + signs. Hence every
transition is (+,-) or (-,+) and #sign D(f)(-∞) reduces to the dot product

(-1, -1, ..., -1) · (1, 2, ..., degree f). Thus #sign D(f)(-∞) = -(degree f)(1 + degree f)/2.

sign D(f)(+∞) consists of all + signs and #sign D(f)(+∞) = (degree f)(1 + degree f)/2.

Suppose r and s are real numbers where neither f nor any of its derivatives have a zero between
r and s. Then f and all its derivatives have the same sign at r and s, i.e. sign D(f)(r) =
sign D(f)(s) and so #sign D(f)(r) = #sign D(f)(s). This shows why the hypothesis about r and
s being separated by a zero of f or one of its derivatives is needed in the following theorem. It
is also what was missing from the APPROXIMATE ANSWER at (1).

9  THEOREM:  Suppose  f  is  a  real  polynomial  and  r  <  s  are  real  numbers  where  f  or  one  of  its 
derivatives has a zero in the closed interval [r, s]. Then #sign D(f)(r) < #sign D(f)(s).

The  easy  proof  will  be  published  later.  Impatient  readers  are  encouraged  to  prove  it  on  their 
own.  The  theorem  has  a  number  of  immediate  corollaries.  For  flavor,  here  are  three: 

*  If  r  <  s  and  f  or  one  of  its  derivatives  has  a  zero  in  the  closed  interval  [r,  s]  then 
#sign  D(f)(r)  cannot  equal  #sign  D(f)(s)  . 

*  There is no real polynomial f and real numbers r and s where sign D(f)(r) does not
equal sign D(f)(s) but #sign D(f)(r) = #sign D(f)(s). (Because "sign D(f)(r) does
not equal sign D(f)(s)" implies that f or one of its derivatives must have a zero in
[r,s] or [s,r], depending on whether r < s or s < r, and the previous corollary applies.)

*  If r < s and f or one of its derivatives has a zero in the closed interval [r, s] then
sign D(f)(r) cannot equal sign D(f)(s). (This special case of Thom’s theorem follows
immediately from the first corollary.)

10 DERIVABILITY: One aspect of the corollaries is to describe behavior which cannot occur
among sign sequences which arise as sign D(f)(r) and sign D(f)(s). For example, the second
corollary shows that for the sign sequences (0, 0, +) and (-, 0, +), which are distinct but both
have Magic Number 0, there is no quadratic real polynomial f and real numbers r and s where
sign D(f)(r) = (0, 0, +) and sign D(f)(s) = (-, 0, +). An interesting problem is to develop
necessary and sufficient compatibility conditions for m sign sequences to arise as
sign D(f)(r_1), ..., sign D(f)(r_m) from a single polynomial f and
its derivatives evaluated at m points. Sets of sign sequences which arise in this fashion are
called derivable.3 We have more to say about derivability in the next section.

SIGN TRIANGLES AND PATHS: To simplify the discussion, we confine ourselves to sign sequences
without zeros. These will give rise to piecewise linear paths in a sign triangle.4 The
sign triangle and the path corresponding to a sign sequence (or rather, sign column) are illustrated
below. Start at the bottom and draw a path passing through the signs coming from the sign
column:


11 EXAMPLE

[Figure: a sign column and the corresponding sign triangle with its path.]


3 They are derived from the polynomial f at the points r_1, ..., r_m.

4 Had zeros been present, the paths would have "width". In fact, when zeros are present,
the paths tend to look like the wakes of ships and are called wakes instead of
paths.

The illustration not only assumed that the sign column had no zeros but also that the bottom
sign was a +. Had the bottom sign been a -, use a sign triangle with +’s and -’s interchanged.5
We are interested in paths arising from sign columns which are the transpose of
sign sequences sign D(f)(r). Such a path is simply called the path of sign D(f)(r), glossing
over the transpose. Modulo the simplifying assumption of sign sequences not containing zero
we have:

12 THEOREM: Suppose f is a real polynomial and r < s are real numbers where f or one of
its derivatives has a zero in the closed interval [r, s]. The path of sign D(f)(r) does not cross
the path of sign D(f)(s) and at some row of the sign triangle the path of sign D(f)(r) lies strictly
to the left of the path of sign D(f)(s).6

EXPLANATION: What does "paths not crossing" mean? All paths touch at the bottom of a
sign triangle and may overlap. We say that two paths cross if at some row Path1 is strictly to
the left of Path2 and at another row Path1 is strictly to the right of Path2. It is easy to produce
examples of a real polynomial f together with real numbers r and s where the path of
sign D(f)(r) and the path of sign D(f)(s) touch and diverge at several places.
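
The path construction and the crossing test can be sketched as follows. This is a minimal illustration under stated assumptions, not the author's implementation: the names path and paths_cross are hypothetical, sign sequences are zero-free strings over '+' and '-', both sequences are assumed to end in the same sign (so that they live in the same triangle), and horizontal positions are numbered so that a step up and to the right adds 1 and a step up and to the left subtracts 1. The step rule is the one stated in footnote 5: right when the sign stays the same, left when it changes, starting from the last sign of the sequence at the bottom vertex.

```python
def path(signs):
    """Horizontal positions of the path of a zero-free sign sequence, bottom row first."""
    if "0" in signs:
        raise ValueError("this sketch handles zero-free sign sequences only")
    bottom_up = signs[::-1]          # last sign (bottom vertex) first, first sign last
    positions = [0]                  # every path starts at the bottom vertex
    for lower, upper in zip(bottom_up, bottom_up[1:]):
        step = 1 if lower == upper else -1   # same sign: right; sign change: left
        positions.append(positions[-1] + step)
    return positions


def paths_cross(signs_a, signs_b):
    """True if path a is strictly left of path b at one row and strictly right at another."""
    pa, pb = path(signs_a), path(signs_b)
    a_left = any(x < y for x, y in zip(pa, pb))
    a_right = any(x > y for x, y in zip(pa, pb))
    return a_left and a_right


if __name__ == "__main__":
    # The four zero-free sign sequences derived from X^2 - 1.
    for seq in ("+-+", "--+", "-++", "+++"):
        print(seq, path(seq))
```

For the four zero-free sequences derived from X^2 - 1 no pair of paths crosses, in agreement with (12).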

EXAMPLE: Here are the paths of sign D(X^2 - 1)(r) for r < -1, -1 < r < 0, 0 < r < 1, 1 < r:


(12) has stronger but similar corollaries to (9). For a while it was hoped that "paths not crossing"
gives a necessary and sufficient derivability condition (10). This is true for sign sequences
of length four or less, i.e. coming from polynomials of degree three or less. Bruce Anderson
produced an example of a set of sign sequences of length five whose paths do not cross but
are not derived from any degree four real polynomial. He is working on an example of sign sequences
of length six whose paths do not cross and yet are not derived from any real valued
function together with five successive derivatives. Carl de Boor has suggested the name
"higher-order Rolle's theorem" for certain theorems which describe collections of sign sequences
whose paths do not cross - i.e. are allowed by Rolle's Theorem - but are not derivable.


5 Actually, the +’s and -’s in the triangle are not really needed to construct the path
once one observes that the rule is: diagonally up to the right when the sign stays the
same and diagonally up to the left when the sign changes.

6 Carl de Boor views this as a graphic way to express Rolle’s theorem.


The first algorithm to determine the relative size of r and s from the sign sequences sign D(f)(r)
and sign D(f)(s) appears in [Coste-Roy]. Translated into our path-language their algorithm7
can be described as follows:

All paths touch at the bottom vertex of the sign triangle and never cross.
To determine which of two paths lies to the left of the other:

1. Start from the bottom of both paths and proceed upward row by row of the sign
   triangle.

2. At the row where one path diverges to the left, that path corresponds to the
   smaller of r and s.
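
The two steps above can be sketched in Python. This is an illustrative reading of the path-language description, not code from [Coste-Roy]: the function name compare_by_paths is hypothetical, and the sketch assumes two zero-free sign sequences of equal length, ending in the same sign, that are derived from the same polynomial. A path steps left when the sign changes from one row to the next, so at the first row (counting from the bottom) where the two sequences differ, the sequence whose sign changed is the one that diverges to the left.

```python
def compare_by_paths(signs_r, signs_s):
    """Return -1 if r < s, +1 if r > s, and 0 if the sign sequences are identical.

    Assumes both strings are zero-free, of equal length, share their last sign,
    and come from the same polynomial (so their paths never cross).
    """
    if len(signs_r) != len(signs_s) or signs_r[-1] != signs_s[-1]:
        raise ValueError("sequences must have equal length and the same last sign")
    if "0" in signs_r or "0" in signs_s:
        raise ValueError("this sketch handles zero-free sign sequences only")
    # Step 1: walk upward from the bottom row (the last sign) one row at a time.
    for i in range(len(signs_r) - 2, -1, -1):
        if signs_r[i] != signs_s[i]:
            # Step 2: the rows below agree, so the sign at i + 1 is shared.
            # The sequence whose sign changes here steps left, hence is smaller.
            r_steps_left = signs_r[i] != signs_r[i + 1]
            return -1 if r_steps_left else 1
    return 0


if __name__ == "__main__":
    # Sequences derived from X^2 - 1 for r < -1 and for -1 < s < 0.
    print(compare_by_paths("+-+", "--+"))
```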

GENERIC BEHAVIOR: Conceptually one may think of zeros between the -’s and +’s in each
row of the sign triangle. If rows of sign triangles are numbered from bottom to top the rows are
the generic sign behavior of polynomial functions - with positive leading coefficient - of the
corresponding degree:

13 EXAMPLE

row    generic behavior of function

5      - 0 + 0 - 0 + 0 - 0 +
4        + 0 - 0 + 0 - 0 +
3          - 0 + 0 - 0 +
2            + 0 - 0 +
1              - 0 +
0                +

Not only are we seeing the generic sign behavior of functions of each degree, assuming each
function is the derivative of the one above, the signs are appropriately located left to right. For
functions or derivatives with multiple roots, it is useful to collapse parts of rows of the sign
triangle corresponding to the multiple roots.

SURPRISING APPLICATION OF PATHS: One may sometimes determine relative magnitude8
of functional values from sign sequences. This is true (on the rare occasions) when the paths
of two distinct sign sequences coincide in the top row of the sign triangle and one row below.9
For example, suppose f is a real polynomial, r and s are real numbers and:

sign D(f)(r) = (+, +, +, +, +, +, +, -, +)    sign D(f)(s) = (+, +, -, +, +, +, +, +, +)


7 In the special case where neither sign sequence has a zero.

8 We ignore the trivial case where two sign sequences begin with different signs.

9 Presumably this is part of a more encompassing theory or technique. Unfortunately,
the big picture is a mystery.

From paths or Magic Numbers we see that r lies to the left of s. Since both sign sequences
begin with +, we see that both f(r) and f(s) are positive. Surprisingly, one can conclude from
the sign sequences that f(r) < f(s). The reasoning goes as follows.

Since sign D(f)(r) contains no zeros, one may move r a small amount to the right or left
and keep the same sign sequence but vary the height of f(r) slightly. Thus we may assume
that f(r) does not equal f(s). Suppose f(r) > f(s). Choose a constant c where
f(r) > c > f(s) and let g = f - c. The successive derivatives of g are the same as the successive
derivatives of f. By choice of c, g(r) > 0 and g(s) < 0. Thus

sign D(g)(r) = (+, +, +, +, +, +, +, -, +)    sign D(g)(s) = (-, +, -, +, +, +, +, +, +)

I.e. the sign sequences for f and g agree except the initial + in sign D(f)(s) has been
changed to a -. This is a contradiction, because the paths of sign D(g)(r) and
sign D(g)(s) cross. Hence, f(r) < f(s).
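
The two facts used in this argument (the paths for f coincide in the top two rows, and the paths for g cross) can be checked mechanically. The sketch below is illustrative only: the helper name path and the position numbering (+1 for a step up-right when the sign stays the same, -1 for a step up-left when it changes, starting at the last sign) are assumptions made for the example.

```python
from itertools import accumulate


def path(signs):
    """Positions of the path of a zero-free sign string, bottom row first."""
    bottom_up = signs[::-1]
    steps = [1 if a == b else -1 for a, b in zip(bottom_up, bottom_up[1:])]
    return [0] + list(accumulate(steps))


f_r = "+++++++-+"        # sign D(f)(r)
f_s = "++-++++++"        # sign D(f)(s)
g_r = f_r                # g = f - c keeps g(r) > 0
g_s = "-" + f_s[1:]      # only the first sign at s changes, since g(s) < 0

# The paths for f agree in the top row and one row below.
top_rows_coincide = path(f_r)[-2:] == path(f_s)[-2:]

# The paths for g: r is strictly left at some row and strictly right at another.
pairs = list(zip(path(g_r), path(g_s)))
g_paths_cross = any(a < b for a, b in pairs) and any(a > b for a, b in pairs)

if __name__ == "__main__":
    print(path(f_r), path(f_s), path(g_s))
    print(top_rows_coincide, g_paths_cross)
```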

By considering similar but much lower degree examples, one can easily reason - without using
paths - that f(r) < f(s) or vice versa. Presumably the same type of reasoning on a larger scale
could be applied to the example above.

PATHS  and  MAGIC  NUMBERS:  The  Magic  Number  results  may  be  proved  directly  or  from 
sign  triangles  by  introducing  numbers  to  accompany  the  signs  within  the  sign  triangle.  A  path 
will  give  rise  to  a  column  or  vector  of  numbers  which  allows  one  to  translate  geometric  path 
results  into  Magic  Number  results.  The  key  idea  is  numbering  sign  triangles  as  indicated  in: 


-:-4  +:-2  -:0   +:2   -:4
   +:-3  -:-1  +:1   -:3
+:-4  -:-2  +:0   -:2   +:4
   -:-3  +:-1  -:1   +:3
      +:-2  -:0   +:2
         -:-1  +:1
            +:0

BEYOND  POLYNOMIALS:  Lawrence  D.  Brown  has  shown  sign  sequence  and  path  results 
may  be  applied  to  non-polynomial  functions.  See  the  examples  below.  This  led  to  an  abstract 
formulation  of  the  functions  which  may  be  used  in  place  of  successive  derivatives  and  opened 
up  the  field  to  rational  functions,  fractional  exponents  and  other  functions.  Here  is  an  informal 
sketch. 

Instead of the sequence of functions (f, f', f'', ...), consider a sequence of real functions
(f_0, f_1, ..., f_t) which are continuous on an open interval (a,b) where:

14 DEFINITION:

1. Each f_i only has a finite number of zeros in (a,b).

2. f_t has no zeros in (a,b).

3. For p in (a,b), if f_i(p) = 0 then

   a. f_i(p+) and f_{i+1}(p+) have the same sign and

   b. f_i(p-) and f_{i+1}(p-) have opposite signs.

If you wish to try an example, draw a random f_t with no zeros in (a,b). You will see that f_{t-1}
can have at most one zero in (a,b). By induction, f_{t-i} can have at most i zeros on (a,b).

For p in (a,b) the sign sequence at p is (sign f_0(p), sign f_1(p), ..., sign f_t(p)) and is denoted
SS(p). Let S be the union of all the zeros of the f_i’s in (a,b). S is finite and splits up (a,b) into
open intervals. For r and s in the same open interval: SS(r) = SS(s), as before. In fact, when
you substitute SS(r) for sign D(f)(r) and #SS(r) for #sign D(f)(r), the theory - as pertains to
relative magnitude of r and s - is the same Magic Number-wise and path through sign triangle-wise.
For example, paths do not cross and for r and s which are not in the same open interval:
r < s if and only if the path of SS(r) is to the left of the path of SS(s).

15 EXAMPLE: Let f_0 = f / g where f is a degree n polynomial and g is continuous. Let (a,b)
be an open interval where g has no roots. Assume g is positive on (a,b). The sequence of
functions f_0 = f/g, f_1 = f', f_2 = f'', ..., f_n = f^(n) satisfies (14).

16 EXAMPLE: On the interval (0, ∞) the sequence f_0 = X^(1/2) - X^(1/3), f_1 = 3X^(1/3) - 2X^(1/6),
f_2 = 6X^(1/6) - 2, f_3 = 6 satisfies (14).

The key to (16) is the following. Use "o" for composition of functions. Suppose f_0 = f o g where
f is a degree n polynomial and g is continuous and increasing on (a,b). The sequence of functions
is: f_0 = f o g, f_1 = f' o g, f_2 = f'' o g, ..., f_n = f^(n) o g. Using f = Z^3 - Z^2 and g = X^(1/6) gives
(16).
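
As a numerical sanity check of (16), the following Python sketch evaluates the four functions at one sample point in each of the open intervals cut out by their zeros and computes the Magic Number of each sign sequence. The function names ss and magic_number and the choice of sample points are assumptions made for this illustration. The zeros lie at X = (1/3)^6, (2/3)^6 and 1, so the samples are taken at X = Z^6 for Z = 0.2, 0.5, 0.8 and 1.5.

```python
def sign_char(x):
    """Map a real number to '+', '-' or '0'."""
    return "+" if x > 0 else "-" if x < 0 else "0"


def ss(x):
    """Sign sequence (sign f_0, sign f_1, sign f_2, sign f_3) of Example 16 at x > 0."""
    values = (
        x ** (1 / 2) - x ** (1 / 3),
        3 * x ** (1 / 3) - 2 * x ** (1 / 6),
        6 * x ** (1 / 6) - 2,
        6.0,
    )
    return "".join(sign_char(v) for v in values)


def magic_number(signs):
    """Dot product of the adjacent sign products with (1, 2, ..., n)."""
    s = [{"-": -1, "0": 0, "+": 1}[c] for c in signs]
    return sum(i * s[i - 1] * s[i] for i in range(1, len(s)))


if __name__ == "__main__":
    for z in (0.2, 0.5, 0.8, 1.5):
        x = z ** 6
        print(x, ss(x), magic_number(ss(x)))
```

The Magic Numbers increase with X across the four intervals, as the generalized theory predicts.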

FINAL  REMARKS:  The  following  issues  are  under  consideration. 

Generalization of the above work to complex polynomials and generalization to functions from
R^n to R^n. In the complex case, argument replaces sign.

Certain  functional  transformations  do  not  change  the  set  of  sign  sequences  derived  from  a 
polynomial.  More  specifically,  suppose  f(X)  is  a  real  polynomial.  Replace  f  by 
g(X)  =  af(  bX  -  c  )  for  real  numbers  a,  b  and  c  with  a,  b  >  0  .  Since  sign  D(g)(  (p+c)/b  )  = 
sign  D(f)(p) ,  the  same  sign  sequences  are  derived  from  f  and  g  but  with  different  transition 
points from one sign sequence to the next.10 What other functional transformations preserve
sign sequences in this sense? The full set of such transformations forms a group under
composition.
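
The identity sign D(g)((p+c)/b) = sign D(f)(p) follows from the chain rule. A short worked derivation, written in LaTeX for clarity, with k ranging over the orders of differentiation:

```latex
\[
g(X) = a\,f(bX - c)
\quad\Longrightarrow\quad
g^{(k)}(X) = a\,b^{k}\,f^{(k)}(bX - c), \qquad k = 0, 1, \dots, n,
\]
\[
g^{(k)}\!\left(\tfrac{p + c}{b}\right) = a\,b^{k}\,f^{(k)}(p),
\qquad\text{and since } a\,b^{k} > 0:\quad
\operatorname{sign} g^{(k)}\!\left(\tfrac{p + c}{b}\right) = \operatorname{sign} f^{(k)}(p).
\]
```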

10 The Magic Numbers of the sign sequences show that if the same sign sequences
are derived from f and g, they occur in the same order.

It would be interesting to find normal form representatives for sets of sign sequences which
can be derived from a function. Here one is looking for a collection of polynomials (the h_f’s)
where for each polynomial f there is an h_f such that the same sign sequences are derived from
f and h_f. Moreover if the same sign sequences are derived from f and g then h_f = h_g. h_f is
the normal form of (the sign sequences derived from) f. One wants an algorithm to find h_f from
f or from the set of sign sequences derived from f.

In the formulation of the Magic Number, (1, 2, 3, ..., n) is dotted with pS. If (1, 2, 3, ..., n) is
replaced by a different increasing sequence of positive numbers, #S is changed, but (9) still
holds. This easily comes out of the numbered sign triangles mentioned at (PATHS and
MAGIC NUMBERS) above. Given a derivable set of sign sequences, can one derive the set from
a polynomial f and find an increasing sequence of positive numbers to replace (1, 2, 3, ..., n) so
that each sign sequence S in the set is derived from f at #S, i.e.: S = sign D(f)(#S)?

REFERENCE: M. Coste and M. F. Roy (1988), Thom’s lemma, the coding of real algebraic numbers
and the computation of the topology of semi-algebraic sets, J. Sym. Comp. 5, 121-129.

GEOMETRIC MODELS FOR DEVELOPABLE AND MINIMAL SURFACES


T.  F.  Chen,  G.  J.  Fix,  R.  Kannan 
Department  of  Mathematics 
University  of  Texas  at  Arlington 
Arlington,  Texas  76019 


Abstract. 

Computational  geometry  is  a  relatively  new  and  unusual  subject  in  the  sense  that  its 
orientation  is  quite  applied  in  nature,  yet  it  makes  heavy  use  of  pure  mathematics,  notably  algebraic 
and  differential  geometry.  The  technological  importance  of  geometric  models  has  increased  in  recent 
years.  Indeed,  the  major  bottleneck  in  developing  effective  computer  aided  design  (CAD)  software  has 
centered  around  geometry  and  a  range  of  issues  associated  with  computer  vision.  In  addition, 
computational  geometry  is  finding  wide  applications  in  the  computer  graphics  industry.  In  this  paper 
we  summarize  the  results  for  selected  problems  in  this  area  associated  with  developable  and  minimal 
surfaces. 


I.  The  Two  Curve  Problem  for  Developable  Surfaces. 

Developable surfaces are defined as regular surfaces with zero Gaussian curvature ([1]-[2]). As
such  they  are  locally  isometric  to  planar  regions;  i.e.,  they  can  be  obtained  from  planes  by  bending 
(which  preserves  arc  length  and  angles).  Their  technological  importance  arises  from  this  property. 
Indeed,  more  surfaces  constructed  using  composite  materials  will  fall  into  this  category  ([3]). 

A problem of interest in design is the following. Given two spatial curves α(·), β(·), find a
developable surface S connecting these curves. We shall consider solutions within a subclass of all
developable surfaces. In particular, we shall consider surfaces S which consist exclusively of parabolic
points, i.e., surfaces where only one of the two principal curvatures vanishes.

It is known ([1], [2], [4]) that such surfaces are ruled, and admit a parameterization of the form

(1)  x(u, v) = α(u) + v w(u)

as u, v vary over open sets of the reals R. For a surface of the form (1) to be developable (i.e., have
zero Gaussian curvature) it is necessary and sufficient that the tangent α' to the generating curve α,
the tangent w' to the rulings w(·), and the ruling itself be coplanar. This condition can be written

(2)  [α'(u), w(u), w'(u)] = 0,

where [·, ·, ·] is the standard box product. The two-curve problem can be stated in this context as
seeking a function v(u) for which

(3)  β(v(u)) = α(u) + v(u) w(u)


as  u  varies  over  its  parameter  range. 

Weiss and Furtner [5] have proposed an interactive searching algorithm to solve this problem.
Their idea was to construct the developable by rulings (lines). The criterion is to connect a point
α_0 = α(u_0) on the α-curve to a point β_0 = β(v_0) on the β-curve if the two tangents β'_0 = β'(v_0),
α'_0 = α'(u_0) and the displacement vector β_0 - α_0 are coplanar; i.e.,

(4)  [β'_0, α'_0, β_0 - α_0] = 0.


Using  a  graphics  terminal,  one  traces  say  the  a-curve  searching  at  each  step  for  an  appropriate 
connection  to  the  /3-curve. 

There are cases where (4) may not hold for any point pair α(u_0), β(v_0). This is for example the
case when α(·) is a circle and β(·) is a line through the center of the circle. This example also shows
that the two-curve problem itself may not have a solution. In other cases, (4) may be very hard to
verify at a large number of points.
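
The coplanarity test and the circle-and-line counterexample can be illustrated with a short Python sketch. Several choices here are assumptions made for the example and are not taken from the paper: the function names, the unit circle in the xy-plane for α, and the reading of "a line through the center" as the axis perpendicular to the plane of the circle (a line lying in that plane would be coplanar with everything). The residual is the box product of the two tangents and the displacement vector, which must vanish for a ruling to be admissible.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def box(a, b, c):
    """Box (scalar triple) product [a, b, c] = a . (b x c)."""
    bc = cross(b, c)
    return a[0] * bc[0] + a[1] * bc[1] + a[2] * bc[2]


def alpha(u):            # assumed example: unit circle in the xy-plane
    return (math.cos(u), math.sin(u), 0.0)


def alpha_prime(u):
    return (-math.sin(u), math.cos(u), 0.0)


def beta(v):             # assumed example: the axis through the circle's center
    return (0.0, 0.0, v)


def beta_prime(v):
    return (0.0, 0.0, 1.0)


def coplanarity_residual(u, v):
    """Box product of beta'(v), alpha'(u) and beta(v) - alpha(u); zero means coplanar."""
    displacement = tuple(b - a for a, b in zip(alpha(u), beta(v)))
    return box(beta_prime(v), alpha_prime(u), displacement)


if __name__ == "__main__":
    for u in (0.0, 1.0, 2.5):
        for v in (-1.0, 0.0, 3.0):
            print(u, v, coplanarity_residual(u, v))
```

For this configuration the residual equals sin^2 u + cos^2 u = 1 at every point pair, so the coplanarity condition never holds and no connecting ruling exists.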

To develop an alternative to this approach a dynamical algorithm was developed in reference
[6] for the function v(u). In particular, it was shown that this function satisfies the nonlinear ordinary
differential equation


where β and its derivatives are evaluated at v(u) and α and its derivatives are evaluated at u. From
this system a number of conclusions can be drawn. First, a condition like (4) is needed at least at one
point pair α_0, β_0. This serves as the initial condition for the system. If (4) holds nowhere, then
obviously a connecting developable does not exist, as the line and circle example cited above indicates.
Also, since (5) is nonlinear, local existence and uniqueness is assured only if (4) holds at some point.


To illustrate the algorithm for a few cases consider first the problem of connecting a circle and
an ellipse α, β with a developable. In this case u is taken as the arc length of α, and an elementary
analysis shows that a global solution v(u) exists for (4), (5) (u_0 = 0), and that

(6)  *(L)  =  0(0), 

where $L$ is the total arc length of $\alpha$. To construct the associated developable surface, the ordinary
differential equation (5) was integrated from $t = 0$ to $t = L$ with a fourth-order explicit Runge-Kutta
rule. The results are shown in Figure 1.
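
The integration step can be sketched as follows. This is a minimal illustration, not the authors' code:
it assumes that (4) is the scalar triple product condition and that $v'(u)$ is obtained by differentiating
it, and it uses a hypothetical pair of curves (a unit circle in the plane z = 0 and an ellipse with
semi-axes 2 and 1 in the plane z = 1). All function names are illustrative.

```python
import math

def triple(a, b, c):
    """Scalar triple product a . (b x c) of three 3-vectors."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            + a[1] * (b[2] * c[0] - b[0] * c[2])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

# Hypothetical test curves: unit circle (u is its arc length) and an ellipse one unit above it.
def alpha(u):
    return (math.cos(u), math.sin(u), 0.0)

def d_alpha(u):
    return (-math.sin(u), math.cos(u), 0.0)

def dd_alpha(u):
    return (-math.cos(u), -math.sin(u), 0.0)

def beta(v):
    return (2.0 * math.cos(v), math.sin(v), 1.0)

def d_beta(v):
    return (-2.0 * math.sin(v), math.cos(v), 0.0)

def dd_beta(v):
    return (-2.0 * math.cos(v), -math.sin(v), 0.0)

def displacement(u, v):
    return tuple(b - a for a, b in zip(alpha(u), beta(v)))

def residual(u, v):
    """Assumed form of condition (4): zero when the ruling from alpha(u) to beta(v) is admissible."""
    return triple(d_alpha(u), d_beta(v), displacement(u, v))

def rhs(u, v):
    """Assumed form of (5): v'(u) from differentiating the triple product condition."""
    d = displacement(u, v)
    return -triple(dd_alpha(u), d_beta(v), d) / triple(d_alpha(u), dd_beta(v), d)

def rk4(u0, v0, u_end, steps):
    """Classical fourth-order Runge-Kutta; returns v(u_end) and the largest residual seen."""
    h = (u_end - u0) / steps
    u, v = u0, v0
    worst = abs(residual(u, v))
    for _ in range(steps):
        k1 = rhs(u, v)
        k2 = rhs(u + h / 2, v + h * k1 / 2)
        k3 = rhs(u + h / 2, v + h * k2 / 2)
        k4 = rhs(u + h, v + h * k3)
        v += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        u += h
        worst = max(worst, abs(residual(u, v)))
    return v, worst

L = 2.0 * math.pi                      # total arc length of the unit circle
v_end, worst = rk4(0.0, 0.0, L, 2000)  # (u0, v0) = (0, 0) satisfies the condition
```

For these two curves the rulings join points with parallel tangents, so after one trip around the circle
the ellipse parameter should also have advanced by one full period, and the residual should stay near zero.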


FIGURE  1 

The next example consists of a circle $\alpha$ and a three-leaf curve $\beta$. Again a periodic solution $v(u)$
is obtained, with the period equal to the total arc length of the circle $\alpha$. The results are shown in
Figure 2.


FIGURE  2 


The developable joining $\alpha$ and $\beta$ may not in general connect each point of $\beta$ to a point on $\alpha$.
In this case the global solution $v(u)$ does not take on the full parameter range for $\beta(v)$. Such a
situation is shown in Figure 3.


FIGURE 3


A finite-time blow-up in (5) could occur. This will happen, for example, at an inflection point
$\beta(v(u_1))$ of the $\beta$-curve (i.e., $\ddot{\beta} = 0$ at this point). If $\alpha(u_1)$ is not an inflection point of the
$\alpha$-curve, then the dynamical system (5) can be reformulated with $u$ as a function of $v$ and continued
past the inflection point in the $\beta$-curve.

Current research deals with a rational treatment of singularities which occur in either the
$\alpha$-curve or the $\beta$-curve. Such situations are common in practice and correspond to edges or corners of
the object to be designed. In reference [6] preliminary results using conic lofting techniques (see also
[7]-[8]) are presented.


II. Grid Refinement and Nonlinear SOR Techniques Applied to the Minimal Surface Problem.

Numerical methods for the Plateau problem have relied either on a combination of variational and
finite difference methods or on a finite element method, with appropriate restrictions on the existence
of double points when the surface is projected on a plane. In all of these approaches the basic idea is
to take advantage of the strict convexity of the associated variational formulation in solving the
discrete problem. Some references to these ideas may be seen in [1-4]. However, one of the basic
problems that remains to be formally studied is the proper choice of the iterative process used to solve
the discrete system of equations and/or the proper selection of the grid points. As an example, a common
feature of all the papers referred to above is the use of an SOR-type iterative process with an
arbitrary choice of the relaxation parameter. This leads to a trial-and-error approach to choosing a
relaxation parameter for the entire iterative process. The second remark concerning the above-mentioned
papers is that not much emphasis is placed on locating the grid points. We have addressed both of these
aspects in this paper, with considerable success.

In order to illustrate the ideas we restrict ourselves to a typical example studied by most of the
authors, namely, the catenoid. Thus let $Y = \frac{1}{a}\cosh[a(x - c)]$, $0 \le x \le 1$. The constants $a$ and $c$
are to be determined by the associated boundary conditions. If one rotates the graph of $Y$ we
get a catenoid. Let us set $Y(0) = Y(1) = A$, where $A$ is a constant. Then $c = 0.5$ and the constant
$a$ is determined by the equation $\cosh(a/2) = aA$. The critical value of $A$, below which this equation
has no real solutions, is 0.75444. In fact, one can see that as $A$ goes from 0.7 to
0.755 in the approach of [2], the number of iterations goes from 462 to 3448.
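
The constants quoted above can be checked numerically. The sketch below is an illustration only: it
solves $\cosh(a/2) = aA$ by bisection, taking the smaller of the two roots (an assumption that matches the
tabulated exact values), and locates the critical $A$ from the condition $t \tanh t = 1$ with $t = a/2$.
The helper names `bisect`, `critical_A`, `catenoid_a` and `exact_y` are hypothetical.

```python
import math

def bisect(f, lo, hi, tol=1e-14):
    """Plain bisection; assumes f(lo) and f(hi) have opposite signs."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# With t = a/2 the boundary condition reads A = cosh(t) / (2 t).
# That expression is smallest where t * tanh(t) = 1.
T_CRIT = bisect(lambda t: t * math.tanh(t) - 1.0, 1.0, 1.5)

def critical_A():
    """Smallest boundary value A for which cosh(a/2) = a*A has a real solution."""
    return math.cosh(T_CRIT) / (2.0 * T_CRIT)

def catenoid_a(A):
    """Smaller root a of cosh(a/2) = a*A (requires A above the critical value)."""
    if A <= critical_A():
        raise ValueError("no catenoid for this boundary value")
    t = bisect(lambda t: math.cosh(t) - 2.0 * t * A, 1e-9, T_CRIT)
    return 2.0 * t

def exact_y(x, A, c=0.5):
    """Exact profile Y(x) = cosh(a (x - c)) / a for boundary data Y(0) = Y(1) = A."""
    a = catenoid_a(A)
    return math.cosh(a * (x - c)) / a

print(round(critical_A(), 5))
print(round(exact_y(0.5, 1.0), 5))
```

With $A = 1$ this reproduces the "exact" row of Table 1 to the printed precision.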

Before we present some of the numerical results we outline the iterative process used. Thus, if
$\phi: R^N \to R$ is the strictly convex functional corresponding to the discrete problem, the iterative
sequence is defined by:

Given $\{x^j\}$, $j = 1, 2, \ldots, k$, let $\tilde{x}^k$ be such that

i)   $\phi(\tilde{x}^k) = \phi(x^k)$,

ii)  $\tilde{x}^k_j = x^k_j$ for $j \ne i_k$, with $i_k$ any integer between 1 and $N$,

iii) $\tilde{x}^k_{i_k} \ne x^k_{i_k}$ unless $\phi_{i_k}(x^k) = 0$.

Then set

$x^{k+1} = \ldots$

where $r_k \in (0, 1)$.
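
As a rough illustration of an SOR-type coordinate relaxation on the catenoid example, the sketch below
minimizes a discrete area functional for the surface of revolution by relaxed one-dimensional Newton
steps, one unknown at a time. It is not the authors' iteration: the discretization (piecewise-linear
profile on a uniform grid), the Newton step, the stopping rule, and the names `area_derivatives` and
`relax` are all assumptions; only the relaxation parameter 0.85 is taken from the tables.

```python
import math

def area_derivatives(y, i, h):
    """First and second derivative, with respect to y[i], of the discrete area
    sum over intervals of (y[j] + y[j+1]) / 2 * sqrt(h^2 + (y[j+1] - y[j])^2)
    (the constant factor 2*pi is dropped)."""
    dl, dr = y[i] - y[i - 1], y[i + 1] - y[i]
    sl, sr = math.hypot(h, dl), math.hypot(h, dr)
    ml, mr = 0.5 * (y[i - 1] + y[i]), 0.5 * (y[i] + y[i + 1])
    first = 0.5 * sl + ml * dl / sl + 0.5 * sr - mr * dr / sr
    second = dl / sl + ml * h * h / sl**3 - dr / sr + mr * h * h / sr**3
    return first, second

def relax(A, n=10, r=0.85, tol=1e-10, max_sweeps=5000):
    """Sweep the interior unknowns, applying a Newton step scaled by r to each in turn."""
    h = 1.0 / n
    y = [A] * (n + 1)          # initial guess: constant profile matching the boundary data
    sweep = 0
    for sweep in range(1, max_sweeps + 1):
        biggest = 0.0
        for i in range(1, n):  # boundary values y[0] and y[n] stay fixed
            g, gg = area_derivatives(y, i, h)
            step = r * g / gg
            y[i] -= step
            biggest = max(biggest, abs(step))
        if biggest < tol:
            break
    return y, sweep

y, sweeps = relax(1.0)
print(sweeps, round(y[5], 4))
```

For boundary data $y(0) = y(1) = 1$ the midpoint value should land close to the exact value 0.84834 listed
in Table 1; the sweep count of this sketch is not comparable to the iteration counts reported in the paper.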

Some  of  the  numerical  results  are  presented  below.  The  advantage  of  grid  refinement  can  be 
easily  seen  by  the  performance  of  the  algorithm  even  when  we  approach  the  critical  value  of  A. 


REFERENCES 

1. M. Hinata, M. Shimasaki and T. Kiyono, "Numerical solution of Plateau's problem by a
finite-element method", Math. Comp. 28 (1974) 45-59.

2. T. Tsuchiya, "On two methods for approximating minimal surfaces in parametric form",
Math. Comp. 46 (1986) 517-529.

3. H. J. Wagner, "A contribution to the numerical approximation of minimal surfaces",
Computing 19 (1977) 35-58.

4. T. F. Chen and R. Kannan, "Grid refinement and nonlinear SOR", to appear.




TABLE  FOR  NONLINEAR  SOR  AND  GRID  REFINEMENT 

Table 1. Numerical solution vs. exact solution with relaxation parameter r=0.85 and boundary data
y(0)=y(1)=1.

x                   0.20     0.30     0.40     0.50     0.60     0.70     0.80
Finite difference   0.90305  0.87294  0.85473  0.84828  0.85363  0.87097  0.90070
Finite element      0.90208  0.87222  0.85445  0.84856  0.85445  0.87221  0.90208
Exact               0.90194  0.87202  0.85424  0.84834  0.85424  0.87202  0.90194

Table 2. Finite difference solution with relaxation parameter r=0.85, and boundary data
y(0)=y(1)=0.78.

a) uniform grid

x              0.20     0.30     0.40     0.50     0.60     0.70     0.80
ra=0.78 (53)   0.61075  0.55527  0.51888  0.50159  0.50392  0.52723  0.57440
exact          0.60995  0.56005  0.53083  0.52121  0.53083  0.56005  0.60995

b) nonuniform grid

x              0.16216  0.27477  0.38739  0.50000  0.61261  0.72523  0.83784
ra=0.78 (25)   0.63754  0.57542  0.53633  0.52042  0.52864  0.56336  0.62986
exact          0.63458  0.57063  0.53342  0.52121  0.53342  0.57063  0.63456

Table 3. Finite difference solution with relaxation parameter r=0.85, nonuniform grid,
and boundary data y(0)=y(1)=ra.

x                  0.11789  0.21695  0.35848  0.50000  0.64152  0.78305  0.88211
ra=0.756 (50)      0.61498  0.53693  0.47169  0.44699  0.46675  0.54367  0.62832
ra=0.755 (52)      0.61152  0.53226  0.46594  0.44061  0.46034  0.53846  0.62486
exact (ra=0.756)   0.61658  0.53417  0.46289  0.43993  0.46289  0.53417  0.61658
exact (ra=0.755)   0.61149  0.52692  0.45393  0.43046  0.45393  0.52692  0.61149

Table 4. Finite element solution with relaxation parameter r=0.85 and boundary data
y(0)=y(1)=ra.

x              0.20     0.30     0.40     0.50     0.60     0.70     0.80
ra=0.78 (21)   0.61121  0.56166  0.53261  0.52305  0.53262  0.56165  0.61123
ra=0.77 (27)   0.59141  0.53926  0.50830  0.49878  0.50879  0.53923  0.59135
ra=0.76 (42)   0.56672  0.51084  0.47829  0.46757  0.47827  0.51085  0.56670
ra=0.756       0.55356  0.49532  0.46149  0.45039  0.46145  0.49527  0.55350
ra=0.755       0.54941  0.49035  0.45610  0.44487  0.45608  0.49033  0.54935

Table 5. Exact solution for boundary data y(0)=y(1)=ra.

x          0.20     0.30     0.40     0.50     0.60     0.70     0.80
ra=0.78    0.60995  0.56005  0.53083  0.52120  0.53083  0.56005  0.60995
ra=0.77    0.58931  0.53663  0.50586  0.49574  0.50586  0.53663  0.58931
ra=0.76    0.56279  0.50588  0.47276  0.46190  0.47276  0.50588  0.56279
ra=0.756   0.54625  0.48618  0.45135  0.43993  0.45135  0.48618  0.54625
ra=0.755   0.53930  0.47776  0.44212  0.43046  0.44212  0.47776  0.53930

Table 6. Finite element solution for boundary data y(0)=y(1)=0.755, r=0.85, nonuniform grid.

x                     0.18153  0.28769  0.39384  0.50000  0.60616  0.71231  0.81847
finite element (52)   0.56315  0.49624  0.45751  0.44481  0.45745  0.49614  0.56307
exact                 0.55374  0.48389  0.44361  0.43046  0.44361  0.48389  0.55374

[Figures: convergence plots for the finite element method with Y(0)=Y(1)=0.780, 0.770, 0.760, 0.756,
0.755 and 1.000, and for the finite difference method with Y(0)=Y(1)=0.755, 0.756, 0.770, 0.780 and
1.000. Each panel shows the initial guess, the 15th iteration, the last iteration and the exact
solution, plotted against the x-axis.]


Abstract


In 1985 the author introduced the Functional Binary Decomposition (FBD)
algorithm, $FBD_R \circ FBD_T$. Applied to a raster-formatted spatial representation, these
functions generate a set of 8-connected, 1-element regions which are provably robust.
In 1987 the author developed the straight line path algorithm, which uses this
representation to solve spatial problems. Recently the author developed a temporal
problem solver which also uses this representation. These algorithms, and others,
have been developed in-house at the CECOM Center for Signals Warfare (C2SW) as
part of an ongoing research effort to develop an automated system which will assist
the U.S. Army Division/CORPS Intelligence Officer (G2) during his preparation of a
tactical situation assessment (TSA).

During the TSA process the G2 attempts to recognize observed, ongoing enemy
tactical plans. Fortunately, finite domain plan theory is well formulated. David
Chapman, however, has shown (1987) that finite domain planning which permits
action-domain modification is, although domain independent and logically
consistent, undecidable. For real-world (infinite) domains, Chapman has relaxed the
domain independence requirement and has conjectured that planners must
instantiate local truth criteria to achieve (piece-wise) global logical consistency.
Chubb (1989), however, has shown that such criteria cannot exist and that
furthermore the real-world planner is incapable of reliably recognizing such criteria
if they did exist. Chubb has conjectured that real-world planners make special
circumscription and invariance assumptions while constructing real-world plans. It
is shown that these assumptions form the basis for the $FBD_R \circ FBD_T$ representation.
This representation is then generalized to an N-dimensional form capable of
representing conjunctive plans. A conjunctive form (CFBD) of the N-dimensional
representation is next developed, and spatial and temporal problem solutions
generated using both the N-dimensional and CFBD representations are shown to be
equivalent. A simple Army logistics problem is developed using two CFBD
representations. Preliminary experimental results suggest that optimal tactical
plans are ones which are maximally robust with minimal associated plan execution
cost.


Introduction


Since 1984 the U.S. Army CECOM Center for Signals Warfare (C2SW) has been actively
involved in the research and development of a body of mathematics which
would be used to provide automated assistance to the Army Division/CORPS Intelligence
Officer (G2) during his preparation of a tactical situation assessment (TSA)
[4,9,11,12]. The Army TSA consists of two major components: an Order of Battle
(OB) and a threat assessment, or threat; the two are closely related. The OB is a
description of WHAT enemy forces and equipment are present within the tactical domain
of interest and WHERE they are located. The threat assessment describes
WHY the enemy has his OB so configured. Early on in this research effort we noted
that during the preparation of the TSA the G2 is attempting to recognize ongoing enemy
plans. Since accurate plan generation and plan recognition capabilities form the
basis for effective command and control, whether military or otherwise, the development
of accurate and efficient planning algorithms has become a research topic of
considerable interest [1,2,3,4,8,10,13,14,15,16].

Plan generation algorithms may be classified by the rank of the planning domain,
i.e., finite or infinite. Several finite domain plan generation algorithms exist
[1,2,8,10,14,15,16] and finite domain planning appears to be well understood. However,
the extensibility of such techniques to infinite domains continues to be a research
topic. Recently, D. Chapman [2] proved that nonlinear planning within finite
domains which includes representations for conditional actions, dependency of effects
upon input situations, or derived side effects is undecidable. A paradox central to any
formal theory of infinite domain plan generation and recognition is how humans generate
or recognize real-world plans. Chapman and Agre [1,2] have conjectured that
the plan domain independence criteria be relaxed and that plan logical consistency be
assured locally by subsumption of domain specific truth criteria, a type of circumscription.
Chubb, however, has shown [3,4] that, with the possible exception of finite
domains, the existence of such criteria cannot be verified a priori by the planner.
Chapman and others [1,2] have suggested that real-world planners improvise, doing
something easy and observing results. This heuristic, however, begs the question:
what discriminator is used by the planner to discern "easy" from "difficult" plans,
since the criteria necessary to demonstrate plan admissibility within infinite domains
neither exist nor can be recognized? As a result, Chubb [3,4] has conjectured
that real-world planners must assume the following:

C-1 (circumscription): prior to plan action execution the planner is aware of all of the
domain features and associated values which may affect the desired plan action
execution, and,

C-2 (invariance): during the plan action execution the domain will remain essentially
invariant for a period of time sufficient to realize the expected value of the planner's
executed plan action.

Conjectures C-1 and C-2 are logically equivalent to Rao and Foo's [14] Axiom of Simple
Actions. We will show that C-1 and C-2 form the basis for a semantic spatial
representation called a Functional Binary Decomposition (FBD), which is used to solve
spatial and temporal problems similar to those considered by the G2 during his
preparation of the TSA. The mathematical foundations for the FBD are introduced in
Section I. A generalized N-dimensional FBD called a Conjunctive FBD (CFBD) is
described in Section II, and an example of its usage is given in Section III.


I. A Planning Paradigm


A plan, $P$, is defined as a time-ordered sequence of plan actions, $A_i$, $i = 1, \ldots, n$, where
each action is executed by a plan actor, $AC$, within some planning domain context,
$C_i$. Each actor-action-context expression, $\{AC, A, C\}_i$, is called a plan tuple. That is,

$P = \{\{AC, A, C\}_1, \ldots, \{AC, A, C\}_n\}$   (1)

where $n > 0$. Each plan tuple is a description of the actor/action/context prior to the
initiation of the action by the actor. Actions are executed by the plan actor within
domain contexts which contain features which make the initiation of the action
possible. Action execution transforms the domain context expression into a new
context expression wherein a new plan action may be executed.

The plan tuple domain context expression, $C_i$, is represented by a finite set of feature
tuples, $\{\{f_{1,x,y}, vf_{1,x,y}\}, \ldots, \{f_{q,r,t}, vf_{q,r,t}\}\}$. Each tuple expression contains a feature type
designator, $f_{q,r,t}$, and an associated feature value, $vf_{q,r,t}$. Feature subscripts represent
the feature type and spatial location within $C_i$. For the purposes of this paper, we
assume that $C_i$ is syntactically represented as a raster-formatted matrix of pixels, $c_{xy}$,
$x, y = 1, \ldots, m$. The spatial location portion of $f_{q,r,t}$ refers to the physical location
within $C_i$ represented by the pixel $c_{xy}$. Successful plans result in the development of a
$C_{n+1}$ context expression which contains the desired plan goal, $G$, i.e., $G \subseteq C_{n+1}$.


Plan  Action  Execution 

Plan action execution is the actor's reasoned manipulation of domain context and is
described by the plan tuple execution function, $E$. Let $C = \{\{f_g, vf_k\} \mid \forall\, g, k\}$, the set of all
possible context feature types and values.

Definition: If $P$ is a plan and $\{AC, A, C\}_i \in P$, for $i = 1, \ldots, n$, then $E$ is a mapping from
$\{AC, A, C\}_i \in P$ to the elements of $P(C)$, the power set of $C$.

By C-1, for each $\{AC, A, C\}_i \in P$, there exists some subset, $p_i \in P(C)$, where (presumably)
$p_i \subset C_i$, such that if $\{f_{q,r,t}, vf_{q,r,t}\} \in p_i$, then $\{f_{q,r,t}, vf_{q,r,t}\}$ affects the admissibility
of $E\{AC, A, C\}_i$. Note that by C-1, $E\{AC, A, C\}_i = E\{AC, A, p\}_i$. Fortunately,
the tuple $\{AC, A, p\}_i$ is known and understood for a variety of military contexts. In
particular, for military equipment (e.g., $AC$ = military tank) the elements of $p_i$ are
known a priori since during the production and testing of the actor (tank), a variety of
actor actions (movement, target location, firing, etc.) are tested under strictly
controlled ($p_i$) environmental conditions. For most military equipment, the relationship
between $AC$, $A_i$ and $p_i$ has been documented and is readily available [7]. Then,
given a $\{AC, A\}_i$, the value of $p_i$ can be predicted a priori and may be written as a Boolean
expression in terms of the feature tuples, $\{f_r, vf_r\}$, belonging to each $c_{xy} \in C_i$, i.e.,
$\{\{f_j, vf_j\} \wedge \neg\{f_g, vf_g\} \wedge \{\{f_k, vf_k\} \vee \{f_r, vf_r\}\} \ldots \vee \neg\{f_p, vf_p\}\}$.


Functional  Binary  Decomposition 

This Boolean expression is called a Functional Binary Decomposition
transformation ($FBD_T$). If $FBD_T$ is used as a criterion for actor action execution
admissibility, it is a mapping from $C_i$ to the binary set $\{0,1\}$. That is, given some
$\{AC, A\}_i$,

$FBD_T(c_{xy}) \to \{0,1\}$   (2)

for every $c_{xy} \in C_i$.

For spatial and temporal problem solving the FBD transformation of $C_i$ is given the
following properties:

1) $FBD_T(c_{xy}) = 0 \Rightarrow$ action execution admissible,

2) $FBD_T(c_{xy}) = 1 \Rightarrow$ action execution not admissible,

3) 0-element pixels are 4-connected, and,

4) 1-element pixels are 8-connected.

An FBD transformed $C_i$ may be represented in terms of its 1-elements, i.e.,
$FBD_T(c_{xy}) = 1$. Every path-connected set of 1-elements is called an obstacle region or
region and is characterized by the FBD representation ($FBD_R$). Region characterization
consists of the region's boundary list, adherent list, boundary cut points, and interior
points for simply connected regions. Multiply-connected (MC) regions are also
characterized by their MC-boundary lists, MC-adherent lists, and MC-boundary cut
points; see Figure 1.


Figure 1. A multiply-connected region and its associated FBD characterization.


The set of domain 1-elements so characterized,

$FBD_R \circ FBD_T(C_i) \to \{fbd_1, fbd_2, \ldots, fbd_k\}$   (3)

where $fbd_i$ is the $i$th 1-element region, is provably robust [6]. Representational
robustness is especially important during tactical situations when spatial features may
change quickly and abruptly.
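
The two stages in (3) can be pictured with a small sketch. It is illustrative only: the pixel feature
dictionaries, the admissibility predicate, and the function names `fbd_transform` and
`one_element_regions` are hypothetical, and the region step merely groups 1-elements by 8-connectivity
without building the boundary, adherent, or cut-point lists described above.

```python
def fbd_transform(context, not_admissible):
    """Map every pixel of a raster context to 0 (admissible) or 1 (not admissible)."""
    return [[1 if not_admissible(pixel) else 0 for pixel in row] for row in context]

def one_element_regions(binary):
    """Group 1-element pixels into 8-connected regions, in row-major order of discovery."""
    rows, cols = len(binary), len(binary[0])
    seen = set()
    regions = []
    for r in range(rows):
        for c in range(cols):
            if binary[r][c] != 1 or (r, c) in seen:
                continue
            seen.add((r, c))
            stack = [(r, c)]
            region = []
            while stack:
                pr, pc = stack.pop()
                region.append((pr, pc))
                for dr in (-1, 0, 1):
                    for dc in (-1, 0, 1):
                        nr, nc = pr + dr, pc + dc
                        if (dr or dc) and 0 <= nr < rows and 0 <= nc < cols \
                                and binary[nr][nc] == 1 and (nr, nc) not in seen:
                            seen.add((nr, nc))
                            stack.append((nr, nc))
            regions.append(sorted(region))
    return regions

# Hypothetical 2 x 2 context: a pixel is not admissible for this actor/action if it holds water.
context = [[{"water": True}, {"water": False}],
           [{"water": False}, {"water": True}]]
binary = fbd_transform(context, lambda pixel: pixel["water"])
regions = one_element_regions(binary)
```

Because 1-elements are 8-connected, the two diagonal water pixels form a single obstacle region here.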

The $FBD_T(C_i)$ binary representation is a semantic representation of action execution
admissibility. This representation, in keeping with C-1 and C-2, represents the
coupling between the execution of an action and the action execution domain for a
(possibly short) period of time.


II. Problem Solving Using $FBD_R \circ FBD_T(C_i)$


We will assume that for those actions of interest to the G2, $E\{AC, A, C\}_i$ will involve
spatial movement or change. Obviously, temporal change occurs during action
execution. Even the execution of the "wait" or "no-action" action involves temporal
change. The FBD representation in (3) will be used to solve spatial problems, while a
variant of this representation is used to solve temporal problems.

A spatial problem is a hypothesis about the spatial movement required during
$E\{AC, A, C\}_i$. Action execution begins spatially and temporally at the actor's spatial
location prior to action execution, i.e., some $c_{xy} \in C_i$. At the conclusion of the action
execution the actor is hypothesized to be spatially located at one or more goal
positions, $\{g_{rs}, \ldots, g_{kh}\}$.


Definition: A spatial problem, $P(c_{xy}, g_{rs})$, is satisfied or solved if and only if there exists a
4-connected, 0-element path, $P(c_{xy}, g_{rs})$, between the 0-element pixels $c_{xy}$ and $g_{rs}$ such that
every element of $P(c_{xy}, g_{rs})$ is an element of $FBD_T(C_i)$. Pixels $c_{xy}$ and $g_{rs}$ are then
path-connected in $FBD_T(C_i)$.


Problem goals may be specified as either disjunctive or conjunctive, e.g., $P(c_{xy}, \{g_{kh} \wedge g_{rs}\})$
requires that both $P(c_{xy}, g_{kh})$ and $P(c_{xy}, g_{rs})$ be satisfied.
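
The definition amounts to a reachability test over 0-element pixels. The sketch below uses an ordinary
breadth-first search as a stand-in; it is not the Straight Line Path Algorithm, and the function names
and the (row, column) indexing are assumptions made for the example.

```python
from collections import deque

def spatial_problem_solved(binary, start, goal):
    """True if a 4-connected path of 0-elements joins start and goal (both (row, col) pairs)."""
    rows, cols = len(binary), len(binary[0])
    if binary[start[0]][start[1]] != 0 or binary[goal[0]][goal[1]] != 0:
        return False
    seen = {start}
    queue = deque([start])
    while queue:
        r, c = queue.popleft()
        if (r, c) == goal:
            return True
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if 0 <= nr < rows and 0 <= nc < cols \
                    and binary[nr][nc] == 0 and (nr, nc) not in seen:
                seen.add((nr, nc))
                queue.append((nr, nc))
    return False

def conjunctive_goal_solved(binary, start, goals):
    """A conjunctive goal requires every individual spatial problem to be satisfied."""
    return all(spatial_problem_solved(binary, start, g) for g in goals)

open_grid = [[0, 1, 0],
             [0, 1, 0],
             [0, 0, 0]]
diagonal_gap = [[0, 1],
                [1, 0]]
print(spatial_problem_solved(open_grid, (0, 0), (0, 2)))
print(spatial_problem_solved(diagonal_gap, (0, 0), (1, 1)))
```

The second grid shows why the connectivity conventions matter: the two 0-elements touch only diagonally,
so under 4-connectedness they are not path-connected.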


If we assume that the time required for action execution is minimal (i.e., C-2), then a
type of spatial problem solving technique is to find one or more paths between
0-element points. Chubb's Straight Line Path Algorithm (SLPA) [5] makes use of the
$fbd_i$ adherent list information to efficiently search for such 0-element, 4-connected
paths (see Figure 2). This technique, however, only provides satisfied spatial solutions
and does not take into consideration the time required to traverse the path. A
satisfied spatial problem implies that action execution is globally admissible. However,
action execution may result in temporal requirements which are neither practical
nor possible.


Figure 2. An SLPA solution where two $fbd_i$ regions occlude the straight line projection between
start and goal points. Note that four possible paths are generated.

Definition: A temporal problem, $(P(c_{xy}, g_{rs}), K_{xy-rs})$, is satisfied or solved if and only if there
exists a spatial solution, $P(c_{xy}, g_{rs})$, such that the time required to traverse $P(c_{xy}, g_{rs})$ is
$\le K_{xy-rs}$, where $K_{xy-rs}$ is the problem's temporal criterion.

Since, by definition, a temporal solution is a spatial solution but the converse is not
necessarily true, a more general and powerful problem solving methodology is to first
develop a temporal problem solution which is then recursively constrained to produce
a tenable spatial solution. For example, assume that the set $A$ represents a temporal
solution for some $(P(c_{xy}, g_{rs}), K_{xy-rs})$. Then there exists some $\delta \ge 0$ such that $K_{xy-rs} - \delta$
represents a temporal solution where (rank $A$) is minimal. The value of $\delta$ may be
developed by recursively solving for min(rank $A$) with monotonically increasing values
of $\delta$. The algorithm which develops the temporal solution is now described.


Temporal  Problem  Solving 

The time needed to traverse a 4-connected pixel, $c_{xy} \in C_i$, is,

$t_{xy} = R_{xy} \div (w_{xy} \cdot V_{max})$   (4)

where $R_{xy}$ is the side length of the (square) location represented by $c_{xy}$, $V_{max}$ is the
maximum velocity at which the actor can move while executing this action, and $w_{xy}$ is
the coefficient of compliance between the domain and actor action execution, where
$0 < w_{xy} \le 1.0$. The $w_{xy}$ coefficient value indicates the percentage compliance between
actor action execution and the domain, i.e., $w_{xy} = 1.0$ indicates 100% compliance.

Then the time required to traverse $P(c_{xy}, g_{kh})$ is,

$T_{xy-kh} = \sum_{i=xy}^{kh} t_i = (R_{xy} \div V_{max}) \sum_{i=xy}^{kh} (w_i)^{-1}$   (5)

We (safely) assume that $R_{xy}$ is a constant for every $c_{xy} \in C_i$. $V_{max}$ is assumed
invariant during action execution. One-element pixels belonging to $FBD_T(C_i)$ are
assigned a coefficient value of zero. Division by zero in (5) is excluded using the
$FBD_R(C_i)$ representation for decision checks as follows.

The temporal problem solving (TPS) algorithm is a two-pass algorithm. Given start
and goal points $c_{xy}$ and $g_{zd}$, the first pass computes the minimal path time for path
$P(c_{xy}, h_{kh})$, where $h_{kh}$ is a zero-element point in $FBD_T(C_i)$ and $h_{kh} \ne g_{zd}$. The
minimal path time from $h_{kh}$ to $g_{zd}$ is then estimated. The estimated $T_{kh-zd}$ is
developed by computing the minimum 4-connected path distance between points, assuming
that there are no 1-element points occluding the path and that every $w_i$ has a value
of 1.0. If min $T_{xy-kh}$ + estimated $T_{kh-zd} \le K_{kh-zd}$, the point $h_{kh}$ is considered a
potential temporal solution path point and is saved for second pass processing.

The second pass consists of computing min $T_{zd-kh}$ for every $h_{kh}$ point selected during
pass one. If min $T_{xy-kh}$ + min $T_{zd-kh} \le K_{kh-zd}$, then the $h_{kh}$ point must belong to at
least one $P(c_{xy}, g_{zd})$ which satisfies the temporal constraint $K_{kh-zd}$. The set of all
such $h_{kh}$ becomes the TPS solution set. Likewise, a minimal spatial solution is
developed if the $w_i$ values are fixed as a positive, non-zero constant for all of the
0-element points belonging to $FBD_T(C_i)$ and the min $K_{kh-zd}$ is developed during the
problem solving process.
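
A compact way to picture the two passes is sketched below. It is an illustration under stated
assumptions rather than the TPS implementation: minimal path times are computed with Dijkstra's
algorithm, a path's time is taken as the sum of $t_i$ over all of its pixels including both end points
(so the time of $h_{kh}$ is subtracted once when the two halves are joined), the goal itself is allowed in
the solution set, and the names `min_times` and `tps` are hypothetical. A coefficient of 0 marks a
1-element, which is never entered, so no division by zero occurs.

```python
import heapq

def min_times(w, src, R=1.0, vmax=1.0):
    """Minimal traverse time from src to every reachable 0-element, 4-connected moves only."""
    rows, cols = len(w), len(w[0])

    def t(r, c):
        return R / (w[r][c] * vmax)      # equation (4); only called for w > 0

    best = {src: t(*src)}
    heap = [(best[src], src)]
    while heap:
        d, (r, c) = heapq.heappop(heap)
        if d > best[(r, c)]:
            continue                     # stale heap entry
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if 0 <= nr < rows and 0 <= nc < cols and w[nr][nc] > 0:
                nd = d + t(nr, nc)
                if nd < best.get((nr, nc), float("inf")):
                    best[(nr, nc)] = nd
                    heapq.heappush(heap, (nd, (nr, nc)))
    return best

def tps(w, start, goal, K, R=1.0, vmax=1.0):
    """Return the set of pixels lying on at least one start-goal path of time <= K."""
    from_start = min_times(w, start, R, vmax)
    # Pass one: optimistic remaining time (no obstacles, every coefficient equal to 1.0).
    candidates = [h for h, d in from_start.items()
                  if d + (abs(h[0] - goal[0]) + abs(h[1] - goal[1])) * R / vmax <= K]
    # Pass two: exact minimal time from the goal back to each saved candidate.
    from_goal = min_times(w, goal, R, vmax)
    solution = set()
    for h in candidates:
        if h in from_goal:
            t_h = R / (w[h[0]][h[1]] * vmax)
            if from_start[h] + from_goal[h] - t_h <= K:
                solution.add(h)
    return solution

# Hypothetical 3 x 3 domain: an obstacle in the centre and one slow pixel on the top row.
w = [[1.0, 0.5, 1.0],
     [1.0, 0.0, 1.0],
     [1.0, 1.0, 1.0]]
print(sorted(tps(w, (0, 0), (2, 2), K=5.0)))
```

In this example only the route down the left column and along the bottom row meets the limit of 5 time
units; the route across the top is slowed by the pixel with coefficient 0.5 and is excluded.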


The  Conjunctive  FBD  Representation 

The $FBD_R \circ FBD_T(C_i)$ possesses several inherent representational weaknesses,
which arise because the pixel feature information is spatially indiscriminate.
Locational information can be added, however, for essentially point sources, such as
buildings. Extended line objects, such as roads and railroads, which may be
represented by two or more pixels, require special representation since inter-pixel
trafficability may vary dramatically depending upon the features represented per
pixel. In addition, pixel spatial resolution is typically less than optimal, which tends
to make the image of $FBD_T(c_{xy})$ multi-valued.

An alternative representation methodology is to use an N-dimensional FBD
representation where each $FBD_R \circ FBD_T(X_{ij})$ planar image considers only a subset
of features present within each $C_i$, i.e., $X_{ij} \subset C_i$, such that $\bigcup X_{ij} = C_i$, $j = 1, \ldots, r$. For
example, $X_{ig}$ may represent a road network, whereas $X_{id}$ may represent bodies of
water. In addition, coupled with each $FBD_R \circ FBD_T(X_{ij})$ is a rule-based system
which is responsible for monitoring/changing the $fbd_h$ region binary values based
upon input data normally available to the G2. For example, assume that a major
road has been destroyed during a tactical engagement. This information would be
given to the "road" FBD rule-based system, which would create/add to the appropriate
$fbd_h$ region(s). In addition, associated with every $FBD_T$ function there exist rules
which monitor the associated ($r-1$) $FBD_T$ and adjust the $w_i$ values accordingly. An
N-dimensional temporal solution or path has a general form which is based upon an
N-dimensional definition of 4-connectedness.

Definition:  Apixelcxykc  FBD^X,*)  is  said  to  be  4N-connected  if  and  only  if  c^*  is  4- connected  to 
the  following  0-element  neighbors:  (c<x  i)y,k  .  C(x+  Vy,  k .  Cxfy-r),*  .  cx(y+  i),k)  for  all  7. N- 

Definition:  An  N-dimensional  path,  A(cxyy,  gz*  ,),  is  defined  to  be  the  ordered  list  of  0-element,  4N- 
connected  pixels  which  contain  no  loops  and  path-connect  cxy/  to  g,* ,  for  i,j  =  7 . N. 

The N-dimensional coplanar stack of $FBD_j(X_{ij})$ and their rules form the basis for a
spatial representation called a conjunctive FBD (CFBD). The CFBD is developed as
follows. Let $c_{xy,j}$ be the $(x,y)$th pixel in $FBD_j(X_{ij})$. Then for every pixel
$\psi_{xy} \in CFBD(C_i)$, set the value of $\psi_{xy}$ to

$$\psi_{xy} = (c_{xy,1} \wedge c_{xy,2} \wedge \cdots \wedge c_{xy,r}),$$

the conjunctive value. We use the following symbol to show that $CFBD(C_i)$ is formed
from the conjunct of the $FBD_j(X_{ij})$, where $j = 1, \ldots, r$:

$$CFBD(C_i) = \bigwedge_{j=1}^{r} FBD_j(X_{ij}) \qquad (7)$$
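
The conjunction in equation (7) can be illustrated with a minimal Python sketch. It assumes each FBD plane is stored as a list of rows of 0/1 values and that all planes have the same size. The function name conjunctive_fbd and the sample planes are hypothetical and are not taken from the Appendix B code.

```python
from typing import List

Plane = List[List[int]]  # one binary FBD plane: rows of 0/1 pixel values


def conjunctive_fbd(planes: List[Plane]) -> Plane:
    """Pixel-wise logical AND of r equally sized binary planes (hypothetical helper)."""
    if not planes:
        raise ValueError("at least one FBD plane is required")
    rows, cols = len(planes[0]), len(planes[0][0])
    for plane in planes:
        if len(plane) != rows or any(len(row) != cols for row in plane):
            raise ValueError("all FBD planes must have the same dimensions")
    # psi[x][y] = c[x][y] in plane 1 AND ... AND c[x][y] in plane r
    return [[int(all(plane[x][y] for plane in planes)) for y in range(cols)]
            for x in range(rows)]


# Illustrative 2x3 planes (made-up values, not data from the paper).
road_plane = [[1, 0, 1],
              [1, 1, 1]]
water_plane = [[1, 1, 1],
               [0, 1, 1]]
cfbd = conjunctive_fbd([road_plane, water_plane])
print(cfbd)  # [[1, 0, 1], [0, 1, 1]]
```

A pixel of the result is a 1-element only when it is a 1-element in every plane, so a 0-element in any single plane carries over to the CFBD.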

The  following  theorem  follows  immediately. 

Theorem: There exists a path $P(\psi_{xy}, \psi_{st})$ such that every element of $P(\psi_{xy}, \psi_{st})$ is
an element of $CFBD(C_i)$ if and only if there exists an N-dimensional path, $P(c_{xy,i}, g_{st,j})$,
such that $1 \le i, j \le r$ and $CFBD(C_i) = \bigwedge_{j=1}^{r} FBD_j(X_{ij})$.

Proof: Follows directly from (6), (7), and the last two definitions.
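
The equivalence stated in the theorem can be illustrated with a breadth-first search over 4-connected 0-elements. The Python sketch below is an illustration under stated assumptions, not the temporal problem solver itself: it treats 0-elements as traversable (as the path definition does), ignores coefficients and time, and uses hypothetical names (conjunct, has_path) and made-up planes.

```python
from collections import deque
from typing import List, Tuple

Plane = List[List[int]]
Pixel = Tuple[int, int]


def conjunct(planes: List[Plane]) -> Plane:
    """Pixel-wise AND of equally sized binary planes (hypothetical helper)."""
    return [[int(all(p[x][y] for p in planes)) for y in range(len(planes[0][0]))]
            for x in range(len(planes[0]))]


def has_path(plane: Plane, start: Pixel, goal: Pixel) -> bool:
    """Breadth-first search over the 4-connected 0-elements of one plane."""
    rows, cols = len(plane), len(plane[0])
    if plane[start[0]][start[1]] != 0 or plane[goal[0]][goal[1]] != 0:
        return False
    seen = {start}
    queue = deque([start])
    while queue:
        x, y = queue.popleft()
        if (x, y) == goal:
            return True
        for nx, ny in ((x - 1, y), (x, y + 1), (x + 1, y), (x, y - 1)):
            inside = 0 <= nx < rows and 0 <= ny < cols
            if inside and plane[nx][ny] == 0 and (nx, ny) not in seen:
                seen.add((nx, ny))
                queue.append((nx, ny))
    return False


# Two made-up planes: each offers only half of a corridor along the top row.
plane_a = [[0, 0, 1, 1],
           [1, 1, 1, 1]]
plane_b = [[1, 1, 0, 0],
           [1, 1, 1, 1]]
start, goal = (0, 0), (0, 3)
print(has_path(plane_a, start, goal))                       # False
print(has_path(conjunct([plane_a, plane_b]), start, goal))  # True
```

Neither plane alone connects the two pixels, but a path that switches planes midway appears as an ordinary 4-connected path in the conjunct, which is the reading of the theorem assumed here.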

We  now  examine  an  Army  constraint-based  logistics  problem  using  the  CFBD 
representation  and  the  temporal  problem  solver. 


III.  A  Logistics  Problem  Example 


In  this  example  we  assume  that  there  are  three  actors:  (Army)  tanks,  (Army)  fuel 
trucks,  and  a  mountain  snow  storm.  The  domain  is  the  northern  slope  of  a  mountain 
range  (see  Figure  3).  To  the  north  is  a  large  lake.  To  the  east  is  a  large  plains  area. 
Further  to  the  east  a  battle  is  about  to  begin.  Two  major  roads  are  evident.  One  goes 
through  the  mountains,  the  other  winds  about  on  the  northern  slopes  and  winds 
about the lake. Both roads converge to the west of the eastern plains area. At the
beginning  of  this  scenario  the  fuel  trucks  are  stationed  in  the  northwestern  corner  of 
the  plains,  the  tanks  are  located  in  the  southwestern  corner,  to  the  west  of  the 
mountains,  and  the  snow  storm  is  about  to  move  easterly  through  the  mountains. 

The  local  commander  has  told  the  tank  commanders  to  prepare  for  battle  along  the 
eastern  front.  Army  doctrine  demands  that  the  tanks  assemble  in  a  staging  area 
approximately  two  hours  prior  to  battle  and  that  they  be  fully  fueled  at  that  time. 
The tanks will attempt to distribute themselves uniformly within the staging area in
preparation for the attack. The fuel trucks should be positioned with the tanks
within the staging area to provide them with fuel. Table 1 gives the actor Vmax and
start position data.

Actor:            Tank      Fueling Trucks    Snowstorm
Vmax (m/m)        200       400               0-50
Start Location    (30,2)    (1,2)

Table 1. Actors, maximum action execution velocity, and starting positions.

Note that Vmax for the storm is given as a range of values since a
snow  storm,  though  moving,  may  appear  to  be  stationary  to  a  fixed  observer.  In 
addition,  once  the  storm  has  passed  through  an  area  we  assume  that  the  snow 
remains. The presence of snow is considered a deterrent to travel. Actor goal
positions are (4,22), (16,22), and (22,22), which correspond respectively to the most
northern, midpoint, and most southern staging area positions.


Mobility  Considerations 

Fueling trucks can only travel on or near the roads; most of the remaining terrain is
far  too  rough  for  travel.  Off-road  travel  is  slower,  depending  upon  terrain  factors. 

Tanks can move anywhere except within the mountains and the lake. Tanks can also
travel on the roads; however, the road surface increases track wear, thereby causing
the tanks to move more slowly.
Figure 3. The domain of the logistics problem. Actors include tanks, fueling trucks,
and a snow storm. A mountain snow storm is preparing to move southeasterly through
the mountains, making the mountain road impassable for both the fueling trucks and tanks.


A  Logistics  Problem 

Assuming that both the tanks and the fuel trucks are ready to move to the staging
area and that no snow is falling or has fallen, what is the best strategy for the tanks
and trucks to use to minimize the time the tanks must wait within the staging area? How
does the presence or likelihood of a mountain snow storm affect this strategy, if at all?

In order to solve this problem, several FBDs must be developed for this domain: a
road  network  FBD  (Figure  A-l),  a  fuel  truck  FBD  (Figure  A-2),  and  a  tank  FBD 
(Figure  A-3).  Next,  a  fuel  truck  CFBD  (Figure  A-4)  and  a  tank  CFBD  (Figure  A-5) 
are  developed.  Finally,  coefficient  values  are  developed  for  the  fuel  truck  CFBD  and 
tank  CFBD.  The  fuel  truck  coefficients  are  1.0  (max  weight)  for  road  surfaces  and 
0.8  for  off-road  excursions.  The  tank  coefficients  are  set  to  1.0  for  off-road  surfaces 
and 0.75 for on-road surfaces. Note that both the FBDs and the coefficient values
reflect no-snow, no-traffic, "best-possible" conditions. The constraint-based model
was run for every goal point with both actors (see Figures 4a, b, c, and d) with no snow
present.
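
The coefficient assignment above can be written as a small lookup table. The Python sketch below uses the coefficient values quoted in the text and the Vmax values from Table 1. How the solver combines a coefficient with Vmax is not restated in this section, so the multiplicative model in effective_velocity is an assumption, and all names are hypothetical.

```python
# Coefficient values quoted in the text (best-possible, no-snow conditions).
COEFFICIENTS = {
    "fuel_truck": {"on_road": 1.0, "off_road": 0.8},
    "tank": {"on_road": 0.75, "off_road": 1.0},
}

# Maximum velocities from Table 1 (units as given there).
VMAX = {"fuel_truck": 400.0, "tank": 200.0}


def effective_velocity(actor: str, surface: str) -> float:
    """Assumed model: the coefficient scales the actor's maximum velocity."""
    return VMAX[actor] * COEFFICIENTS[actor][surface]


for actor in ("fuel_truck", "tank"):
    for surface in ("on_road", "off_road"):
        print(actor, surface, effective_velocity(actor, surface))
```

Under this assumed model the trucks are fastest on the roads and the tanks are fastest off them, which matches the mobility considerations described earlier.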

Table 2 lists the no-snow, minimum transit times from the tank and truck start
points to the three goal points. In this model trucks always lead the tanks by at
least 2.2 minutes and no logistical problems are foreseen. Note that the minimal
temporal path for the trucks to both (16,22) and (22,22) is through the mountain
pass. The presence of the third actor, the snow, makes the decision to use the
mountain pass road less than optimal.


Start to:    (22,22)     (16,22)     (4,22)
Tank:        16.4        19.7        26.5
Truck:       14.2 (M)    15.8 (M)    18.0

Table 2. Actor minimum transit times in minutes (no snow).

(M) = using the southern, mountain road
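
The lead of the trucks over the tanks can be checked directly from Table 2. The Python sketch below assumes that tanks and trucks are compared at the same goal point. The dictionary and function names are hypothetical.

```python
# Minimum transit times in minutes from Table 2 (no snow), keyed by goal point.
TANK_TIMES = {(22, 22): 16.4, (16, 22): 19.7, (4, 22): 26.5}
TRUCK_TIMES = {(22, 22): 14.2, (16, 22): 15.8, (4, 22): 18.0}


def truck_lead(tank_times, truck_times):
    """Minutes by which the trucks precede the tanks at each goal point."""
    return {goal: round(tank_times[goal] - truck_times[goal], 1)
            for goal in tank_times}


leads = truck_lead(TANK_TIMES, TRUCK_TIMES)
print(leads)                # {(22, 22): 2.2, (16, 22): 3.9, (4, 22): 8.5}
print(min(leads.values()))  # 2.2
```

The smallest lead, 2.2 minutes, occurs at the southernmost goal point (22,22).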


Figure 4a. Temporal problem solver results using the Tank CFBD representation.
Points (30,2) to (4,22) yield a minimum time of 26.5 minutes.

Figure 4b. Same as Figure 4a except points are (30,2) and (22,22), the southernmost
point. Minimum time is 16.4 minutes.

Figure 4c. Temporal problem solver results using the Fuel CFBD representation.
Points (1,2) to (22,22) yield a minimum time of 14.3 minutes.

Figure 4d. Same as Figure 4c except points are (1,2) and (4,22). Minimum time is
18.0 minutes. Note that an off-road excursion is made by the trucks as part of the
minimal time solution.
Snow Present

Snow  was  introduced  within  the  mountain  region  incrementally  from  no-snow 
present  (0%)  to  an  impassable  amount  of  snow  (100%).  Mobility  was  simply 
computed  to  be:  1.0  -  (%  snow  within  the  mountain  area).  Although  the  effect  of  snow 
upon  actor  mobility  is  linear,  the  presence  of  snow  produces  a  nonlinear  effect  upon 
the temporal computation because the spatial component (the percentage of the
total  road  surface)  affected  is  assumed  to  be  unknown  as  is  the  amount  of  snow 
present within that area. Note that when a pixel's mobility reaches zero, as in
equation (5), the FBD characterization transforms this pixel into a region 1-element,
which effectively prevents any further mobility consideration.
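
The mobility rule above is simple enough to state as code. The Python sketch below implements mobility = 1.0 - (% snow), with the percentage expressed on a 0-100 scale. The pixel_state helper reflects an assumed reading of the text, namely that a pixel becomes a region 1-element once its mobility drops to zero. Both function names are hypothetical.

```python
def snow_mobility(percent_snow: float) -> float:
    """Mobility = 1.0 - (fraction of snow within the mountain area)."""
    if not 0.0 <= percent_snow <= 100.0:
        raise ValueError("percent_snow must lie between 0 and 100")
    return 1.0 - percent_snow / 100.0


def pixel_state(percent_snow: float) -> str:
    """Assumed reading: a pixel with zero mobility is treated as a 1-element."""
    return "1-element" if snow_mobility(percent_snow) == 0.0 else "0-element"


for pct in (0, 25, 60, 100):
    print(pct, snow_mobility(pct), pixel_state(pct))
```

Only the 100% case changes the pixel's state; at every lower percentage the pixel remains traversable with reduced mobility.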


Figure 5. Minimum route time for the fueling truck as a function of % mountain snow
present (horizontal axis: percentage snow present). Note that the effect upon
minimum time is nonlinear.


The  only  actor  affected  by  the  presence  of  snow  was  the  fueling  truck  which  used  the 
most  southern  route  (Table  2)  rather  than  the  longer  valley  route.  Figure  5  shows  the 
relationship  between  %  snow  present  and  transit  time  for  the  trucks.  Note  that  the 
southern  route  is  always  shorter  than  the  valley  route  until  %  snow  exceeds  60%  at 
which  time  the  valley  route  time  becomes  the  shortest  temporal  route  to  both  (16,  22) 
and  (22, 22)  for  the  trucks. 

Without knowing either the amount of snow present or the percentage of the route
affected by the snow, the conservative or "worst case" minimum transit times are
given in Table 3.


Start to:    (22,22)     (16,22)     (4,22)
Tank:        16.4        19.7        26.5
Truck:       20.7 (V)    19.2 (V)    18.0

Table 3. Actor worst case minimum transit times in minutes (100% snow).

(V) = valley route
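
The choice between the mountain and valley routes can be framed as a worst-case (minimax) selection. The Python sketch below uses the truck times to goal (22,22) from Tables 2 and 3. It assumes that the valley route time is unaffected by mountain snow and that the mountain route is impassable at 100% snow. The names are hypothetical, and the paper does not state that its solver works this way.

```python
import math

# Truck transit times in minutes to goal (22,22).
# Mountain route: 14.2 with no snow (Table 2); impassable at 100% snow.
# Valley route: 20.7 (Table 3); assumed unaffected by mountain snow.
ROUTE_TIMES = {
    "mountain": {"no_snow": 14.2, "full_snow": math.inf},
    "valley": {"no_snow": 20.7, "full_snow": 20.7},
}


def worst_case_route(route_times):
    """Pick the route whose worst scenario time is smallest (minimax)."""
    return min(route_times, key=lambda r: max(route_times[r].values()))


def best_case_route(route_times):
    """Pick the route whose best scenario time is smallest."""
    return min(route_times, key=lambda r: min(route_times[r].values()))


print(worst_case_route(ROUTE_TIMES))  # valley
print(best_case_route(ROUTE_TIMES))   # mountain
```

The optimistic choice is the mountain road, while the conservative choice is the valley route, whose time does not depend on the snow scenario.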
Summary and Conclusions

It appears that the conservative logistics will service most of the tanks with fuel as
they arrive in the staging area. However, the importance of this plan is not that it
offers adequate logistics support but that the plan is insensitive to the mountain
snow scenario; the plan is more robust than the other plans. The application of
robust plans which are less likely to be constrained by other domain actors appears to
be a powerful command strategy. We have indicated that the accuracy of a real-world
planning algorithm depends upon its ability to accurately predict the interaction
between action execution and the planning domain. However, the author has shown
[3, 4] that for real-world domains this relationship is at best enumerable and
probabilistic. At worst, the relationship appears (to the planner) to be chaotic
because the planner is not aware of important domain feature data or, equivalently,
because he lacks experience with similar data to make an accurate prediction.
Real-world assumptions C-1 and C-2 suggest that planner experience is the most robust
criterion to employ. As such, perhaps constraint-based problem solving, based upon
actor experience, is best used to develop minimally constrained, maximally robust
strategies as given in Table 3.

Experimental results seem to suggest that whenever two or more plans share the
same goal, G, the preferred plan maximizes plan robustness while minimizing the
cost of plan execution. If $P_1$ and $P_2$ are plans (i.e., $P_1 = \{(AC,A,C)_1, \ldots, (AC,A,C)_k\}$,
$P_2 = \{(AC,A,C)_1, \ldots, (AC,A,C)_h\}$) such that $G \subset \{C\}_{k+1}$ and $G \subset \{C\}_{h+1}$, then $P_1$ is
preferred to $P_2$ if

$$\sum_{P_1} \bigl|\, robustness(AC,A,C)_i - cost\,\|E(AC,A,C)_i\| \,\bigr| > \sum_{P_2} \bigl|\, robustness(AC,A,C)_j - cost\,\|E(AC,A,C)_j\| \,\bigr|.$$


All of the algorithms described herein have been implemented in Common LISP on
a Texas Instruments Explorer II AI Workstation. Appendix B contains a copy of the
FBD characterization implementation, i.e., equation (3).
APPENDIX A


Appendix A includes FBD and CFBD figures which were generated using a Texas
Instruments Explorer II AI Workstation.


Figure A-1. Road network FBD

Figure A-2. Fueling Truck FBD

Figure A-3. Tank FBD

Figure A-4. Fueling Truck CFBD

Figure A-5. Tank CFBD
APPENDIX B

Appendix B contains a copy of the in-house generated Common LISP code used to
produce a set of FBD-characterized regions, i.e., equation (3). Input consists of
1024 pixels in matrix format (32x32). The top-level function is (MAKE-FBD R), where
$0 \le r \le 1.0$. Function output consists of a set of LISP atoms, i.e.,
($fbd_i, fbd_{i+1}, \ldots, fbd_k$), each with atomic properties, i.e., boundary list,
adherent list, cut points, mc-boundary list, mc-adherent list, mc-cut points, and the
associated values as described herein.
;;; Mode: Common-Lisp; Base: 10
;;;
;;; Title: FBD Program to support AISAF Situation Assessment Army Effort
;;; This file contains two major programs:
;;;   a) ability to build a random bit array, FBD-ARRAY,
;;;   and b) ability to make an FBD representation of the FBD-ARRAY.
;;; Call program with (MAKE-FBD THRSH).
;;;
;;; Programmer: Douglas W. J. Chubb
;;; File: RACHMANINOFF: FBD-REGION; CREATE-FBD.LISP
;;; Initial Development: 891018 090000
;;; Last Modification: 891122 090000

(export '(xdim ydim fbd-array start-x start-y next-x next-y
          build-fbd set-display-array) 'USER)

;;; DEFVARS for program

(defvar xdim 32 "fbd-array width")       ; fbd-array x-dimension
(defvar ydim 32 "fbd-array height")      ; fbd-array y-dimension
(defvar fbd-array
  (make-array (list xdim ydim)
              :type 'art-1b))            ; create a 1-bit array fbd-array
(defvar thrsh 0.00)
(defvar start-x)                         ; boundary start point (x,y)
(defvar start-y)
(defvar next-x)                          ; successor point
(defvar next-y)
(defvar title 'FBD-Array-Data)
(w:make-font-purpose fonts:hl12b :flashy)

; Fill FBD-ARRAY with binary elements (1 or 0) depending upon value
; of random number generator.

(defun set-fbd-array (thrsh)
  (do ((i 0 (1+ i)))
      ((> i (1- xdim)))
    (do ((j 0 (1+ j)))
        ((> j (1- ydim)))
      (cond ((>= (random 0.99) thrsh)
             (setf (aref fbd-array i j) 1))
            (T (setf (aref fbd-array i j) 0))))))


; DISPLAY-FBD creates the windows which display:
;   1. FBD-ARRAY
;   2. Boundary List
;   3. Adherent List
;   4. Interior Points
;   5. Adherent points if region multiply-connected
;   6. Boundary list if region multiply-connected.

(defun display-fbd (top left results)
  (let* ((mag 7)                               ; magnification constant
         (fbd-window (make-instance 'w:window
                       :top top :left left
                       :height (* mag ydim)
                       :width (* mag xdim)
                       :save-bits nil
                       :borders 1
                       :font-map '(:flashy)
                       :label title
                       :reverse-video-p t
                       :expose-p t)))
    (setf fbd-bitblt (make-array (list (* mag xdim) (* mag ydim))
                                 :type 'art-1b))
    (array-initialize fbd-bitblt 0)
    (do ((x 0 (1+ x)))
        ((> x (1- xdim)))
      (do ((y 0 (1+ y)))
          ((> y (1- ydim)))
        (cond ((= 1 (aref fbd-array x y))      ; magnify fbd-array results
               (do ((i (* x mag) (1+ i)))
                   ((>= i (* (1+ x) mag)))
                 (do ((j (* y mag) (1+ j)))
                     ((>= j (* (1+ y) mag)))
                   (setf (aref fbd-bitblt i j) 1)))))))
    (send fbd-window :bitblt w:alu-seta (* mag xdim) (* mag ydim) fbd-bitblt 0 0 0 0)
    (mapc (function (lambda (fbd-atom)
            (cond ((equal title 'Boundary-List)
                   (mapc (function (lambda (cpt)
                           (setf cpt (car cpt))
                           (w:make-blinker fbd-window 'w:character-blinker
                             :font fonts:tr12b
                             :character #\*
                             :x-pos (* mag (second cpt))
                             :y-pos (* mag (first cpt)))))
                         (get fbd-atom 'cut-points)))
                  ((equal title 'M-C-Adh-Boundary-Lists)
                   (mapc #'(lambda (cp-list)
                             (mapc (function (lambda (cpt)
                                     (setf cpt (car cpt))
                                     (w:make-blinker fbd-window 'w:character-blinker
                                       :font fonts:tr12b
                                       :character #\*
                                       :x-pos (* mag (second cpt))
                                       :y-pos (* mag (first cpt)))))
                                   cp-list))
                         (get fbd-atom 'mc-cut-points)))
                  (T))))
          results)))


; MAKE-FBD is the starting routine which creates initial binary array,
; boundary list, etc. If thrsh < 0.0, routine being called by
; mouse routine and input data drawn, not using a random number
; generator.

(defun MAKE-FBD (thrsh)
  (let ((results nil))
    (setq add-regions nil)               ; add & delete variable used during
    (setq delete-regions nil)            ; TEST-FBD-ROBUSTNESS function.
    (cond ((>= thrsh 0.0) (set-fbd-array thrsh)))
    (make-frame)
    (setf title 'FBD-Array-Data)
    (display-fbd 10 300 nil)
    (do ((i 0 (1+ i)))
        ((> i (1- xdim)))
      (do ((j 0 (1+ j)))
          ((> j (1- ydim)))
        (cond ((= (aref fbd-array i j) 1)
               (setq start-x i)
               (setq start-y j)
               (setq next-x nil)
               (setq next-y nil)
               (setq results (cons (build-fbd i j (1- i) j (gensym 'fbd)) results))))))
    (display-results results)
    results))


; DISPLAY-RESULTS sets up variables and atom properties to display
; using DISPLAY-FBD subroutine.

(defun display-results (results)
  (array-initialize fbd-array 0)
  (setq title 'Boundary-List)
  (set-display-array fbd-array results 'b-list)       ; display boundary points
  (display-fbd 250 150 results)
  (array-initialize fbd-array 0)
  (setq title 'Adherent-List)
  (set-display-array fbd-array results 'a-list-out)   ; display adherent points
  (display-fbd 250 450 nil)
  (array-initialize fbd-array 0)
  (setq title 'Interior-Points)
  (set-display-array fbd-array results 'i-list)       ; display interior points
  (display-fbd 510 20 nil)
  (array-initialize fbd-array 0)
  (setq title 'Mul-Conn-Adherent-Lists)
  (set-display-array fbd-array results 'a-list-in)    ; display adherent points
  (display-fbd 510 300 nil)
  (array-initialize fbd-array 0)
  (setq title 'M-C-Adh-Boundary-Lists)
  (set-display-array fbd-array results 'mc-list)      ; display boundary points
  (display-fbd 510 580 results))


; SET-DISPLAY-ARRAY sets contents of "PROP" into FBD-ARRAY for display on TV

(defun set-display-array (farray fbd-atom-list prop)
  (mapc (function (lambda (fbd-atm)
          (mapc (function (lambda (plst)
                  (cond ((listp (first plst))
                         (mapc (function (lambda (pix)
                                 (setf (aref farray (first pix) (second pix)) 1)))
                               plst))
                        (T (setf (aref farray (first plst) (second plst)) 1)))))
                (get fbd-atm prop))))
        fbd-atom-list))


; BUILD-FBD is the master routine which creates the FBD representation for
; each 8-connected, 1-element region in FBD-ARRAY.

(defun build-fbd (xp yp x2 y2 fbd-atom)
  (let* ((b-list (list (list xp yp)))
         (a-list-out nil)
         (a-list-in nil)
         (i-list nil)
         (1-element nil))
    (cond ((and next-x next-y) (setf b-list nil)))
    (mapc (function (lambda (pts)
            (cond ((and (= (aref fbd-array (first pts) (second pts)) 0)
                        (null 1-element))
                   (setf a-list-out (cons pts a-list-out)))
                  ((and (= (aref fbd-array (first pts) (second pts)) 0)
                        1-element)
                   (setf a-list-in (cons pts a-list-in)))
                  ((null 1-element)
                   (setf b-list (cons pts b-list))
                   (setf 1-element pts))                ; save successor point
                  (T (setq i-list (cons pts i-list))))))
          (cw-scan-pattern xp yp x2 y2))
    (cond ((null 1-element)                             ; a single 1-element point
           (setf (get fbd-atom 'b-list) b-list)
           (setf (get fbd-atom 'a-list-out) a-list-out)
           (setf (get fbd-atom 'i-list) i-list))
          (T (setf (get fbd-atom 'b-list)
                   (append (get fbd-atom 'b-list) (reverse b-list)))
             (setf (get fbd-atom 'a-list-out)
                   (append (get fbd-atom 'a-list-out) (reverse a-list-out)))
             (setf a-list-in
                   (set-difference a-list-in (get fbd-atom 'a-list-out) :test #'equal))
             (setf (get fbd-atom 'a-list-in)
                   (union (get fbd-atom 'a-list-in) a-list-in :test #'equal))
             (setf (get fbd-atom 'i-list)
                   (set-difference (union i-list (get fbd-atom 'i-list) :test #'equal)
                                   (get fbd-atom 'b-list) :test #'equal))))
    (cond ((null 1-element)
           (remove-region fbd-atom) fbd-atom)
          ((and (= xp start-x) (= yp start-y) (null next-x) (null next-y))
           (setf next-x (first 1-element))
           (setf next-y (second 1-element))
           (build-fbd next-x next-y xp yp fbd-atom))
          ((and (= xp start-x)
                (= yp start-y)
                (= next-x (first 1-element))
                (= next-y (second 1-element)))
           (setf (get fbd-atom 'b-list)                 ; remove last two pts
                 (cddr (reverse (get fbd-atom 'b-list)))) ; from boundary list
           (count-cut-points fbd-atom 'b-list 'cut-points)
           (setf (get fbd-atom 'a-list-in)              ; compute 0-element pts
                 (remove-duplicates
                   (set-difference (get fbd-atom 'a-list-in)
                                   (get fbd-atom 'a-list-out)
                                   :test #'equal) :test #'equal))
           (build-interior-list fbd-atom)
           (parse-4-space fbd-atom)
           (rebuild-boundary-list fbd-atom)
           (remove-region fbd-atom) fbd-atom)
          (T (build-fbd (first 1-element) (second 1-element) xp yp fbd-atom)))))
(defun parse-4-space (fbd-atom)
  (let ((ain (remove-duplicates (get fbd-atom 'a-list-in) :test #'equal)))
    (cond (ain (setf (get fbd-atom 'a-list-in) nil)
               (parse-4-space2 fbd-atom (list (first ain)) (cdr ain))
               (order-adherent-list fbd-atom))
          (T))))

; PARSE-4-SPACE2 parses the 4-connected ADHERENT points into a 4-connected
; subspace of 0-elements.

(defun parse-4-space2 (fbd-atom pt 0-elements)
  (let* ((4neigh nil) (0-list 0-elements))
    (cond ((null 0-elements)
           (setf (get fbd-atom 'a-list-in)
                 (cons pt (get fbd-atom 'a-list-in))))
          (T (mapc (function (lambda (sps)
                     (mapc (function (lambda (4pt)
                             (cond ((member 4pt 0-elements :test #'equal)
                                    (setf 4neigh (cons 4pt 4neigh))
                                    (setf 0-list (remove 4pt 0-list :test #'equal)))
                                   (T))))
                           (4neighbors sps))))
                   pt)
             (cond ((null 4neigh)
                    (setf (get fbd-atom 'a-list-in)
                          (cons pt (get fbd-atom 'a-list-in)))
                    (parse-4-space2 fbd-atom (list (first 0-list)) (cdr 0-list)))
                   (T (parse-4-space2 fbd-atom (append pt 4neigh) 0-list)))))))


; ORDER-ADHERENT-LIST orders each 4-connected region of 0-elements
; into elements which represent the ADHERENT LIST for that region.

(defun order-adherent-list (fbd-atom)
  (let ((alist nil) (alist-in nil) (1-flag nil))
    (mapc (function (lambda (adherent-list)
            (setf alist nil)
            (mapc (function (lambda (adherent-point)
                    (setf 1-flag nil)
                    (mapc (function (lambda (8neigh)
                            (cond ((= 1 (aref fbd-array (first 8neigh) (second 8neigh)))
                                   (setf 1-flag T))
                                  (T))))
                          (8neighbors adherent-point))
                    (cond ((null 1-flag) (setf alist (cons adherent-point alist))))))
                  adherent-list)
            (setf alist-in (cons (remove-duplicates
                                   (set-difference adherent-list alist :test #'equal)
                                   :test #'equal) alist-in))))
          (get fbd-atom 'a-list-in))
    (setf (get fbd-atom 'a-list-in) nil)
    (mapc (function (lambda (adherent-list)
            (setf (get fbd-atom 'a-list-in)
                  (cons (build-adherent-boundary adherent-list)
                        (get fbd-atom 'a-list-in)))))
          alist-in)))


; BUILD-ADHERENT-BOUNDARY is a helper function to ORDER-ADHERENT-LIST
; function.

(defun build-adherent-boundary (alist)
  (let* ((aorder (copy-seq alist)) (aaa nil) (boundary nil))
    (cond ((< (length alist) 3) alist)
          (T (setf aaa (stable-sort aorder #'(lambda (pt1 pt2)
                                               (cond ((< (first pt1) (first pt2)) T)
                                                     (T NIL)))))
             (setf aorder (stable-sort aaa #'(lambda (pt1 pt2)
                                               (cond ((and (= (first pt1) (first pt2))
                                                           (< (second pt1) (second pt2))) T)
                                                     (T NIL)))))
             (setf start-x (first (car aorder)))
             (setf start-y (second (car aorder)))
             (setf next-x nil)
             (setf next-y nil)
             (setf boundary (list (list start-x start-y)))
             (build-adherent-b2 start-x start-y (1- start-x) start-y aorder boundary)))))


; BUILD-ADHERENT-B2 is another helper function to BUILD-ADHERENT-BOUNDARY
; function.

(defun build-adherent-b2 (xp yp x2 y2 splist bblist)
  (let ((next nil))
    (mapc #'(lambda (sppt)
              (cond ((and (null next) (member sppt splist :test #'equal))
                     (setf next sppt)
                     (setf bblist (cons sppt bblist)))
                    (T nil)))
          (cw-4-scan xp yp x2 y2))
    (cond ((null next) (print "error"))
          ((and (= xp start-x)
                (= yp start-y)
                (null next-x)
                (null next-y))
           (setf next-x (first next))
           (setf next-y (second next))
           (build-adherent-b2 (first next) (second next)
                              xp yp splist bblist))
          ((and (= xp start-x)
                (= yp start-y)
                (= next-x (first next))
                (= next-y (second next)))
           (reverse (cddr bblist)))
          (T (build-adherent-b2 (first next) (second next)
                                xp yp splist bblist)))))


; CW-4-SCAN generates a 4-connected, clock-wise scan pattern

(defun cw-4-scan (xp yp x2 y2)
  (let* ((4pts (4neighbors (list xp yp)))
         (cwlst (cdr (member (list x2 y2) 4pts :test #'equal))))
    (append cwlst (set-difference 4pts cwlst :test #'equal))))


; 4NEIGHBORS generates the 4-connected neighborhood.

(defun 4neighbors (el)
  (let* ((x (first el)) (y (second el)))
    (list (list (1- x) y) (list x (1+ y))
          (list (1+ x) y) (list x (1- y)))))


; REBUILD-BOUNDARY-LIST is master routine used to generate the multiply-connected
; boundary lists. This routine uses the ordered ADHERENT LISTS in property
; "a-list-in", i.e., Adherent-List-inside-region, to generate the boundary lists.

(defun rebuild-boundary-list (fbd-atom)
  (cond ((null (get fbd-atom 'a-list-in)))
        (T (rebuild-boundary-list3 fbd-atom)
           (cond ((null (get fbd-atom 'mc-list)))
                 ((listp (first (first (get fbd-atom 'mc-list)))))
                 (T (setf (get fbd-atom 'mc-list)
                          (list (get fbd-atom 'mc-list)))))
           (count-cut-points fbd-atom 'mc-list 'mc-cut-points))))


; REBUILD-BOUNDARY-LIST3 is the "work-horse" function for generating the
; multiply-connected "inner" boundary lists.

(defun rebuild-boundary-list3 (fbd-atom)
  (let* ((iblist nil) (ib2 nil) (fpt nil) (pattern nil) (sp-flag nil))
    (mapc #'(lambda (adlist)
              (setf iblist nil)
              (cond ((> (length adlist) 1)
                     (setf fpt (first adlist))
                     (mapl #'(lambda (adherent)
                               (cond ((= 1 (length adherent))
                                      (setf pattern (cw-scan-pattern
                                                      (first fpt)
                                                      (second fpt)
                                                      (first (first adherent))
                                                      (second (first adherent)))))
                                     (T (setf pattern (cw-scan-pattern
                                                        (first (second adherent))
                                                        (second (second adherent))
                                                        (first (first adherent))
                                                        (second (first adherent))))))
                               (cond ((> (length adherent) 2)
                                      (setf pattern (set-difference pattern
                                                      (member (third adherent) pattern :test #'equal)
                                                      :test #'equal))))
                               (setf sp-flag nil)
                               (setf ib2 nil)
                               (mapc #'(lambda (ng)
                                         (cond ((and (null sp-flag)
                                                     (or (member ng (get fbd-atom 'i-list) :test #'equal)
                                                         (member ng (get fbd-atom 'b-list) :test #'equal)))
                                                (setf ib2 (cons ng ib2)))
                                               ((and ib2
                                                     (null sp-flag)
                                                     (member ng adlist :test #'equal))
                                                (setf sp-flag T))
                                               (T)))
                                     pattern)
                               (setf ib2 (reverse ib2))
                               (cond ((and (> (length iblist) 1)
                                           (equal (first ib2) (second (reverse iblist)))
                                           (equal (second ib2) (first (reverse iblist))))
                                      (setf ib2 (cddr ib2))))
                               (setf iblist (append iblist ib2)))
                           adlist))
                    (T (mapc #'(lambda (ng)
                                 (cond ((or (member ng (get fbd-atom 'i-list) :test #'equal)
                                            (member ng (get fbd-atom 'b-list) :test #'equal))
                                        (setf iblist (cons ng iblist)))
                                       (T)))
                             (8neighbors (first adlist)))))
              (cond (iblist (setf (get fbd-atom 'mc-list)
                                  (cons (setf ib2 (remove-dups-sequences iblist))
                                        (get fbd-atom 'mc-list))))
                    (T (print "ERROR..Rebuild-Boundary-List2"))))
          (get fbd-atom 'a-list-in))))


; When the MC boundaries are first generated there are (possibly) numerous
; duplicates, both point-wise and patterns of points. The following routines
; effectively remove these patterns.

(defun remove-dups-sequences (mclist)
  (let ((mc2 mclist) (duup nil))
    (cond ((and (> (length mclist) 1)
                (equal (first mclist)
                       (first (reverse mclist))))
           (remove-dups-sequences (cdr mclist)))
          (T
           (mapl #'(lambda (mc-mc)
                     (cond ((= (length mc-mc) 1) (setf duup (cons (first mc-mc) duup)))
                           ((equal (first mc-mc) (second mc-mc)))
                           (T (setf duup (cons (first mc-mc) duup)))))
                 mc2)
           (cond ((= (length mc2) (length duup))
                  (remove-many-items (remove-dup-tails mc2)))
                 (T (remove-dups-sequences (reverse duup))))))))


(defun remove-dup-tails (mc2)
  (let ((rmc2 (reverse mc2)))
    (cond ((and (> (length mc2) 3)
                (equal (first mc2) (second rmc2))
                (equal (second mc2) (first rmc2)))
           (remove-dup-tails (cddr mc2)))
          ((and (> (length mc2) 5)
                (equal (first mc2) (third rmc2))
                (equal (second mc2) (second rmc2))
                (equal (third mc2) (first rmc2)))
           (remove-dup-tails (cdddr mc2)))
          (T mc2))))


(defun remove-many-items (mc3)
  (let ((duf nil) (skip-flag 0))
    (cond ((< (length mc3) 4) mc3)
          (T
           (mapl #'(lambda (mc22)
                     (cond ((> skip-flag 0) (setf skip-flag (1- skip-flag)))
                           (T
                            (cond ((< (length mc22) 4)
                                   (setf duf (cons (first mc22) duf)))
                                  ((and (< (length mc22) 6)
                                        (equal (first mc22) (third mc22))
                                        (equal (second mc22) (fourth mc22)))
                                   (setf skip-flag 1))
                                  ((< (length mc22) 6)
                                   (setf duf (cons (first mc22) duf)))
                                  ((and (equal (first mc22) (fourth mc22))
                                        (equal (second mc22) (fifth mc22))
                                        (equal (third mc22) (sixth mc22)))
                                   (setf skip-flag 2))
                                  ((and (equal (first mc22) (third mc22))
                                        (equal (second mc22) (fourth mc22)))
                                   (setf skip-flag 1))
                                  (T (setf duf (cons (first mc22) duf)))))))
                 mc3)
           (cond ((= (length duf) (length mc3)) mc3)
                 (T (remove-many-items (reverse duf))))))))


; 8NEIGHBORS generates the 8-connected neighborhood of points.

(defun 8neighbors (pixel)
  (let ((x (first pixel)) (y (second pixel)))
    (list (list (1- x) y) (list (1- x) (1+ y))
          (list x (1+ y)) (list (1+ x) (1+ y))
          (list (1+ x) y) (list (1+ x) (1- y))
          (list x (1- y)) (list (1- x) (1- y)))))

; BUILD-INTERIOR-LIST is master routine for discovering 1-element region
; interior points. Method used is "region-growing".

(defun build-interior-list (fbd-atom)
  (cond ((null (get fbd-atom 'i-list)))
        (T (interior-hunt fbd-atom (get fbd-atom 'i-list)))))


(defun interior-hunt (fbd-atom intlist)
  (let* ((neighborhood nil) (i-flag nil))
    (mapc (function (lambda (ipoint)
            (setf neighborhood (interior-point-check fbd-atom ipoint))
            (cond (neighborhood (setf (get fbd-atom 'i-list)
                                      (union neighborhood (get fbd-atom 'i-list)
                                             :test #'equal))
                                (setf i-flag (append i-flag neighborhood)))
                  (T))))
          intlist)
    (cond (i-flag
           (setf i-flag (remove-duplicates i-flag :test #'equal))
           (setf (get fbd-atom 'i-list)
                 (remove-duplicates (get fbd-atom 'i-list) :test #'equal))
           (interior-hunt fbd-atom i-flag))
          (T))))


(defun interior-point-check (fbd-atom ip)
  (let* ((scan nil) (neighbor nil))
    (setf scan (cw-scan-pattern (first ip) (second ip) (1- (first ip)) (second ip)))
    (mapc (function (lambda (bip)
            (cond ((and (= (aref fbd-array (first bip) (second bip)) 0)
                        (not (member bip (get fbd-atom 'a-list-in) :test #'equal))
                        (not (member bip (get fbd-atom 'a-list-out) :test #'equal)))
                   (setf (get fbd-atom 'a-list-in) (cons bip (get fbd-atom 'a-list-in))))
                  ((= (aref fbd-array (first bip) (second bip)) 0))
                  ((or (member bip (get fbd-atom 'i-list) :test #'equal)
                       (member bip (get fbd-atom 'b-list) :test #'equal)))
                  (T (setf neighbor (cons bip neighbor))))))
          scan)
    neighbor))


CW-SCAN-PATTERN returns a cw scan list of 8-neighborhood points about
point (x1,y1) starting at the first cw-position from point (x2,y2).
The last point returned is point (x2,y2).


(defun cw-scan-pattern (x1 y1 x2 y2)
  (let ((cw-scan (list (list (1- x1) y1) (list (1- x1) (1+ y1))
                       (list x1 (1+ y1)) (list (1+ x1) (1+ y1))
                       (list (1+ x1) y1) (list (1+ x1) (1- y1))
                       (list x1 (1- y1)) (list (1- x1) (1- y1)))))
    ;; NOTE: the standard leaves the order of SET-DIFFERENCE's result
    ;; unspecified; this code relies on list order being preserved.
    (append (cdr (append (member (list x2 y2) cw-scan :test #'equal)
                         (set-difference cw-scan (member (list x2 y2) cw-scan :test #'equal))))
            (list (list x2 y2)))))


COUNT-CUT-POINTS discovers boundary list cut points. Once found, the points are
annotated with the number of new regions which will be established if that
1-element is changed to a 0-element.


(defun count-cut-points (atm prop indicator)
  (let* ((c-list (get atm prop)) (c-lister nil) (cut-points nil) (ncuts 0) (rest-cuts nil))
    (mapc (function (lambda (pt)
            (cond ((listp (first pt))
                   (setf cut-points nil)
                   (setf c-lister nil)
                   (setf rest-cuts (list-cut pt 1))
                   (mapc (function (lambda (pt2)
                           (setf ncuts (- (length pt) (length (remove pt2 pt :test #'equal))))
                           (cond ((and (> ncuts 1)
                                       (not (member pt2 cut-points :test #'equal))
                                       (list-cut (remove pt2 pt :test #'equal)
                                                 (+ ncuts
                                                    (length (remove pt2 rest-cuts :test #'equal)))))
                                  (setf cut-points (cons pt2 cut-points))
                                  (setf c-lister (cons (cons pt2 ncuts) c-lister))))))
                         pt)
                   (cond (rest-cuts
                          (mapc #'(lambda (rcut)
                                    (cond ((member rcut cut-points :test #'equal))
                                          ((and (= 1 (- (length pt)
                                                        (length (remove rcut pt :test #'equal))))
                                                (> (length (member rcut pt :test #'equal)) 1)
                                                (member (second (member rcut (reverse pt) :test #'equal))
                                                        (8neighbors (second (member rcut pt :test #'equal)))
                                                        :test #'equal))
                                           (setf cut-points
                                                 (cons (second (member rcut (reverse pt) :test #'equal))
                                                       cut-points))
                                           (setf c-lister (cons (cons (first cut-points) 2) c-lister)))
                                          (T)))
                                rest-cuts)))
                   (setf (get atm indicator) (cons c-lister (get atm indicator))))
                  (T (setf ncuts (- (length c-list) (length (remove pt c-list :test #'equal))))
                     (cond ((and (not (member pt cut-points :test #'equal)) (> ncuts 1))
                            (setf cut-points (cons pt cut-points))
                            (setf (get atm indicator)
                                  (cons (cons pt ncuts) (get atm indicator)))))))))
          c-list)
    (setf (get atm indicator) (reverse (get atm indicator)))))

LIST-CUT checks whether a multiple point in the boundary list is a cut point
and whether any other points are potential cut points.


(defun list-cut (mc1 ncuts)
  (let ((cut-flag nil))
    (mapl #'(lambda (mc2)
              (cond ((and (= (length mc2) 1)
                          (not (member (first mc1) (8neighbors (first mc2)) :test #'equal)))
                     (setf cut-flag (cons (first mc2) cut-flag)))
                    ((= (length mc2) 1))
                    ((not (member (second mc2) (8neighbors (first mc2)) :test #'equal))
                     (setf cut-flag (cons (first mc2) cut-flag)))
                    (T)))
          mc1)
    (cond ((>= (length cut-flag) ncuts) cut-flag)
          (T nil))))


REMOVE-REGION  removes  1-element  points  from  FBD-ARRAY  once  the  region 
has  been  completely  characterized  by  an  equivalent  FBD  expression. 


(defun remove-region (atm)
  ;; Clear the interior points, then the boundary points, of the region.
  (mapc (function (lambda (pts)
          (setf (aref fbd-array (first pts) (second pts)) 0)))
        (get atm 'i-list))
  (mapc (function (lambda (pts)
          (setf (aref fbd-array (first pts) (second pts)) 0)))
        (get atm 'b-list)))


MAKE-FRAME  creates  a  0-element  frame  about  the  FBD-ARRAY. 


(defun make-frame ()
  (do ((i 0 (1+ i)))
      ((> i (1- xdim)))
    (do ((j 0 (1+ j)))
        ((> j (1- ydim)))
      (cond ((or (= j 0) (= j (1- ydim)) (= i 0) (= i (1- xdim)))
             (setf (aref fbd-array i j) 0))))))

GEOMETRIC REASONING FOR RECOGNITION OF
THREE DIMENSIONAL OBJECT FEATURES * †


M.  Marefat  and  R.  L.  Kashyap 
School  of  Electrical  Engineering 
Purdue  University,  West  Lafayette,  IN 


ABSTRACT. A method for extracting manufacturing shape features from the boundary
representation of a polyhedral object is presented. In this approach, the depressions of the part
are represented as cavity graphs, which are in turn used as a basis for hypothesis
generation-elimination. The proposed cavity graphs are an extended representation in which the
links reflect the concavity of the intersection between two faces, and the node labels reflect the
relative orientation of the faces comprising the depression. The hypotheses are generated by
decomposing the cavity graphs into maximal constituents. The incorrect hypotheses are
eliminated by rule-based experts, which can discard a hypothesis or opportunistically improve
it and propose it for reexamination.

Emphasis is put on automatic analysis of depressions which are formed by interactions of
primitive features, because previous methods have had limited success in handling interactions.
It is shown that although there is a unique subgraph for each primitive feature, not every cavity
graph corresponds to a unique set of primitive features. Consequently, since the cavity graph
of a depression may not be the union of the representations for the involved primitives, we
introduce the concept of virtual links for the formal analysis of the depressions based on cavity
graphs. Finally, a suitable method for automatic determination of the virtual links is presented.
This method is based on combining topologic and geometric evidence, and uses a combination
of Dempster-Shafer decision theory and clustering techniques to reach its conclusions.
Experimental results for a number of examples which are not correctly analyzed by previous
systems are presented throughout the paper, and implementation details are discussed.

1.  INTRODUCTION.  Understanding  the  shape  of  an  object  is  essential  in  CAD  and  CAM 
to  automate  the  link  between  design  and  manufacturing,  and  is  important  in  computer  vision  for 
recognizing  objects  based  on  their  properties.  The  traditional  CAD  description  of  the  design  of 
a  prismatic  part  represents  its  geometry  as  a  data  structure  involving  faces,  edges,  and  vertices. 
However, to manufacture a part, we need to describe it in terms of the higher level semantic
features  such  as  slots,  holes,  and  pockets  because  the  machining  operations  create  only  these 
features  and  not  edges,  faces  or  vertices.  Thus  this  higher  level  feature  information  should  be 
extracted  from  the  CAD  description.  The  problem  is  then  to  find  a  useful  description  of  the 
object  in  terms  of  the  shape  features  given  a  boundary  representation  of  the  object  in  terms  of 
face,  edge,  and  vertex  entities. 

Several approaches have been proposed for recognition of object features from CAD data.
Woo [16] used convex hull techniques to describe the object as alternating sums of volumes.
Kyprianou [9] and Staley [13] applied syntactic pattern recognition methods to classify
depressions. Henderson [6] and Kung [8] used logic programming and expert systems to extract
shape features, while Joshi [7] developed Attributed Adjacency Graphs for a part. De Floriani [5]
uses connectivity properties to classify features into DP-features (protrusions or depressions)
and H-features (through holes or handles). While most of the above methods use the boundary
description of the object as input, Lee and Fu [10] propose algorithms for extraction and
unification of some features from a CSG tree. However, previous methods have very limited
success in recognition of interacting features because:

(i)  Interaction between primitives produces different versions of a primitive. There is no
unique representation for all different occurrences of a primitive feature.


*  Supported  by  the  U.S.  Army  Research  Office 

† A detailed discussion of the reported research appears in the October 1990 issue of
IEEE  Transactions  on  Pattern  Analysis  and  Machine  Intelligence. 




(ii)  During  the  course  of  interaction  between  primitives,  faces  of  a  primitive  feature  may  be 
divided  into  several  disconnected  components  thus  creating  new  faces.  Furthermore,  two 
intersecting  faces  of  a  primitive  (when  the  primitive  is  isolated)  may  not  be  adjacent  in  a 
compound  feature  containing  the  interaction  of  the  primitive  with  another  primitive. 

In this paper we describe a novel approach for identifying and recognizing the primitive
features in the depressions of a polyhedral machining part. Figure 1 illustrates the overall
approach of our proposed method. It takes advantage of graph matching, expert system, and
reasoning with uncertainty techniques. Briefly, the contributions of this work can be
summarized as follows:

(i). A new graph representation for the primitive shape features of an object is introduced. In
this representation each face is represented by one node, the node labels determine the
relative orientation of the faces of the object in space, and the links determine the
non-convexity between two faces.

(ii). A method for extracting primitive shape features of an object even when these primitives
are interacting is developed. This method uses a hypothesis generation-elimination
approach. The hypotheses are generated by decomposing the cavity graphs of the object
into maximal subgraph constituents, and the incorrect hypotheses are eliminated by
rule-based experts which evaluate each hypothesis.

(iii). The concept of virtual links for the graph representation of an object is introduced.
Because the graph of a depression is not necessarily the union of the representations for
the involved primitives, to include the correct hypotheses in the hypothesis space, we show
that unification to merge the nodes of a group of separated faces, and augmenting cavity
graphs with virtual links, form powerful shape analysis techniques.

(iv). We develop a method of reasoning with uncertainties to determine the virtual links to be
added to a cavity graph. Virtual links can be determined by combining geometric and
topologic evidence. The shape primitives found based on these techniques may be
interpreted as the primitives most probably comprising the depression.

The remainder of this paper describes the details of our methods and of the prototyped
system implemented. In the next section we describe the primitive features and methods for
extracting them when they are not interacting. Section three discusses interaction between
primitives. Section four describes generation of hypotheses and the reasoning strategy to select
the correct subset of them. Sections five and six are devoted to presenting the concept of virtual
links, the methods for determining the most probable features in a cavity, and the gathering and
combining processes of topologic and geometric evidence. Section seven discusses
verification. Implementation is briefly presented in section eight. Section nine presents
discussion and experimental results from a prototyped system interfaced to the PADL/2.0 [1]
solid modeling system, showing the extraction of primitive features from some parts that are
not correctly analyzed by previous systems.

2. PRIMITIVE FEATURES. Faces of a part are usually machined in groups. These
groups form primitive machining features, such as pockets, slots, and steps, because chunks of
manufacturing knowledge are associated with each group. A primitive shape feature (or
primitive feature) of an object may informally be defined as a connected set of faces from its
boundary with semantic meaning for accomplishing a desired task. Figure 2 shows the primitive
features we considered. One may note that each depicted primitive represents a family of shape
features, since concave and convex angles may assume any value in the concave or convex
range. Each member of the family is obtained by selecting a different combination of face
intersection angles from the appropriate range such that the combination is physically realizable
in 3D. We selected shape features that are important for machining a part, but the methods we
use may be applied to shape features of interest in other areas.
3.  Prismatic  Hole 


Figure  2:  Primitive  features  and  their  isolated  local  representations. 
Node  labels  are  dominant  direction  of  normal  to  the  face 
away  from  the  material.  Other  consistent  permutations  of  X,  Y, 
and  B  also  lead  to  representations  for  the  same  primitive. 


2.1  Representing  Primitive  Features.  In  order  to  recognize  the  primitives  in  the  boundary  of  an 
object,  a  unique  representation  for  each  primitive  feature  is  required.  Our  representations  for 
the  primitives  are  similar  to  the  Attributed  Adjacency  Graphs  proposed  by  Joshi  [7]  with  the 
nodes  labeled.  The  representations  for  primitives  are  also  shown  in  figure  2.  In  our  approach, 
each  primitive  is  represented  by  a  labeled  graph.  Each  face  of  the  primitive  corresponds  to  a 
node  of  this  graph,  and  two  nodes  are  connected  by  a  link  if  the  corresponding  faces  intersect. 
Each  node  is  labeled  by  a  label  from  the  set  {B,  -B,  +X,  -X,  +Y,  -Y}.  The  node  labels  show 
the  orientation  of  a  face  in  space.  The  node  3:  +Y  ,  for  instance,  indicates  that  the  dominant 
component  of  the  normal  to  face  3  is  in  the  +Y  direction  of  the  local  coordinates. 

A  unique  subgraph  pattern  corresponds  to  each  primitive,  but  rotating  the  X,  Y,  and  B 
labels  in  a  graph  such  that  the  local  coordinates  follow  the  right  hand  rule  corresponds  to  a 
graph  for  an  identical  feature.  Assuming  that  we  keep  our  local  coordinates  fixed,  replacing 
node  labels  in  the  template  of  a  primitive  corresponds  to  a  rotated  version  of  the  primitive 
(hence  the  subgraph  pattern  is  the  same  but  node  labelings  are  different). 
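As a rough illustration of the labeled-graph representation described above, the following Python sketch stores a primitive as a map from faces to labels plus a set of undirected links. The helper names (`make_primitive`, `relabel`), the face numbers, and the choice of +X/-X for the two sides of the thru-slot are assumptions made for this example; they are not taken from figure 2.

```python
# Node labels allowed in the representation (taken from the text).
LABELS = {"B", "-B", "+X", "-X", "+Y", "-Y"}


def make_primitive(node_labels, links):
    """Return a labeled graph: face -> label plus a set of undirected links."""
    unknown = {lab for lab in node_labels.values() if lab not in LABELS}
    if unknown:
        raise ValueError(f"labels outside the allowed set: {sorted(unknown)}")
    edges = set()
    for a, b in links:
        if a == b or a not in node_labels or b not in node_labels:
            raise ValueError(f"bad link: {(a, b)}")
        edges.add(frozenset((a, b)))
    return {"labels": dict(node_labels), "links": edges}


def relabel(graph, mapping):
    """Apply a label substitution; labels missing from `mapping` are kept."""
    labels = {face: mapping.get(lab, lab) for face, lab in graph["labels"].items()}
    return {"labels": labels, "links": set(graph["links"])}


# Illustrative thru-slot: a base with two opposite sides that do not touch.
# Face numbers and the +X/-X assignment are assumptions for this sketch.
THRU_SLOT = make_primitive({9: "B", 8: "+X", 10: "-X"}, [(9, 8), (9, 10)])

# A quarter turn of the local X and Y axes about the base normal.
QUARTER_TURN = {"+X": "+Y", "+Y": "-X", "-X": "-Y", "-Y": "+X"}

rotated = relabel(THRU_SLOT, QUARTER_TURN)
print(rotated["labels"])
```

The relabeled graph keeps the same link pattern and differs only in its node labels, which is the sense in which a rotated primitive shares the subgraph pattern of its template.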

2.2 Extracting Isolated Primitive Features. Using the above representation, extracting the
primitive features from the boundary of an object whose features are isolated, that is, whose
primitives are not interacting, becomes a simple task. A global graph of the object is first
constructed. A global graph is a representation for the entire object. The global graph, G, for
an object, S, has the following properties:

(i). Each face $f_i$ of the object, S, is represented by a node in G.

(ii). For every edge, $e_{ij}$, of the object shared by two faces $f_i$ and $f_j$, there is an edge
connecting the corresponding nodes in G.

(iii). $l(G)$ is a labeling for the edges of G, which marks an edge in the graph concave if the
faces sharing the corresponding edge of the object are concavely adjacent, and marks
the edge convex otherwise.

To find the graphs representing the cavities in the object, the subgraph of the global graph
consisting of only the concave links is obtained. Each component of this subgraph disconnected
from the other components is a cavity graph of the object. The nodes of each cavity graph are
labeled. To label the nodes, the node with the greatest number of concave edges is selected to
be the base and is labeled B. A pair of X,Y directions are chosen so that together with the
normal to the base they conform to the right hand rule. The other nodes are labeled according to
the direction of the normal to the corresponding face away from the object. The label for the
node corresponding to face $f_i$ is denoted by $l'(f_i)$.
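The extraction of cavity graphs described above can be sketched in Python as follows: keep only the concave links of the global graph, split the result into connected components, and pick the node with the most concave links as the base. This is a minimal sketch under stated assumptions; the function names, the edge-tuple format, the tie-breaking rule for the base, and the convex edges to a hypothetical face 6 are all choices made for the example.

```python
from collections import defaultdict


def cavity_graphs(edges):
    """Split the concave part of a global graph into connected components.

    `edges` is an iterable of (face_a, face_b, kind) with kind equal to
    "concave" or "convex". Returns a list of (nodes, links) pairs.
    """
    adj = defaultdict(set)
    for a, b, kind in edges:
        if kind == "concave":
            adj[a].add(b)
            adj[b].add(a)
    seen, components = set(), []
    for start in sorted(adj):
        if start in seen:
            continue
        stack, nodes = [start], set()
        while stack:  # depth-first walk over concave links only
            n = stack.pop()
            if n in nodes:
                continue
            nodes.add(n)
            stack.extend(adj[n] - nodes)
        seen |= nodes
        links = {frozenset((a, b)) for a in nodes for b in adj[a]}
        components.append((nodes, links))
    return components


def pick_base(nodes, links):
    """Node with the most concave links; ties go to the smallest face id
    (the tie rule is an assumption, the text does not specify one)."""
    degree = {n: sum(1 for l in links if n in l) for n in nodes}
    return min(nodes, key=lambda n: (-degree[n], n))


# A part like the one in figure 3: a pocket (base 1, sides 2-5) and a
# thru-slot (base 9, sides 8 and 10). Face 6 and its convex edges are made up.
edges = [(1, s, "concave") for s in (2, 3, 4, 5)]
edges += [(2, 3, "concave"), (3, 4, "concave"), (4, 5, "concave"), (5, 2, "concave")]
edges += [(9, 8, "concave"), (9, 10, "concave")]
edges += [(2, 6, "convex"), (8, 6, "convex")]

components = cavity_graphs(edges)
bases = [pick_base(nodes, links) for nodes, links in components]
print([sorted(nodes) for nodes, _ in components], bases)
```

Convex edges never enter the adjacency map, so faces that touch only across convex edges end up in different cavity graphs.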

The  above  cavity  graphs  may  now  be  matched  to  the  representations  for  the  primitives. 
Each cavity graph corresponds to one primitive's representation because the primitives are
isolated. Each match generates one hypothesis stating that the depression in the boundary of the
object  corresponds  to  a  primitive  of  the  type  matched.  The  expert  rule-based  systems  check 
each  hypothesis  and  approve  the  correct  ones.  Figure  3  shows  this  process  of  extracting  the 
primitive  features  from  the  boundary  of  an  object  when  they  are  isolated.  While  identifying  a 
primitive  the  information  describing  its  instance,  such  as  its  base  face,  its  axis,  its  diameter,  etc. 
is  reconstructed.  This  information  is  represented  in  frames  which  are  stored  in  the  Feature 
Knowledge-Base. 

3.  INTERACTING  FEATURES.  Primitive  features  are  usually  not  isolated.  In  fact  the 
significance  of  this  research  is  associated  with  recognizing  interacting  features  and  finding  their 
constituent  primitives.  The  number  of  primitive  features  to  be  recognized  is  finite,  but  the 
configurations  of  features,  their  intersections,  and  the  types  of  interactions  that  may  arise  in  a 
compound  cavity  are  unlimited.  It  is  not  practical  to  enumerate  all  possible  interactions  of 
shape  features. 

The major hindrance in determining the constituent primitives of an object is
nonuniqueness. The cavity graphs for a depression consisting of interactions may not match the
representation for any of the primitives. In fact, the subgraph representing a primitive in an
interaction could be different from the representations shown above for isolated primitives.
HYPOTHESES:

(Pocket Base:1 Sides:(2 3 4 5))
(Thru_slot Base:9 Sides:(8 10))

(c)

EXTRACTED PRIMITIVES:

(verified (Pocket Base:1 Sides:(2 3 4 5)) Certainty:1.0)
(verified (Thru_slot Base:9 Sides:(8 10)) Certainty:1.0)

(d)


Figure 3: Extraction of Isolated Primitive Features

(a): A part with an isolated pocket and a thru-slot primitive.
(b1),(b2): The cavity graphs of the object.
(c): The hypotheses generated from the cavity graphs in (b1),(b2).
(d): The verified primitives extracted by the prototyped system.

Consider the part shown in figure 4. The subgraph of the global graph of the part consisting of
only the concave links does not match the representation for any of the primitives. The part has
a pocket (faces 13,14,15,16,20,12) and a prismatic hole (faces 16,17,19,20). The subgraph
induced by nodes 13,14,15,16,20,12 is the subgraph representing the pocket in the interaction.
Clearly this subgraph is not isomorphic to the representation for a pocket primitive.

The  problem  in  the  above  example  is  a  direct  result  of  the  interaction  between  primitive 
features.  As  a  consequence  of  the  intersection  of  the  hole  and  the  pocket,  one  of  the  side  faces 
of the pocket is divided into two smaller disconnected faces, 13 and 14. Faces that
have been produced by dividing a larger face through the interaction of primitive
features are called unifiable. Faces 13 and 14 in the above example are unifiable. We can unify
the  faces  in  a  unifiable  group  to  form  a  larger  conceptual  face.  The  conceptual  face  produced 
by  unifying  faces  13  and  14  is  represented  by  [13,14].  The  following  Observation  expresses  the 
minimum  conditions  required  for  a  set  of  faces  to  be  unifiable. 

Observation 1: The set of faces $U = \{f_1, f_2, \ldots, f_i\}$ of an object S is unifiable if

(i). every face in the set U is embedded in the same plane, i.e. the faces have the
same plane equation,

(ii). no face of the part intersects the interior of the face $[f_1, f_2, \ldots, f_i]$ formed by
unifying $f_1, f_2, \ldots, f_i$, and

(iii). the normals of all elements of U point in the same direction.
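The three conditions of Observation 1 can be sketched as a simple Python check. This is only an illustration under stated assumptions: each face's plane is given as a tuple (a, b, c, d) meaning ax + by + cz = d, with (a, b, c) a unit normal pointing away from the material; the obstruction test of condition (ii) is delegated to a caller-supplied predicate `interior_is_clear`, a hypothetical stand-in for a query to the solid modeler; and the plane values in the example are made up.

```python
def coplanar(p, q, tol=1e-9):
    """Condition (i): same plane, regardless of which way the normal points."""
    same = all(abs(a - b) <= tol for a, b in zip(p, q))
    flipped = all(abs(a + b) <= tol for a, b in zip(p, q))
    return same or flipped


def same_direction(p, q, tol=1e-9):
    """Condition (iii): the unit normals agree."""
    return all(abs(a - b) <= tol for a, b in zip(p[:3], q[:3]))


def unifiable(faces, planes, interior_is_clear):
    """Check the three conditions of Observation 1 for a list of faces."""
    first = planes[faces[0]]
    return (all(coplanar(first, planes[f]) for f in faces)
            and all(same_direction(first, planes[f]) for f in faces)
            and interior_is_clear(faces))  # condition (ii), supplied by the caller


# Made-up plane data: faces 13 and 14 lie in x = 2 with normal +x,
# face 15 lies in y = 5, and face 99 lies in x = 2 but faces the other way.
planes = {
    13: (1.0, 0.0, 0.0, 2.0),
    14: (1.0, 0.0, 0.0, 2.0),
    15: (0.0, 1.0, 0.0, 5.0),
    99: (-1.0, 0.0, 0.0, -2.0),
}
always_clear = lambda faces: True  # stub: assume nothing obstructs the unified face

print(unifiable([13, 14], planes, always_clear))
print(unifiable([13, 15], planes, always_clear))
print(unifiable([13, 99], planes, always_clear))
```

Faces 13 and 99 show why condition (iii) is stated separately: they share a plane but bound material on opposite sides, so they are not merged.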

The notion of unifiability is coupled with the machining of the set of unified faces.
Conditions (i) and (iii) of observation 1 indicate that the unified faces have similar geometry, and
condition (ii) guarantees that no other part of the object obstructs the larger face formed by
unification. Therefore, all faces in a unifiable set may be machined by the same group of
machining operations.

In the same manner that the node labeling helps distinguish between representations which
otherwise seem identical (example in figure 9), unification helps in obtaining a unique
representation for a primitive in its interaction with other primitives. Figure 4(c) shows the
subgraph of the global graph 4(b) consisting of concave edges with the faces unified and the
nodes labeled. It is clear that the subgraph of the cavity graph, 4(c), induced by nodes
[13,14],15,16,20,12 is isomorphic to the representation for a pocket. In fact, the cavity graph is
the union of the subgraphs representing a pocket and a prismatic hole (subgraph induced by
16,17,19,20). Using $l(e)$ to denote the label for edge $e$, and $l'(f)$ to denote the label for
face $f$, the cavity graphs can be defined as follows:

Definition: A Cavity Graph, H, is a subgraph of the global graph, G, of the part
such that H is connected and:

(i). unifiable faces of G are unified,

(ii). $l(e) = \text{concave} \quad \forall e \in E(H)$, and

(iii). $l'(f) \in \{B, -B, +X, -X, +Y, -Y\} \quad \forall f \in F(H)$,

where $E(H)$ and $F(H)$ are the edges and the faces of the cavity graph, H,
respectively.


The cavity graph 4(c), as mentioned above, is composed from the union of the subgraphs
for a pocket primitive and a prismatic hole primitive. Therefore, it seems logical that to obtain
the primitive features of the part we must decompose the cavity graph into the subgraphs for the
constituent primitives. The cavity graphs obtained according to the above definition are, in
general, a proper subgraph of the union of the representations for the primitives involved in the
interaction, or isomorphic to the union of the representations. Decomposing the cavity graphs
into a set of maximal subgraph patterns representing the primitive features yields a set of
promising hypotheses about the primitive constituents of the depressions in the part.
Figure 4: Cavity Graph Representation for a Depression

(a1) shows an example part; (a2) shows the depression in
(a1), which is a pocket with base-face 5, with a prismatic hole
opening into it. Note that faces 13 and 14 are coplanar.

(b) shows the cavity graph of the depression.
In (b), (15 : +Y) means that the normal to face 15
is in direction +Y. Faces 13 and 14, which meet all criteria for
unification, are unified and denoted by [13,14].

4. HOW ARE HYPOTHESES GENERATED? Generation and verification of hypotheses
form the core of our method for extracting shape features. The purpose of the hypothesis
generation step is to find a set of promising hypotheses from which the correct hypotheses about
primitives forming the depressions can be selected by the expert consultants. An object
typically consists of several depressions. Each depression is represented by one or more cavity
graphs. To generate hypotheses, the cavity graphs are decomposed into maximal subgraphs
corresponding to the patterns for the primitive features.

Observation 2: There is an ordering $H_1, H_2, \ldots, H_j$ of the local representations of
the primitive features, such that $H_1 \supset H_2 \supset \cdots \supset H_j$.

From the local primitive representations in figure 2:

$$H_{\text{pocket}} \supset H_{\text{blind-slot}} \supset H_{\text{prismatic-hole}} \supset H_{\text{blind-step}} \supset H_{\text{slot}} \supset H_{\text{step}}$$

The ordering introduced by observation 2 is used to assure maximality of each primitive in
a decomposition. We first look for subgraphs of a cavity graph isomorphic to $H_i$, and after all
such subgraphs are found and the corresponding hypotheses are generated, we look for
subgraphs isomorphic to $H_{i+1}$. Two subgraphs representing different hypotheses may have
common vertices and even common edges, but a subgraph is not allowed to represent a hypothesis if
all of its edges (and hence its vertices) are elements of subgraphs for higher order primitives.
This continues until the Cavity Graph is completely decomposed into its maximal constituents.

For the purpose of efficiency, this decomposition is broken down into steps. For each
cavity graph, an ordered set of faces is produced such that faces with more concave edges in their
boundary are ranked first. Faces are taken one at a time from this ordered set of faces and all
consistent and acceptable (with respect to primitive structure and node labels) hypotheses for
primitives are generated, such that the face is the base of the primitive. A hypothesis is
removed from the hypothesis set if its graph is a part of the graph of any previously generated
hypothesis. This ensures maximality in the set of hypotheses. This is repeated until all faces
and edges belong to some hypothesis. The process is performed for each cavity graph, thereby
generating hypotheses for the depressions in the complete object.
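The stepwise decomposition above can be sketched in Python as a loop over ranked faces with a subset test for maximality. This is a minimal sketch, not the authors' implementation: template matching is replaced by a caller-supplied function `candidates_for(base)`, which is assumed to return candidate primitives in the order set by observation 2, and the example cavity graph and candidate table are invented for illustration.

```python
def links_of(*pairs):
    """Build a frozenset of undirected links from (a, b) pairs."""
    return frozenset(frozenset(p) for p in pairs)


def generate_hypotheses(nodes, links, candidates_for):
    """Rank faces by concave-link count and collect maximal hypotheses.

    `candidates_for(base)` yields (kind, base, sides, link_set) tuples.
    """
    degree = {n: sum(1 for l in links if n in l) for n in nodes}
    ranked = sorted(nodes, key=lambda n: (-degree[n], n))
    accepted, covered_links, covered_faces = [], set(), set()
    for base in ranked:
        for kind, b, sides, hyp_links in candidates_for(base):
            hyp_links = frozenset(hyp_links)
            if any(hyp_links <= earlier[3] for earlier in accepted):
                continue  # part of an earlier hypothesis, so not maximal
            accepted.append((kind, b, tuple(sides), hyp_links))
            covered_links |= hyp_links
            covered_faces |= {b, *sides}
        if covered_links >= links and covered_faces >= nodes:
            break  # every face and link belongs to some hypothesis
    return [(kind, b, sides) for kind, b, sides, _ in accepted]


# Invented cavity graph: a pocket (base 1, sides 2-5) whose side 5 is also
# the base of a smaller feature with sides 6 and 7.
pocket = links_of((1, 2), (1, 3), (1, 4), (1, 5), (2, 3), (3, 4), (4, 5), (5, 2))
small = links_of((5, 6), (5, 7), (6, 7))
nodes, links = {1, 2, 3, 4, 5, 6, 7}, set(pocket | small)

table = {
    5: [("blind_step", 5, (6, 7), small)],
    1: [("pocket", 1, (2, 3, 4, 5), pocket),
        ("step", 1, (2,), links_of((1, 2)))],  # contained in the pocket
}
hypotheses = generate_hypotheses(nodes, links, lambda base: table.get(base, []))
print(hypotheses)
```

In this example face 5 is ranked first because it has the most concave links, and the step candidate at base 1 is dropped because its single link already belongs to the pocket hypothesis.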

Figure  5  shows  the  Cavity  Graph  of  a  depression  and  the  set  of  hypotheses  generated  on  it 
using  the  above  outlined  algorithm.  Hypotheses  suggest  the  existence  of  specific  primitive 
features  and  the  faces  comprising  each  one.  Each  hypothesis  is  represented  as  a  list.  The  first 
element  of  the  list  is  a  key  to  the  type  of  primitive  being  hypothesized.  The  other  elements  are 
the  base  of  the  primitive  and  a  list  of  its  side  faces  in  a  predefined  order. 

Although  the  algorithm  stops  when  all  the  faces  and  edges  are  in  some  hypothesis,  keeping 
track  of  what  faces  have  been  used  as  a  base  in  the  ordered  set  of  faces  enables  us  to  follow  our 
enumeration, and hence to generate new hypothesis sets from where we left off without
duplicating the earlier effort.

There are certain properties which are desirable for generators. Winston [15] describes
these  as: 

•  Good  generators  are  complete.  They  eventually  produce  all  possible  solutions. 

•  Good  generators  are  nonredundant.  They  never  damage  efficiency  by  proposing 
the  same  solution  twice. 

•  Good generators are informed. They use possibility-limiting information,
restricting the solutions they propose accordingly.

The following observation shows that our hypothesis generation scheme has the above
properties:

Observation  3:  The  hypothesis  generation  scheme  proposed  above  is  (i)  complete, 

(ii)  nonredundant,  and  (iii)  informed. 
1.  (Pocket Base: 1 Sides: (5 (2 10) (3 11) 4))

2.  (Hole Base: 4 Sides: (6 5 7))

3.  (Hole Base: 4 Sides: (6 5 8))

4.  (Hole Base: 4 Sides: (9 5 8))

5.  (Hole Base: 4 Sides: (9 5 7))

(c)


Figure 5: Hypotheses Generated for a Part

(a): An object with a depression formed by interaction of one pocket and
two polyhedral holes opening into it.

(b): Cavity graph of its depression.
Note that the coplanar face-pairs 2,10 and 3,11 are unifiable.
These faces belong to the pocket primitive, and hence in the cavity
graph (b), one node represents both faces of the unified pair [2,10]
and one node represents the unified pair [3,11].

(c): The set of hypotheses generated for the part.
The 3rd and 5th hypotheses are incorrect.


(i). Completeness is due to the fact that at each face, all primitive templates with
that face as their base are checked, and eventually all faces are used to find
templates based at them.

(ii). The scheme is not redundant because it keeps a history of the hypothesized
primitives, and every new one is checked in step (vi) of the algorithm against
the previous ones. If a hypothesized primitive's set of edges and faces is a
subset of another primitive's set of edges and faces, the hypothesis
corresponding to the subset is discarded.

(iii). The method is informed for three reasons:

(a)  The faces of a depression are ranked.

(b)  The order of search for primitive features is that set by observation 2.

(c)  At any time it is known what faces and edges are hypothesized about.

The first set of hypotheses generated on a Cavity Graph, G, is not necessarily the only set.
If the subsequent analysis shows that the generated hypotheses represent the primitives involved
incorrectly, subsequent hypotheses are generated using a supergraph of the cavity graph. We
describe later how this supergraph is built in the section on the combination of topologic and
geometric evidence. The hypothesis generation process is recursive. The first set of the
hypotheses is formed by maximal subgraphs of the supergraph. Although seldom needed, later
hypothesis sets are produced by decomposing the supergraph into a new set of subgraphs.
These later subgraphs constitute breaking the supergraph into smaller components which are not
maximal. However, for every generated set of hypotheses, $H = \{h_1, h_2, \ldots, h_n\}$, we have:

$$h_1 \cup h_2 \cup \cdots \cup h_n = G,$$

where $h_j$ refers to the local representation for the primitive of hypothesis $j$
($1 \le j \le n$), and $G$ is the cavity graph on which hypotheses are generated.

The hypothesized features must be checked in detail for correctness by the rule-based
consultants. The generated hypotheses are made available on a notice-board to the rule-based experts.

4.1 Elimination of Invalid Hypotheses. The objective of the rule-based experts is to select, from
the proposed set of hypotheses, H, the subset of correct hypotheses, V, which accurately
specifies the primitive features comprising a depression. Rule-based experts examine each
generated hypothesis in detail to decide if the primitive feature suggested by the hypothesis is
valid. Each expert is independent and equipped with knowledge in the form of rules and
procedures. An example rule for a thru-slot looks like this in pseudo-Lisp:

(thru-slot base: B sides: (side1, side2)) :-
    (concave-adjacent base side1),
    (concave-adjacent base side2),
    (opposite (principle-normal-component side1)
              (principle-normal-component side2)),
    (intersect base side1 edge1),
    (intersect base side2 edge2),
    (no-concave-edge-between edge1 edge2),
    (not (adjacent side1 side2)),
    ((concave-adjacent side1 list1) and (length list1 '1)),
    ((concave-adjacent side2 list2) and (length list2 '1)).

The above rule says that the three faces (base, side1, and side2) form a thru-slot if the sides
are concavely adjacent to the base, the sides are almost parallel and have opposing normals,
there is no face concave to the base between side1 and side2, and neither side1 nor side2 is
concavely adjacent to any other face except the base of the slot.
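
To make the structure of such a rule concrete, the following Python sketch evaluates the conjuncts that depend only on adjacency and face labels. It is a simplified illustration, not the rule interpreter of the system: the dictionaries `concave_adj`, `adjacent`, and `label`, the signed-axis label vocabulary, and the face names are all assumptions, and the geometric conjunct `no-concave-edge-between` is omitted because it needs edge geometry.

```python
# Assumed label vocabulary: signed axis of a face's principal normal component.
OPPOSITE = {"+x": "-x", "-x": "+x", "+y": "-y", "-y": "+y", "+z": "-z", "-z": "+z"}


def is_thru_slot(base, side1, side2, concave_adj, adjacent, label):
    """Evaluate the adjacency and label conjuncts of the thru-slot rule."""
    return (
        base in concave_adj[side1]                       # side1 concave to base
        and base in concave_adj[side2]                   # side2 concave to base
        and OPPOSITE[label[side1]] == label[side2]       # opposing normals
        and side2 not in adjacent[side1]                 # sides do not touch
        and concave_adj[side1] == {base}                 # base is the only
        and concave_adj[side2] == {base}                 # concave neighbor
    )


# Hypothetical isolated slot: base B with sides S1 and S2.
concave_adj = {"B": {"S1", "S2"}, "S1": {"B"}, "S2": {"B"}}
adjacent = {"B": {"S1", "S2"}, "S1": {"B"}, "S2": {"B"}}
label = {"B": "+z", "S1": "+x", "S2": "-x"}

slot_ok = is_thru_slot("B", "S1", "S2", concave_adj, adjacent, label)
# Same topology, but the sides are perpendicular rather than opposed.
not_a_slot = is_thru_slot("B", "S1", "S2", concave_adj, adjacent,
                          {**label, "S2": "+y"})
```

The second call fails only on the label conjunct, which shows how the same topology can be accepted or rejected depending on the face labels.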

The  type  field  of  a  hypothesis  is  used  to  search  for  the  rules  applicable.  The  matching  of 
the  types  triggers  a  rule.  The  conjuncts  in  the  condition  are  evaluated  when  a  rule  is  fired.  If  all 
the  conjuncts  succeed,  the  hypothesis  being  tested  is  assumed  correct  and  is  tagged  to  indicate 
so.  Let  us  consider  the  example  shown  in  figure  5  again.  There  is  one  depression  in  the  part 
and  one  cavity  graph  corresponding  to  it.  The  depression  in  the  part  consists  of  a  pocket  with 
two  holes  opening  into  it.  Five  hypotheses  are  generated,  one  for  a  pocket  and  four  for  holes, 
three  of  which  are  correct.  One  of  the  conjuncts  in  the  if  part  of  the  rule  for  holes  states  that  all 
the  faces  of  a  hole  should  be  convexly  adjacent  to  the  same  face.  This  condition  models  the  fact 
that  at  the  open  end  for  a  hole  there  is  a  loop  of  edges  produced  by  the  intersection  of  the  faces 
of  the  hole  with  the  opening  face  of  the  hole.  Applying  this  condition  to  the  four  hole 
hypotheses  of  figure  5  it  becomes  clear  that  the  third  and  the  fifth  hypotheses  are  incorrect 
because  faces  6  and  8  of  the  third  hypothesis  are  not  adjacent  to  the  same  opening  face  and 
similarly  faces  9  and  7  of  the  fifth  hypothesis  are  not  adjacent  to  the  same  opening  face.  These 
hypotheses  are  eliminated. 

5. VIRTUAL LINKS. In the cases where correct conclusions about the primitive features
constituting a depression can be drawn from the original cavity graph of the depression, the
correct hypotheses are found by the experts and a verification step checks the results. In such
cases, there is no need for virtual links. However, another class of problems arises in the
analysis of primitive interactions when the intersection of primitives changes the adjacency
relationships between the faces of a primitive. In this case, the representation of a
primitive in the interaction (the subgraph of the cavity graph induced by the nodes for the faces of
the primitive) does not match the template for the primitive (shown in figure 2), because some
links of the template are not in the induced subgraph. To clarify the problem, let us consider the
example in figure 6. The object in figure 6(a) has a blind_slot (base: 5 sides: (4 3 1)) and a
thru_slot (base: 5 sides: (2 1)). The shape primitives share the same base, but face 3 of the
blind_slot is disconnected from the side face 1. Consequently there is no link between nodes 3
and 1 of the cavity graph, so the subgraph induced by vertices 5, 4, 3, 1 does not match the
template for a blind_slot. In order to generate the correct candidate hypotheses for such parts, it is
necessary to augment the cavity graph with virtual links. For the cavity graph, G, of figure 6(b),
the hypothesis space includes the indicated primitives only if the virtual link, (3 1), between
nodes 3 and 1, is added to G. In general, we do not know a priori how many links are required;
the number of virtual links can be more than one. The most appropriate virtual links are those
whose addition makes the resulting supergraph isomorphic to the union of representations for
the involved primitives.

5.1 Direct Method to Find Virtual Links. The appropriate virtual links for a cavity graph may be
found by approaches including enumeration, partial match, expert system, and uncertainty
reasoning. Enumeration considers each possible supergraph of the cavity graph individually.
Although the desired results may be obtainable with this approach, it has practical limitations
because every subset of the possible links (their power set) must be considered.

We refer to the partial match approach as the direct method. In this method, the partial
matches between the maximal subgraphs of the cavity graph of a depression and the templates for
primitive features are considered as potential hypotheses, and are used to determine which
virtual links to add to the cavity graph. Consider the example of figure 6 again. Node 5
can be chosen as the base to generate the maximal subgraphs of the depression, since it has the
highest number of incident links. Considering different primitives, we observe that if the three
links (4 2), (2 1), and (1 3) are added to the cavity graph it may represent the template for a
pocket, but then the labels do not form a consistent set for a pocket primitive. Next, we observe
that the subgraph induced by nodes 4, 5, 3, and 1 partially matches the template for a blind-slot.
For this partial match to be complete, a link between nodes 3 and 1 of the cavity graph, G, is
required. Proceeding in this fashion, the direct method may select this link as a strong
candidate for a virtual link to be added to the cavity graph. Obviously, more reasoning and tests
must be performed to prove the necessity of this link; however, the direct method identifies
strong candidates.

Figure 6: A Compound Feature whose Cavity Graph is not
the union of the template subgraphs for the primitives involved.

The depression of the object shown in (a) consists of a
blind slot and a thru-slot sharing base-face 5.
However, face 3 of the blind slot does not
intersect face 1 as a result of the blind slot's interaction with the thru-slot.
The cavity graph of the depression is G without the dotted link.
Therefore a thru-slot subgraph, G1, and a blind-slot subgraph, G2,
can be obtained from the Cavity Graph, G, only if the shown dotted
virtual link is added to G.
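
The partial-match reasoning for figure 6 can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the link set `G` is our reading of the cavity graph of figure 6(b) from the text (base 5 linked to faces 1 to 4, plus the link between faces 4 and 3), and `template_links` is a hypothetical helper that models a template as base-to-side links plus an open or closed chain of side faces.

```python
def link(a, b):
    return frozenset((a, b))


def template_links(base, sides, closed):
    """Links of a template: base to every side, plus a chain (or cycle) of sides."""
    links = {link(base, s) for s in sides}
    links |= {link(a, b) for a, b in zip(sides, sides[1:])}
    if closed:
        links.add(link(sides[-1], sides[0]))
    return links


# Assumed cavity graph of figure 6(b), without the dotted virtual link.
G = {link(5, 1), link(5, 2), link(5, 3), link(5, 4), link(4, 3)}

# Links that each partial match would need in order to become complete.
blind_slot_missing = template_links(5, (4, 3, 1), closed=False) - G
pocket_missing = template_links(5, (4, 3, 1, 2), closed=True) - G
```

The blind-slot match lacks only the link between nodes 3 and 1, whereas the pocket match lacks three links, in agreement with the discussion above.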

The direct method of considering partial matches to find virtual links is useful for
reasoning with relatively small cavity graphs, but it cannot be used effectively for complicated cavity
graphs, such as the cavity graphs associated with the example part of figure 12. There are too many
sufficiently close partial matches in these cavity graphs. A better method should take
into account other supporting information, and automatically generate the most useful virtual
links. We show in the next section that a method based on combining geometric and topologic
evidences can be used to achieve this purpose.

6.  DETERMINING  VIRTUAL  LINKS  BY  COMBINING  TOPOLOGIC  AND 
GEOMETRIC  EVIDENCES. 

6.1 Dempster-Shafer Theory and Extraction of Primitive Features. In this section, we introduce
a method to find the virtual links to be added to the cavity graph of a depression by
combining geometric and topologic evidences, and applying a clustering technique. Figure 7 shows
the basic block diagram for this approach. The approach is based on determining the subset of
the virtual links in a cavity graph that should be reconstructed to obtain the supergraph
embedding the most probable primitive features.

As we mentioned earlier, the number of required virtual links is not known in advance.
Let us consider the example object in figure 8(a). Its depression consists of two pockets with
perpendicular axes opening into each other. The base of the first pocket is face 1 (side-faces:
2, 4, 3, and 5), and the other pocket has face 2 as its base (side-faces: 1, 5, 6, and 4). The cavity
graph for the depression is shown in figure 8(b). Because there is neither a link between
nodes 3 and 1, nor a link between nodes 2 and 6, neither the subgraph induced by nodes 1, 2, 3, 4,
and 5, nor the subgraph induced by nodes 1, 2, 4, 5, and 6, matches the template for a pocket. For
the example part of figure 8, two virtual links, (6 2) and (3 1), are needed.

Dempster-Shafer  Theory  [12]  is  particularly  useful  for  our  purposes,  because  of  its  ability 
to  narrow  the  set  of  promising  hypotheses  with  the  accumulation  of  evidence. 

Each virtual link to be added to the cavity graph can be represented by a tuple $(f_i f_j)$.
The set $A = \{(f_{11} f_{12}), \ldots, (f_{i1} f_{i2})\}$, then, is the hypothesis which suggests that the set of $i$ links
$\{(f_{11} f_{12}), \ldots, (f_{i1} f_{i2})\}$ should be added to the cavity graph of a depression.

The hypothesis that contains the set of all possible virtual links in a cavity graph is the set
that is referred to as the frame of discernment, denoted $\Theta$. Let G be the cavity graph of a
depression. The set $\Theta$ is obtained by determining the set of all links required to make G a
complete graph. (A complete graph is a graph that is simple* and in which each pair of distinct nodes is
joined by an edge.) Hence, the set of links in the frame of discernment, $\Theta$, is $E^c(G)$, the
complement of $E(G)$, the set of edges of G, with respect to the set of links connecting all distinct
pairs of nodes of G. The set $\Theta$ for the example part of figure 8 would be:

$$\Theta = \{(f_6 f_3), (f_6 f_1), (f_6 f_2), (f_3 f_1), (f_3 f_2), (f_4 f_5)\}$$
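
As a hedged illustration, the frame of discernment can be computed as the complement of the edge set in a few lines of Python. The nine links listed in `edges` are not given explicitly in the text; they are our reconstruction of the cavity graph of figure 8(b), obtained by removing the six links of the frame of discernment from the fifteen possible pairs of six nodes. The function name is likewise hypothetical.

```python
from itertools import combinations


def frame_of_discernment(nodes, edges):
    """All links missing from the graph, i.e. the complement of E(G)."""
    all_pairs = {frozenset(pair) for pair in combinations(nodes, 2)}
    return all_pairs - {frozenset(e) for e in edges}


# Assumed cavity graph of figure 8(b): six faces, nine concave links.
nodes = range(1, 7)
edges = [(1, 2), (1, 4), (1, 5), (2, 4), (2, 5),
         (3, 4), (3, 5), (4, 6), (5, 6)]

theta = frame_of_discernment(nodes, edges)
```

With these assumed links, `theta` contains the six candidate virtual links listed for the example part of figure 8.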

An underlying assumption of the Dempster-Shafer Theory is that the hypotheses under
consideration are mutually exclusive and exhaustive. These assumptions are satisfied by the set of
hypotheses constituting a frame of discernment for a cavity graph.

Different pieces of information, such as whether the planes containing two faces are
perpendicular or not, form evidences helping support or helping exclude a one-element subset of $\Theta$. The
impact of each evidence is represented by a basic probability assignment (bpa). A bpa assigns a
number, m, in the range [0, 1] to each subset of $\Theta$. Shafer describes a basic probability
assignment as:

    The quantity $m(A)$ is called $A$'s basic probability number, and it is understood to be
    the measure of the belief that is committed exactly to $A$ [12].

This proportion of belief does not imply belief in the subsets of A. The quantity Bel(A), for
some bpa, m, is used to represent the total belief in A, which indicates certainty in A and its
subsets. Shafer further explains:

    ... To obtain the measure of the total belief committed to A, one must add to $m(A)$ the
    quantities $m(B)$ for all proper subsets $B$ of $A$:

$$\mathrm{Bel}(A) = \sum_{B \subseteq A} m(B)$$

*: A graph is simple if it has no loops and no two of its links join the same pair of nodes.

Figure 7: Basic Block Diagram For Finding

Evidences and the bpas associated with each evidence should be selected such that they confirm or
disconfirm what is implied physically by adding the virtual link to the Cavity Graph. Some
relevant information might be: Are the normals to faces $f_i$ and $f_j$ almost perpendicular? If the
addition of $(f_i f_j)$ completes the representation for the maximal primitive, P, are there
indications that P is part of the depression? ... Each of these pieces of information forms the basis for
an evidence helping support or helping exclude a one-element subset of $\Theta$. The bpa, $m_i$,
assigned by evidence $i$ must be chosen such that:

$$\sum_j m_i(A_j) = 1 ; \quad A_j \subseteq \Theta$$

and $m_i(\emptyset) = 0$.

$m_i(\Theta)$ is the quantity of belief that remains unassigned after various amounts of belief are
assigned to all proper subsets of $\Theta$.

After relevant evidences are gathered and the corresponding bpa assignments are made,
the supports for hypotheses are combined. Given two observations corresponding to hypothesis
sets X and Y, with bpa assignments $m_1(X)$ and $m_2(Y)$, we use Dempster's combination rule to
compute a new function that represents the impact of the combined evidence. This function
assigns $m_1(X) m_2(Y)$ to the intersection of X and Y. Since there are typically several subsets of
$\Theta$ whose intersection is the same as that of X and Y, the bpa assignment of the combined
function, denoted $m_1 \oplus m_2$, for every hypothesis set A, is computed from:

$$m_1 \oplus m_2 (A) = \frac{1}{\kappa} \sum_{\substack{i,j \\ X_i \cap Y_j = A}} m_1(X_i)\, m_2(Y_j) ; \quad A \neq \emptyset \qquad (2)$$

$$m_1 \oplus m_2 (A) = 0 ; \quad A = \emptyset$$

where:

$$\kappa = \sum_{\substack{i,j \\ X_i \cap Y_j \neq \emptyset}} m_1(X_i)\, m_2(Y_j)$$

The result of the combination is independent of the order in which the evidences are
gathered and combined, an essential property fulfilled because of the commutativity of
multiplication. Also, the bpas assigned by $m_1 \oplus m_2$ add up to 1, thus satisfying the definition
of a bpa.
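
A minimal Python sketch of the pairwise combination in equation (2) is given below for illustration. Representing a bpa as a dictionary from `frozenset` focal sets to masses is our own choice, and the three-element frame and the names `combine`, `m1`, and `m2` are hypothetical.

```python
from collections import defaultdict


def combine(m1, m2):
    """Dempster's rule for two bpas stored as {frozenset: mass} dictionaries."""
    raw = defaultdict(float)
    for x, mass_x in m1.items():
        for y, mass_y in m2.items():
            raw[x & y] += mass_x * mass_y      # product goes to the intersection
    raw.pop(frozenset(), None)                 # mass on the empty set is conflict
    kappa = sum(raw.values())                  # total non-conflicting mass
    if kappa == 0:
        raise ValueError("the two bpas are completely conflicting")
    return {focal: mass / kappa for focal, mass in raw.items()}


theta = frozenset({"a", "b", "c"})
m1 = {frozenset({"a"}): 0.7, theta: 0.3}       # evidence supporting {a}
m2 = {theta - {"a"}: 0.7, theta: 0.3}          # evidence excluding {a}

m12 = combine(m1, m2)
```

Here the conflicting product 0.7 x 0.7 = 0.49 is discarded, so $\kappa = 0.51$ and the singleton {a} receives 0.21 / 0.51 of the combined belief.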

To combine all supporting evidences, equation (2) is applied repeatedly. Assuming there
are $j$ pieces of evidence, $m_1, m_2, \ldots, m_j$, we use:

$$m = (\ldots((m_1 \oplus m_2) \oplus m_3) \oplus \ldots) \oplus m_j$$

to come up with $m$, the combined bpa for each singleton. If A is a singleton, Bel(A), the total
belief committed to A, is equal to $m(A)$ since the contribution from the bpa of its only proper
subset, $\emptyset$, is zero. Therefore, to determine the virtual links to be added, the combined bpa
$m$ can be used to rank the $i$ singleton subsets of $\Theta$, and cluster the singletons into disjoint
groups.

Each singleton in a cluster has a combined bpa value which is similar to those of the other
singletons in its cluster, and the bpas of the singletons in one cluster are notably different from the
bpas of singletons in any other cluster. The hypotheses associated with the cluster containing
the highest ranked singleton are the hypotheses that correspond to the most probable virtual
links. These links are added to the cavity graph of the depression. The resulting
supergraph is then analyzed using the hypothesis generate-verify method described in previous
sections to find its constituent primitive features. The next example clarifies the method:
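
The text does not name a specific clustering technique, so the following Python sketch uses a simple one as an assumption: singletons are ranked by combined bpa and a new cluster is started whenever the drop to the next value exceeds a threshold. The function name, the threshold of 0.05, and the link names are hypothetical; the scores are the combined bpas reported in the example that follows.

```python
def cluster_by_gap(scores, gap=0.05):
    """Rank items by score and split the ranking wherever the drop exceeds `gap`."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    clusters = []
    previous = None
    for name, value in ranked:
        if previous is not None and previous - value <= gap:
            clusters[-1].append(name)   # close to the previous score: same cluster
        else:
            clusters.append([name])     # large drop (or first item): new cluster
        previous = value
    return clusters


scores = {"f3f1": 0.366, "f6f2": 0.366, "f6f3": 0.110,
          "f6f1": 0.0, "f3f2": 0.0, "f4f5": 0.0}
clusters = cluster_by_gap(scores)
best_links = clusters[0]
```

With this threshold the ranking splits into three groups, and the first group holds the two links with the highest combined bpa.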

Example: Consider the part of figure 8(a). The hypotheses generated using the
cavity graph shown in 8(b) do not model the depression; we already determined
the frame of discernment to be:

$$\Theta = \{(f_6 f_3), (f_6 f_1), (f_6 f_2), (f_3 f_1), (f_3 f_2), (f_4 f_5)\}.$$

Evidences: For clarity, let us consider three simple bpa assignments for links in
$\Theta$: If two faces, $f_i$ and $f_j$, have orthogonal principal normal components, assign:
$m(\{(f_i f_j)\}) = 0.7$, and $m(\Theta) = 0.3$; if two faces, $f_i$ and $f_j$, are convexly adjacent,
assign: $m(\{(f_i f_j)\}^c) = 0.7$, and $m(\Theta) = 0.3$; and if two faces, $f_i$ and $f_j$, are almost
parallel with normals in the opposite direction, assign: $m(\{(f_i f_j)\}^c) = 0.8$, and
$m(\Theta) = 0.2$. There are seven pieces of information applicable to the links in $\Theta$
with the following bpa assignments:

1. $m_1(\{(f_6 f_3)\}) = 0.7$ ; $m_1(\Theta) = 0.3$

2. $m_2(\{(f_6 f_3)\}^c) = 0.7$ ; $m_2(\Theta) = 0.3$

3. $m_3(\{(f_6 f_1)\}^c) = 0.8$ ; $m_3(\Theta) = 0.2$

4. $m_4(\{(f_6 f_2)\}) = 0.7$ ; $m_4(\Theta) = 0.3$

5. $m_5(\{(f_3 f_1)\}) = 0.7$ ; $m_5(\Theta) = 0.3$

6. $m_6(\{(f_3 f_2)\}^c) = 0.8$ ; $m_6(\Theta) = 0.2$

7. $m_7(\{(f_4 f_5)\}^c) = 0.8$ ; $m_7(\Theta) = 0.2$

After combining them, $m(A) = m_1 \oplus m_2 \oplus \cdots \oplus m_7 (A)$, and we get the following
combined bpas for the singleton hypotheses:

$m(\{(f_3 f_1)\}) = 0.366$    $m(\{(f_6 f_2)\}) = 0.366$

$m(\{(f_6 f_3)\}) = 0.110$    $m(\{(f_6 f_1)\}) = 0.0$

$m(\{(f_3 f_2)\}) = 0.0$    $m(\{(f_4 f_5)\}) = 0.0$

Therefore we obtain:

cluster1 = $\{(f_1 f_3), (f_2 f_6)\}$

cluster2 = $\{(f_3 f_6)\}$

cluster3 = $\{(f_3 f_2), (f_4 f_5), (f_6 f_1)\}$

The links from the highest ranked cluster are restored. The resulting supergraph
is shown in Figure 8(c). Based on the graph in 8(c) the following primitive
constituents of the depression can be identified:

((pocket Base:1 sides:(2 4 3 5)) certainty:0.8)

((pocket Base:2 sides:(1 5 6 4)) certainty:0.8)

It is important to note that in assigning the above bpas to the corresponding subsets of $\Theta$,
the actual numbers, e.g. 0.7, do not need to be exact for our purposes because we are not
drawing conclusions from the value of the combined bpas. Conclusions as to what links are to be
added to the Cavity Graph are based on the final ranking of the links of $\Theta$ obtained from
combining evidences.
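
The combined values of the example can be checked with a short Python sketch that applies Dempster's rule to the seven bpa assignments in turn. The code is an illustration under our own assumptions: the dictionary representation of a bpa, the string names for the links, and the helper names `combine`, `support`, and `refute` are hypothetical, and the reading of the seven assignments (support for $(f_6 f_3)$, $(f_6 f_2)$, and $(f_3 f_1)$; evidence against $(f_6 f_3)$, $(f_6 f_1)$, $(f_3 f_2)$, and $(f_4 f_5)$) follows the list above.

```python
from collections import defaultdict
from functools import reduce


def combine(m1, m2):
    """Dempster's rule for two bpas stored as {frozenset: mass} dictionaries."""
    raw = defaultdict(float)
    for x, mass_x in m1.items():
        for y, mass_y in m2.items():
            raw[x & y] += mass_x * mass_y
    raw.pop(frozenset(), None)                 # discard conflicting mass
    kappa = sum(raw.values())
    if kappa == 0:
        raise ValueError("the two bpas are completely conflicting")
    return {focal: mass / kappa for focal, mass in raw.items()}


theta = frozenset(["f6f3", "f6f1", "f6f2", "f3f1", "f3f2", "f4f5"])


def support(link, strength):
    return {frozenset([link]): strength, theta: 1 - strength}


def refute(link, strength):
    return {theta - {link}: strength, theta: 1 - strength}


evidence = [support("f6f3", 0.7), refute("f6f3", 0.7), refute("f6f1", 0.8),
            support("f6f2", 0.7), support("f3f1", 0.7),
            refute("f3f2", 0.8), refute("f4f5", 0.8)]

m = reduce(combine, evidence)
singleton = {link: round(m.get(frozenset([link]), 0.0), 3) for link in sorted(theta)}
```

Under these assumptions the sketch yields 0.366 for the links (3 1) and (6 2), 0.110 for (6 3), and zero for the remaining three singletons, matching the ranking used to form the clusters.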

7. VERIFICATION. Whether the cavity graph or its supergraph is the basis for the
generated hypotheses, the subset of hypotheses selected as the correct primitives modeling the
depression needs to be verified. Selection of a hypothesis by an expert is a temporary
acceptance of that specific hypothesis. In recognizing interacting features, it is important to analyze
the cavity as a whole. To achieve integrity, a selected set of hypotheses, V, must model the
whole depression. This requires all faces and edges of the underlying cavity graph to be
associated with at least one of the primitive features in V. If there are edges or nodes of the Cavity Graph
not associated with any selected hypothesis (dangling edges or nodes), then all hypotheses in V
are disbelieved. This strategy avoids misclassification of a primitive by assigning some of its
topologic entities to a neighboring primitive feature.

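
The dangling-entity test of the verification step can be sketched in Python as follows. This is an illustration only: the helper names are hypothetical, a pocket is modeled as base-to-side links plus a cycle of side faces, and the eleven links of `graph_links` are our reconstruction of the augmented cavity graph of figure 8(c), that is, nine assumed concave links plus the virtual links (3 1) and (6 2).

```python
def link(a, b):
    return frozenset((a, b))


def pocket(base, sides):
    """Nodes and links of a pocket: base to each side, plus a cycle of sides."""
    links = {link(base, s) for s in sides}
    links |= {link(a, b) for a, b in zip(sides, sides[1:] + sides[:1])}
    return {base, *sides}, links


def dangling(graph_nodes, graph_links, selected):
    """Nodes and links of the graph not covered by any selected hypothesis."""
    used_nodes, used_links = set(), set()
    for nodes, links in selected:
        used_nodes |= nodes
        used_links |= links
    return graph_nodes - used_nodes, graph_links - used_links


graph_nodes = set(range(1, 7))
graph_links = {link(a, b) for a, b in
               [(1, 2), (1, 4), (1, 5), (2, 4), (2, 5), (3, 4),
                (3, 5), (4, 6), (5, 6), (1, 3), (2, 6)]}

p1 = pocket(1, (2, 4, 3, 5))
p2 = pocket(2, (1, 5, 6, 4))

both = dangling(graph_nodes, graph_links, [p1, p2])
only_first = dangling(graph_nodes, graph_links, [p1])
```

With both pockets selected nothing is left dangling, so the set is accepted; with only the first pocket, node 6 and its three links remain uncovered and the selection would be disbelieved.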

Recognized  Primitive  Features: 

(verified  (pocket  Base  :  1  Sides  :  (2  4  3  5))  Certainty  :  0.8) 
(verified  (pocket  Base  :  2  Sides  :  (1  5  6  4))  Certainty  :  0.8) 

(d) 

Figure  8:  Adding  virtual  links  to  the  cavity  graph. 

(a): A part with two interacting pockets.

(b): Cavity graph of the depression.
The subgraphs of this cavity graph induced by the faces for each
pocket do not represent a pocket template because the links (3 1)
and (2 6) do not exist in this cavity graph. Specifically the maximal
decomposition of this cavity graph contains representations for two
holes and one blind slot, any combination of which is incorrect.

(c): The cavity graph with virtual links added.
This augmented cavity graph is readily decomposed into two maximal
constituents each representing one desired pocket.

(d): The most probable features extracted from (c).

8. RESULTS AND DISCUSSION. Cavity graphs, one of the representation tools we have
used in this research, are an extension of the attributed adjacency graphs proposed by Joshi [7].
Our cavity graphs not only explicitly represent the convexity-concavity information between
two intersecting faces, but they also explicitly represent the spatial orientation of faces relative
to one another in 3D space. These relative spatial orientations are available in the labels
associated with the cavity graph nodes to directly participate in the matching process. Node labels are
important for the correct classification of primitive features. Their utility can be seen with the
use of a simple synthetic example. Consider the two simple parts shown in figure 9. Their
cavity graphs without the node labels and with the node labels are shown in figures 9(b1-b2) and
9(c1-c2) respectively. There is no difference in the graphs for the features without the node
labels, but object (a) contains two steps and object (b) contains a thru-slot. It is clear that using
the graphs without the face-labelings as the basis for recognition would result in classifying
both features in the same class. However, a quick and rough labeling of the nodes reveals that
the two faces concavely adjacent to the base face are almost perpendicular in object (a) and
almost parallel in object (b).

The advantage of this extended representation is not limited to representing certain
depressions uniquely. It is directly responsible for greatly reducing complexity in the generation of
hypotheses. This improvement in complexity is achieved in two ways. First, only the template
matches which are supported by an acceptable combination of node labels produce a
hypothesis, and secondly, the unification of separate faces into one node reduces the
number of nodes and therefore complexity. The next example part illustrates how these two
considerations work to reduce complexity. In figure 10, five slots interact. The coplanar faces
10, 11, 12, and 13 are produced because of the crossing of the three parallel slots on the left into
the thru-slot in the middle. The thru-slot in the middle consists of seven machining faces: Face
16 is the base; faces 10, 11, 12, and 13 form one side of this slot; and faces 14 and 15 form the
other side. In addition to illustrating how all faces of a primitive, even when they are arbitrarily
split in an interaction, are extracted and incorporated in its description, this example serves to
show how the node labeling mechanism prevents the generation of many incorrect hypotheses.
The only cavity graph for the object is shown in figure 10(b). A blind decomposition of the
cavity graph into primitive templates without regard to node labels produces as many as
$\binom{10}{2} = 45$ slot hypotheses, since there are 45 possible pairings of faces within the cavity graph.
However, using the face labeling mechanism, only the faces with opposite principal normal
components are used to form hypotheses. Hence, only 17 hypotheses are generated. This is a
much smaller subset which may be carefully examined by the experts. The five slots correctly
constituting the interaction are found and verified from among these 17 hypotheses. The
resulting primitives are shown in 10(c).
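
The effect of label filtering on the number of slot hypotheses can be illustrated with the following Python sketch. The labels below are an assumption, not data from the paper: we suppose that the eight side faces 1 to 8 belong to four slots perpendicular to the middle slot, with four faces labeled "+y" and four labeled "-y", and that the two unified side nodes of the middle slot are labeled "+x" and "-x". This assumed labeling was chosen because it is consistent with the counts quoted above.

```python
from itertools import combinations
from math import comb

OPPOSITE = {"+x": "-x", "-x": "+x", "+y": "-y", "-y": "+y"}

# Assumed labels for the ten non-base nodes of the cavity graph of figure 10(b).
labels = {
    "1": "+y", "2": "-y", "3": "+y", "4": "-y",
    "5": "+y", "6": "-y", "7": "+y", "8": "-y",
    "[10,11,12,13]": "+x", "[14,15]": "-x",
}

# Blind matching: every pair of side nodes is a candidate slot.
all_pairs = list(combinations(labels, 2))

# Label-guided matching: keep only pairs whose normals oppose each other.
opposite_pairs = [(a, b) for a, b in all_pairs
                  if OPPOSITE[labels[a]] == labels[b]]

blind_count = len(all_pairs)
guided_count = len(opposite_pairs)
```

Under this labeling the blind count is $\binom{10}{2} = 45$, while the guided count is 4 x 4 + 1 = 17.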

The next example shows the importance of unification for finding machining faces which
have no concave edges. Although these faces are not part of the subgraph of the part induced
by just the concave links, since all of their edges are formed by convex face-intersections, they
must be included in the description of the primitives they are a part of, so that in planning or
recognition tasks correct results are obtained. Figure 11(a1) shows an example part with four
thru-slots. Two of the slots cross each other while the remaining two open into one of the
former slots and are parallel with each other. (a2) and (a3) show magnified views of these two
latter slots. The intersection of the slots causes three of the side faces to break into
disconnected components forming coplanar face pairs 2,3 and 4,5 and 9,13. Note that faces 5 and 13
are adjacent to all their neighboring faces via convex edges. This example shows that
although in a cavity graph a link corresponds to a pair of faces being concavely adjacent,
machining faces with no concave edges are properly identified. Faces 5 and 13 are two such
faces. Face 5, for instance, has no concave edges, but it is one of the side faces of the slot
having base face 14 and side faces 5, 4, 13, 9. There are three cavity graphs associated with this part,
which are shown in figure 11(b). Face 5, as a result of unification, shares the same node with
face 4 in the left cavity graph. Therefore, it is correctly included as a machined surface of the
slot in the surfaces of extracted primitives. The parts giving rise to these types of interactions
cannot be properly handled by previous systems such as [7].

Figure 9: The Need for Face Labeling

Shown in (a1) and (b1) are two simple objects; there is no
difference in their cavity graphs without face labels, (a2)
and (b2); cavity graphs including the face labels (c1) and
(c2) capture the difference.

(verified (thru_slot Base : 16 Sides : ((14 15) (10 11 12 13))) Certainty : 1.0)
(verified (thru_slot Base : 16 Sides : (1 2)) Certainty : 1.0)
(verified (thru_slot Base : 16 Sides : (3 4)) Certainty : 1.0)
(verified (thru_slot Base : 16 Sides : (5 6)) Certainty : 1.0)
(verified (thru_slot Base : 16 Sides : (7 8)) Certainty : 1.0)

(c)

Figure 10: Limiting the Number of Generated Hypotheses

(a) shows an example object with 5 interacting slots. The group of faces
[10,11,12,13] are unifiable. The pair of coplanar faces 14 and 15 are
produced by splitting a larger conceptual face, and are also unifiable.

(b) shows the cavity graph generated by the system for the object. Blind
graph matching would produce as many as 45 possible slot subgraphs from
this cavity graph, but because only appropriate combinations of node labels
are considered, our system generated only 17 hypotheses.

(c) shows the five hypotheses verified to be correct by the system.

(verified (thru_slot Base : 14 Sides : ((5 4) (9 13))) Certainty : 1.0)
(verified (thru_slot Base : 14 Sides : (1 (2 3))) Certainty : 1.0)
(verified (thru_slot Base : 7 Sides : (8 6)) Certainty : 1.0)
(verified (thru_slot Base : 11 Sides : (10 12)) Certainty : 1.0)

(c)

Figure 11: Importance of Unification in Finding Machining Faces with no Concave Edges

The part shown in (a1) has four thru-slots. Note that the coplanar face
pairs [2,3], [5,4], and [9,13] are unifiable since they have been split
as a result of interaction of slots. (a2) and (a3) show a magnified view of
the two slots which are parallel with each other.

Also note that faces 5 and 13 are adjacent to all their neighboring faces
via convex edges. Although methods such as those in [7] do not classify
these faces as part of shape primitives, the methods proposed in this paper
use unification to capture these faces in the proper shape primitive.

(b) shows the cavity graphs for the part in (a1), and (c) shows the
features  extracted  by  the  prototyped  system. 
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In  this  section  we  discussed  some  sample  results  which  will  shine  light  on  u.w  hxoV  antages 
obtained  using  the  methods  proposed  in  this  work  in  comparison  with  the  previous  existing  sys¬ 
tems.  The  primitives  in  these  test  parts  are  not  successfully  extracted  by  previous  systems. 

9.1 Extension to Non-Perpendicular Face Intersections. In this section we give a
straightforward extension of the methods to accommodate identification and extraction of the
primitives in parts in which the intersecting faces are not necessarily at right angles to each
other. As we mentioned earlier, the primitive representations of figure 2 each represent a family
of primitives which are obtained by varying the face-face intersection angles over all the allowed
values in the respective convex or concave range that are physically realizable. In order to find
the cavity graphs for a part with inclined faces, the nodes must be labeled appropriately. As
before, the node corresponding to the face with the greatest number of concave edges is selected
to be the base and is labeled B. A set of local coordinates, $(x_l, y_l, N_b)$, for the depression
is constructed from the unit normal vector to the base, $N_b$, together with two other unit
vectors, $x_l$ and $y_l$, such that the three are mutually orthogonal. For any other non-base
node, the unit normal, $N_f$, of the corresponding face(s), $f$, of the depression can be
decomposed into:

\[ N_f = n_1\,N_b + n_{x_l}\,x_l + n_{y_l}\,y_l \]

The dominant component of $N_f$ determines the label for the face, $f$.
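
The labeling rule above can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `label_face`, the helper `dot`, and the label strings ("+N", "-X", and so on) are hypothetical and are not taken from the prototyped system; the local frame is assumed to be orthonormal.

```python
import math

def dot(a, b):
    """Dot product of two 3-vectors given as tuples."""
    return sum(p * q for p, q in zip(a, b))

def label_face(normal, n_base, x_local, y_local):
    """Label a face by the dominant component of its unit normal.

    The normal is projected on the local frame (N_b, x_l, y_l); the axis
    with the largest absolute component, together with its sign, gives
    the label. The label strings are illustrative assumptions.
    """
    components = {
        "N": dot(normal, n_base),
        "X": dot(normal, x_local),
        "Y": dot(normal, y_local),
    }
    axis = max(components, key=lambda name: abs(components[name]))
    sign = "+" if components[axis] >= 0.0 else "-"
    return sign + axis

# Local frame of a depression whose base normal points along z.
n_base = (0.0, 0.0, 1.0)
x_local = (1.0, 0.0, 0.0)
y_local = (0.0, 1.0, 0.0)

# A side face tilted 20 degrees away from the vertical.
tilt = math.radians(20.0)
inclined = (math.cos(tilt), 0.0, math.sin(tilt))
print(label_face(inclined, n_base, x_local, y_local))
```

Here the inclined side face still receives the same label as a perpendicular side face would, which is the behavior the extension relies on.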

After the construction of the cavity graphs, geometric reasoning by hypothesis
generation-elimination and addition of virtual links can be carried out as described in previous
sections. Our next example, shown in figure 12, demonstrates the extraction of shape primitives
from a part with non-perpendicular intersecting faces. It is a little more complicated than the
previous examples, but it illustrates the potential of the proposed methods for handling fairly
complex interactions that arise in machined parts. The part in figure 12(a1) has twelve features:
five steps, one blind slot, two thru-slots, three pockets, and one prismatic-hole. To observe some
of the difficulties one may note the following: The coplanar faces 1, 30, and 31 are shared by two
thru-slots and one blind slot. The blind-slot and one of the thru-slots (the one formed by side
faces 4 and 5) have the same axis so that the end-face 3 of the blind-slot is disconnected from its
side-face 5. The other thru-slot (with side-faces 6 and 7) crosses the blind-slot, splitting its
side into coplanar faces 33 and 2. Both side-faces of the blind-slot intersect its base
non-orthogonally. The cavity opened in the middle of the blind-slot is shown separately for clarity
in figure 12(a2). There are two pockets, one with base-face 12 and the other with base-face 14,
which have the same axis and share the side-faces 8, [10,32], and 11. These two pockets open
into each other. A third pocket, which has an axis perpendicular to the first two pockets, opens
into the bottom of the second one (this pocket has face 8 for its base). Note that the base-face of
none of the pockets intersects all of its sides. In the pocket with base 12, there is no
intersection between the base and side-face [10,32]; in the pocket with base 14, there is no
intersection between the base and side-face 11; and in the pocket with base 8, there is no
intersection between the base and side-face 15. Therefore, methods based on topology alone [5,7]
or on generic rules [6] cannot capture these classes of interactions. Finally, it is worth noting
that the prismatic-hole on the right with faces 18, 17, 8, and 11 opens at an inclined angle into
the pockets. Generally this type of interaction is difficult for subtractive methods to handle.

The depressions in this part are represented by seven cavity graphs. Figures (b1)-(b7)
show the seven cavity graphs finally constructed by our system. Note that each cavity graph has
its own local coordinates and, for example, the local coordinates for cavity graph (b6) are
different from those of cavity graph (b7) as shown in the figure. We also observe that the system
automatically constructed the four links (3 5), (15 8), (12 [10,32]), and (14 11), which are
shown with dotted line segments in figure 12(b). The first added link completes the
representation for the blind-slot primitive, (base: 1,30,31 sides: 2,33,3,5), and the other three
links are used to extract the three pockets, one with base-face 12, one with base-face 14, and the
other with face 8 as its base. Without these links, these primitives would not be present in the
hypothesis space of the problem and could not be considered. The primitives extracted
and classified by the system are shown in figure 12(c).
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Figure  12:  Recognizing  Primitives  with  Inclined  Faces 

(a)  shows  a  part  with  inclined  faces. 

(b1)-(b7) show the seven cavity graphs
with added virtual links for its depressions.

(c)  shows  the  primitive  features  recognized  by  the  system. 


Extracted Features:

(verified (step Base : 21 Sides : (20)) Certainty : 1.0)
(verified (step Base : 23 Sides : (22)) Certainty : 1.0)
(verified (step Base : 25 Sides : (24)) Certainty : 1.0)
(verified (blind_slot Base : (1 30 31) Sides : ([2,33] 3 5)) Certainty : 0.8)
(verified (thru_slot Base : (1 30 31) Sides : (4 5)) Certainty : 1.0)
(verified (thru_slot Base : (1 30 31) Sides : (6 7)) Certainty : 1.0)
(verified (pocket Base : 12 Sides : (8 9 [10,32] 11)) Certainty : 0.8)
(verified (pocket Base : 8 Sides : (14 [10,32] 13 15)) Certainty : 0.8)
(verified (prismatic_hole Base : 18 Sides : (8 7 11)) Certainty : 1.0)
(verified (step Base : 27 Sides : (26)) Certainty : 1.0)
(verified (step Base : 28 Sides : (29)) Certainty : 1.0)
(verified (pocket Base : 14 Sides : (13 11 [10,32] 8)) Certainty : 0.8)

(c)
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10.  CONCLUDING  REMARKS.  We  have  described  methods  and  algorithms  to  extract  a  wide 
range of interacting features and find their constituent primitive machining features. We have
discussed both fundamental concepts and practical implementations. The recognition method is
based on Cavity Graphs, which provide a topologic and geometric description of the depressions
in the object boundary. Concepts have also been developed to combine topologic and
geometric evidence to extract the most probable features of a depression. This method enables
us to identify the constituent primitive features when the topologic relationships between the
faces of a primitive in interaction do not include all the relationships found in an isolated
primitive. The proposed representations carry more geometric information than other methods, thus
allowing a more accurate machine understanding of the shape of the part.

Although the proposed method successfully analyzes the feature content of many parts,
much work still needs to be done to automate the interpretation of engineering designs. A
fundamental project would be to keep a record of the design session, and use this incremental
information together with the CAD representation of the designed part rather than using only the
CAD representation for automatic extraction of the machining features in the part.
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ABSTRACT. 

A new class of finite difference schemes, using the multigrid iterative method, is
presented for solving the time-dependent, incompressible Navier-Stokes equations. The
schemes are quite flexible, maintaining second-order accuracy with overlapping grids in
domain decomposition and with irregular grid systems. A variant of this method may be
used to solve the steady-state equations.

The  multigrid  iteration  procedure,  although  not  essential  to  the  method,  provides 
a  dramatic  speed-up  over  the  use  of  iterative  methods  such  as  S.O.R.  As  is  typical  of 
multigrid  methods,  the  work  per  grid  point  is  essentially  independent  of  the  finest  grid 
spacing. 

Results are presented of the application of one of the schemes to the computation of
the two-dimensional flow past a rectangle in a channel. The computation makes use of
overlapping grid domains. Numerical tests are presented demonstrating the second-order
accuracy of the method.


1.  INTRODUCTION. 

In this paper, a class of finite difference schemes for the time-dependent incompressible
Stokes and Navier-Stokes equations is presented. The significant features of these
schemes are that they are second-order accurate, they use standard grids, that is, not
staggered grids, and the schemes retain their second-order accuracy when used with
non-uniform and non-orthogonal grids. In addition, the schemes are unconditionally stable.
Because the schemes are inherently implicit, iterative methods are used to solve for the
solution at the next time level. The method presented here is a multigrid method, but
other methods may also be used.

For  ease  of  exposition  the  Stokes  equations  are  used  to  introduce  the  basic  schemes. 
Later,  in  section  4  it  is  shown  how  to  treat  the  additional  terms  of  the  Navier-Stokes 
equations. 

*  Supported  by  the  U.S.  Army  Research  Office 
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2.  THE  FINITE  DIFFERENCE  SCHEMES. 

The  time-dependent  Stokes  equations  are: 


\[ \vec{u}_t - \nabla^2 \vec{u} + \nabla p = \vec{f} \tag{2.1a} \]

\[ \nabla \cdot \vec{u} = g \tag{2.1b} \]


The vector function $\vec{u}$ is the velocity, and the scalar function $p$ is the pressure. The
functions $\vec{f}$ and $g$ are considered to be given data. Notice that the pressure appears only
in (2.1a) and only in terms of its spatial derivatives. In most problems the function $g$
is identically zero, but we include the general case because it fits in naturally with the
multigrid iteration method.

The equations (2.1) hold in some domain $\Omega$, and to specify a unique solution, boundary
conditions must be given. The simplest conditions are to specify the velocity $\vec{u}$ on the
boundary, i.e.,

\[ \vec{u} = \vec{b} \quad \text{on } \partial\Omega. \tag{2.2} \]

This  is  called  the  Dirichlet  boundary  condition. 

The  data  g  in  (2.1b)  and  b  in  (2.2)  must  satisfy  a  constraint  in  order  for  a  solution  to 
exist.  Using  integration  by  parts  we  have 


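\[ \int_{\Omega} g = \int_{\Omega} \nabla \cdot \vec{u} = \int_{\partial\Omega} \vec{u} \cdot \vec{n} = \int_{\partial\Omega} \vec{b} \cdot \vec{n} \tag{2.3} \]

where $\vec{n}$ is the outer unit normal to $\partial\Omega$. For boundary conditions other than (2.2) there
may or may not be a constraint.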

Initial data is specified for the velocity and the pressure

\[ \vec{u}(0, x, y) = \vec{u}_0(x, y), \qquad p(0, x, y) = p_0(x, y). \tag{2.4} \]

The initial data should be consistent with the equations (2.1), i.e.,

\[ \nabla \cdot \vec{u}_0 = g(0, x, y), \]

and

\[ g_t - \nabla^2 g + \nabla^2 p_0 = \nabla \cdot \vec{f}. \]

For  an  introduction  to  the  theory  of  the  Navier-Stokes  equations  we  refer  to  [6]. 

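The first finite difference scheme is based on the Crank-Nicolson scheme for the heat
equation. We let $t_n = n\Delta t$, and for a cartesian grid, let $x_\ell = \ell\Delta x$ and $y_m = m\Delta y$. We
first discretize only in time

\[
\begin{aligned}
\frac{\vec{u}^{n+1} - \vec{u}^{n}}{\Delta t} - \tfrac{1}{2}\left(\nabla^2 \vec{u}^{n+1} + \nabla^2 \vec{u}^{n}\right) + \tfrac{1}{2}\left(\nabla p^{n+1} + \nabla p^{n}\right) &= \vec{f}^{\,n+1/2} + O(\Delta t^2) \\
\nabla \cdot \vec{u}^{n+1} &= g^{n+1}
\end{aligned}
\tag{2.5}
\]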
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The  Laplacian  is  discretized  using  the  standard  five-point  Laplacian 


\[ \nabla^2 u = \frac{u_{\ell+1,m} - 2u_{\ell,m} + u_{\ell-1,m}}{\Delta x^2} + \frac{u_{\ell,m+1} - 2u_{\ell,m} + u_{\ell,m-1}}{\Delta y^2} + O(\Delta x^2 + \Delta y^2) \]

The discretization of the gradient of the pressure in (2.1a) and the divergence of the
velocity in (2.1b) must be done so as to ensure smoothness or regularity of the solution. Since
the theory for regularity of solutions to finite difference schemes for the time-dependent
Stokes equations is not developed, we use the theory for the steady Stokes equations as a
guide. Considering (2.1) on all of $R^n$, we may transform equation (2.1) with the Fourier
transform in space and the Laplace transform in time, obtaining for the case of two space
dimensions


3  +  |uq 
0 


s  +  M 


=  I/2 


where $|\omega|^2 = \omega_1^2 + \omega_2^2$. The dual variables are $s$, for the Laplace transform, and $\omega = (\omega_1, \omega_2)$
for the Fourier transform.

The regularity, or smoothness properties, of (2.1) depend on the behavior of the inverse
of the matrix, which we call $S$, on the left of (2.7). The matrix $S$ is the symbol of the
Stokes equation (2.1). In particular, the determinant of $S$ is

\[ \det S = -(s + |\omega|^2)\,|\omega|^2 \]

and is bounded well away from zero as $|\omega|$ and $\mathrm{Re}\, s$ become large.

We  now  consider  the  symbol  of  finite  difference  schemes  for  the  Stokes  equations, 
see  [5].  We  begin  by  considering  central  difference  operations  for  both  the  gradient  and 
divergence  operators  in  (2.5).  The  symbol  is 

/  „  1  sin  £1  Ax  \ 


/sinfiAx  /sin  £2  Ay 


Ax 

/  sin  if 2  Ay 

^ y 


-1  ( sin'  1  Ax/2  siirf2Ay/2 

—  +  4«  r— : +  :  ^ 


The determinant of $S_1$ is

\[ \det S_1 = -\sigma \left( \frac{\sin^2 \xi_1 \Delta x}{\Delta x^2} + \frac{\sin^2 \xi_2 \Delta y}{\Delta y^2} \right) \]

The ranges of $\xi_1$ and $\xi_2$ are given by $|\xi_1|\Delta x \le \pi$ and $|\xi_2|\Delta y \le \pi$. Note that $\det S_1$
vanishes for $|\xi_1|\Delta x$ and $|\xi_2|\Delta y$ equal to $\pi$. The vanishing of the determinant of the symbol
for certain frequencies implies that these frequencies cannot be determined by the data.
This statement applies as well for domains other than $R^n$.

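In place of central differences we consider the regularized differences introduced in [2].
These are:

\[ \frac{\partial p}{\partial x} \approx \delta_{x+} p = \frac{p_{\ell+1,m} - p_{\ell-1,m}}{2\Delta x} - \frac{p_{\ell+2,m} - 3p_{\ell+1,m} + 3p_{\ell,m} - p_{\ell-1,m}}{6\Delta x} \tag{2.8a} \]

\[ \frac{\partial p}{\partial y} \approx \delta_{y+} p = \frac{p_{\ell,m+1} - p_{\ell,m-1}}{2\Delta y} - \frac{p_{\ell,m+2} - 3p_{\ell,m+1} + 3p_{\ell,m} - p_{\ell,m-1}}{6\Delta y} \tag{2.8b} \]

\[ \frac{\partial u}{\partial x} \approx \delta_{x-} u = \frac{u_{\ell+1,m} - u_{\ell-1,m}}{2\Delta x} - \frac{u_{\ell+1,m} - 3u_{\ell,m} + 3u_{\ell-1,m} - u_{\ell-2,m}}{6\Delta x} \tag{2.8c} \]

\[ \frac{\partial v}{\partial y} \approx \delta_{y-} v = \frac{v_{\ell,m+1} - v_{\ell,m-1}}{2\Delta y} - \frac{v_{\ell,m+1} - 3v_{\ell,m} + 3v_{\ell,m-1} - v_{\ell,m-2}}{6\Delta y} \tag{2.8d} \]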

Notice  that  the  additional  regularizing  terms  are  third-order  divided  differences  which  are 
shifted  forward  for  the  pressure  and  shifted  backward  for  the  velocity  derivatives.  At 
grid  points  near  the  boundaries,  where  the  approximations  (2.8)  can  not  be  applied,  the 
third-order  difference  is  shifted  the  other  way. 
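
As a rough check of the regularized differences, the sketch below applies the forward-shifted and backward-shifted operators to a smooth test function and compares the error at two step sizes. The function names `delta_x_plus` and `delta_x_minus`, the test function, and the step sizes are assumptions chosen for illustration; the operators act on a callable rather than on grid arrays to keep the example short.

```python
import math

def delta_x_plus(p, x, h):
    """Central difference minus a forward-shifted third difference (pressure form)."""
    central = (p(x + h) - p(x - h)) / (2.0 * h)
    third = (p(x + 2.0 * h) - 3.0 * p(x + h) + 3.0 * p(x) - p(x - h)) / (6.0 * h)
    return central - third

def delta_x_minus(u, x, h):
    """Central difference minus a backward-shifted third difference (velocity form)."""
    central = (u(x + h) - u(x - h)) / (2.0 * h)
    third = (u(x + h) - 3.0 * u(x) + 3.0 * u(x - h) - u(x - 2.0 * h)) / (6.0 * h)
    return central - third

x0 = 1.0
exact = math.cos(x0)  # derivative of sin at x0

# Errors at step h and h/2; their ratio indicates the observed order.
err_plus = [abs(delta_x_plus(math.sin, x0, h) - exact) for h in (0.1, 0.05)]
err_minus = [abs(delta_x_minus(math.sin, x0, h) - exact) for h in (0.1, 0.05)]
ratio_plus = err_plus[0] / err_plus[1]
ratio_minus = err_minus[0] / err_minus[1]
print(ratio_plus, ratio_minus)
```

A ratio of at least about 4 when the step is halved is what second-order accuracy requires, so the added third-difference terms do not degrade the accuracy of the central differences for this test function.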

For  the  scheme  using  the  regularized  differences  (2.8)  the  symbol  is 


n  0  1(^)0 

0  (7  1^2(1 

i Ci  >C-2  o 


where

\[ \zeta_1 = \frac{\sin \xi_1 \Delta x}{\Delta x} + \frac{4}{3}\, e^{i\xi_1 \Delta x/2}\, \frac{\sin^3 (\xi_1 \Delta x/2)}{\Delta x}, \qquad
\zeta_2 = \frac{\sin \xi_2 \Delta y}{\Delta y} + \frac{4}{3}\, e^{i\xi_2 \Delta y/2}\, \frac{\sin^3 (\xi_2 \Delta y/2)}{\Delta y}. \]

The determinant of $S_2$ is

\[ \det S_2 = -\sigma \left( |\zeta_1|^2 + |\zeta_2|^2 \right). \]

Since $\zeta_1$ and $\zeta_2$ do not vanish for $0 < |\xi_1|\Delta x \le \pi$ and $0 < |\xi_2|\Delta y \le \pi$, we see that the
determinant of $S_2$ does not vanish except for $\xi_1 = \xi_2 = 0$, the same as for the symbol of
the Stokes equations (2.1). Thus, it seems likely, based on the theory for finite difference
approximations of the steady Stokes equation, that the solutions of the scheme using the
regularized differences (2.8) will be smooth.
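
The contrast between the two discretizations can be checked numerically. The sketch below evaluates the central-difference symbol $\sin\theta/\Delta x$ and the symbol of the forward-shifted regularized difference over $0 < \theta \le \pi$, where $\theta = \xi\Delta x$. The regularized symbol used here is derived from the difference formula itself (central part plus $\tfrac{4}{3} e^{i\theta/2}\sin^3(\theta/2)/\Delta x$); the function names and the grid spacing are illustrative assumptions.

```python
import cmath
import math

def central_symbol(theta, h):
    """Symbol (without the factor i) of the central difference."""
    return math.sin(theta) / h

def regularized_symbol(theta, h):
    """Symbol (without the factor i) of the forward-shifted regularized difference."""
    half = 0.5 * theta
    return (math.sin(theta) + (4.0 / 3.0) * cmath.exp(1j * half) * math.sin(half) ** 3) / h

h = 0.1
thetas = [math.pi * k / 200 for k in range(1, 201)]  # 0 < theta <= pi

central_at_pi = abs(central_symbol(math.pi, h))
min_regularized = min(abs(regularized_symbol(t, h)) for t in thetas)
regularized_at_pi = abs(regularized_symbol(math.pi, h))
print(central_at_pi, min_regularized, regularized_at_pi)
```

The central symbol is zero (to rounding) at the highest grid frequency, while the regularized symbol stays away from zero on the whole sampled range and has magnitude $4/(3\Delta x)$ at $\theta = \pi$.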

The Crank-Nicolson scheme for the heat equation is not dissipative unless the
ratios $\Delta t/\Delta x^2$ and $\Delta t/\Delta y^2$ are bounded, e.g., [0]. The scheme based on (2.5) is also not
dissipative and this also affects the smoothness of the solutions of the finite difference
scheme. Note that dissipativity is related to the behavior of the symbol as $\mathrm{Re}\, s \to 0$ with
$|\mathrm{Im}\, s|\Delta t \le \pi$. Thus the determinant of $S_1$ vanishes for $s\Delta t = i\pi$, and this is related to the
non-dissipativity of the scheme.

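The next two schemes for the Stokes equations are both dissipative schemes. The first
of these uses the second-order backward difference in time, and is

\[
\begin{aligned}
\frac{3\vec{u}^{n+1} - 4\vec{u}^{n} + \vec{u}^{n-1}}{2\Delta t} - \nabla_h^2 \vec{u}^{n+1} + \nabla_h p^{n+1} &= \vec{f}^{\,n+1} \\
\nabla_h \cdot \vec{u}^{n+1} &= g^{n+1}
\end{aligned}
\tag{2.9}
\]

Scheme (2.9) is second-order accurate in both time and space. Using the regularized
differences (2.8), the symbol of the scheme is non-singular for $\xi$ and $s$ nonzero, and in the
ranges $|\xi_1|\Delta x \le \pi$, $|\xi_2|\Delta y \le \pi$, and $|\mathrm{Im}\, s|\Delta t \le \pi$.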

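The third scheme to be considered here may be regarded either as a second-order
accurate discretization about the time level $(n + 2/3)\Delta t$ or as a weighted average of the
first two schemes. Taking 2/3 of scheme 1 and 1/3 of scheme 2 we arrive at scheme 3

\[
\begin{aligned}
\frac{7\vec{u}^{n+1} - 8\vec{u}^{n} + \vec{u}^{n-1}}{6\Delta t} - \nabla_h^2\left(\tfrac{2}{3}\vec{u}^{n+1} + \tfrac{1}{3}\vec{u}^{n}\right) + \nabla_h\left(\tfrac{2}{3}p^{n+1} + \tfrac{1}{3}p^{n}\right) &= \vec{f}^{\,n+2/3} \\
\nabla_h \cdot \vec{u}^{n+1} &= g^{n+1}.
\end{aligned}
\tag{2.10}
\]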

Symbols for these two schemes do not vanish for $\mathrm{Re}\, s > 0$ and $|\xi| > 0$. Schemes 2 and
3 are both dissipative, that is, the roots of the symbol satisfy

,C|  <  1  -  CAI  ( 

\  Ar2  Ay-  ) 

for  some  positive  constant  c. 




3. THE MULTIGRID ALGORITHM.

Each of the three schemes discussed in the previous section is implicit and requires
some means to solve the linear system of equations for the velocity and pressure at the
next time level. Each of these systems can be written in the form


We present here an iterative method using multigrid techniques to solve systems of the
form (3.1). The general multigrid method, see [1], involves using a sequence of grids with a
smoothing operator on each grid and methods for mapping functions between neighboring
grids. The smoothing operator should be designed to reduce the amplitude of those
modes of the error that are of highest frequency on that grid.

For the results reported in this paper the grids had $\Delta x = L_1/2^n$ and $\Delta y = L_2/2^n$ for
some integer $n$ on a rectangle with sides of length $L_1$ and $L_2$ in the $x$ and $y$ directions,
respectively. The sequence of coarser grids was defined by $\Delta x = L_1/2^j$ and $\Delta y = L_2/2^j$
for $j = n-1, n-2, \ldots, 1$.

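On each grid the basic smoother for the velocity was a Gauss-Seidel iteration on (3.1a)
using the checkerboard ordering. At each point the smoothing step is given by

\[ \vec{u}_{\ell,m} \leftarrow \vec{u}_{\ell,m} - d\left(\vec{u}_{\ell,m} - \alpha\Delta t\,\nabla_h^2 \vec{u}_{\ell,m} + \alpha\Delta t\,\nabla_h p_{\ell,m} - \vec{F}_{\ell,m}\right) \tag{3.2} \]

where $d = (1 + 2\alpha\Delta t(1/\Delta x^2 + 1/\Delta y^2))^{-1}$. Note that $d^{-1}$ is the coefficient of $\vec{u}_{\ell,m}$ on the
right-hand side of (3.1a).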
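
A checkerboard Gauss-Seidel sweep of this kind can be sketched for a simplified model problem. The example below is an assumption-laden illustration, not the paper's code: it treats a single scalar unknown, drops the pressure-gradient term, uses zero boundary values on a square grid, and solves $(I - \alpha\Delta t\,\nabla_h^2)u = F$. The names `checkerboard_sweep`, `residual_norm`, and `a_dt` (standing for $\alpha\Delta t$) are hypothetical.

```python
def laplacian(u, l, m, dx, dy):
    """Five-point Laplacian of the grid function u at interior point (l, m)."""
    return ((u[l + 1][m] - 2.0 * u[l][m] + u[l - 1][m]) / dx ** 2
            + (u[l][m + 1] - 2.0 * u[l][m] + u[l][m - 1]) / dy ** 2)

def residual_norm(u, F, a_dt, dx, dy):
    """Maximum norm of the residual of (I - a_dt * Laplacian) u = F."""
    n = len(u) - 1
    return max(abs(u[l][m] - a_dt * laplacian(u, l, m, dx, dy) - F[l][m])
               for l in range(1, n) for m in range(1, n))

def checkerboard_sweep(u, F, a_dt, dx, dy):
    """One Gauss-Seidel sweep: first the points with l+m even, then l+m odd."""
    n = len(u) - 1
    d = 1.0 / (1.0 + 2.0 * a_dt * (1.0 / dx ** 2 + 1.0 / dy ** 2))
    for colour in (0, 1):
        for l in range(1, n):
            for m in range(1, n):
                if (l + m) % 2 == colour:
                    r = u[l][m] - a_dt * laplacian(u, l, m, dx, dy) - F[l][m]
                    u[l][m] -= d * r  # same form as the smoothing step above

n = 8
dx = dy = 1.0 / n
a_dt = 0.05
F = [[1.0] * (n + 1) for _ in range(n + 1)]
u = [[0.0] * (n + 1) for _ in range(n + 1)]

r0 = residual_norm(u, F, a_dt, dx, dy)
for _ in range(50):
    checkerboard_sweep(u, F, a_dt, dx, dy)
r1 = residual_norm(u, F, a_dt, dx, dy)
print(r0, r1)
```

Within a multigrid cycle only a few such sweeps would be applied per grid; the fifty sweeps here serve only to confirm that the update converges on this small model problem.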

After the velocity was smoothed once with the Gauss-Seidel iteration, the pressure
was smoothed by the operation

\[ p_{\ell,m} \leftarrow p_{\ell,m} - \gamma\left(\nabla_h \cdot \vec{u}_{\ell,m} - g_{\ell,m}\right) \tag{3.3} \]

The parameter $\gamma$ in (3.3) was determined in several ways; this is discussed in more detail
later.

The  overall  smoother  for  the  solution  consisted  of  two  steps,  each  of  which  consisted 
of  a  velocity  smoothing  and  a  pressure  smoothing. 

The  prolongation  from  a  coarse  grid  to  a  finer  grid  was  bi-quadratic  interpolation 
and  the  restriction  from  a  fine  grid  to  a  coarse  grid  was  the  adjoint  of  the  bi-quadratic 
interpolation  operator. 

On the coarsest grid, with $\Delta x = L_1/2$ and $\Delta y = L_2/2$, there is one equation for each
of the velocity components and the pressure gradients were set to zero.

In analyzing the behaviour of iterative methods for the system (3.1) it must be kept
in mind that the velocity and pressure are coupled in an elliptic system. Thus, modes of
the pressure error affect the velocity error through equation (3.2) and this in turn affects
the pressure via equation (3.3). The overall effect must be to reduce the error in both the
velocity and the pressure.

It was found that the velocity errors were reduced quite rapidly by the multigrid
iterations, taking only a couple of V-cycles to reduce the error below the desired tolerance.
However, the pressure errors were reduced more slowly. Why this is so is the subject of
continuing investigation.

The parameter $\gamma$ in (3.3) was chosen to depend on the grid spacing. The formula is
derived as follows. A change of $\epsilon$ in the pressure at one grid point contributes a change
to the velocity at neighboring grid points through (3.2) proportional to $\alpha d\,\Delta t\,\epsilon/\Delta x$ or
$\alpha d\,\Delta t\,\epsilon/\Delta y$. The proportionality depends on the relative locations of the points, but is
independent of the grid spacing. The changes in the velocity result in changes in the
pressure through (3.3) proportional to $\alpha \gamma d\,\Delta t\,\epsilon\,(1/\Delta x^2 + 1/\Delta y^2)$. This indicates that $\gamma$
should be given by

\[ \gamma = c_0 \left(1 + 2\alpha\Delta t\,(1/\Delta x^2 + 1/\Delta y^2)\right)\left(\Delta t/\Delta x^2 + \Delta t/\Delta y^2\right)^{-1} \tag{3.4} \]

for some value of $c_0$.
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4.  NAVIER-STOKES  EQUATIONS. 

The  incompressible  Navier-Stokes  equations  are 

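\[ \vec{u}_t - \nabla^2 \vec{u} + \nabla \vec{u}\,\vec{u}^{T} + \nabla p = \vec{f} \tag{5.1a} \]

\[ \nabla \cdot \vec{u} = g \tag{5.1b} \]

(The superscript $T$ on $\vec{u}$ denotes transpose.)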

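The modifications to the schemes so as to include the nonlinear convection terms can
be done so as to maintain the linearity of the equations that must be solved at each time
step. To show how this is done, consider the first scheme (2.5) with the addition of the
convection terms. We have

\[ \frac{\vec{u}^{n+1} - \vec{u}^{n}}{\Delta t} - \tfrac{1}{2}\left(\nabla^2 \vec{u}^{n+1} + \nabla^2 \vec{u}^{n}\right) + \nabla \vec{u}^{n+1/2}\,\vec{u}^{n+1/2,T} + \tfrac{1}{2}\left(\nabla p^{n+1} + \nabla p^{n}\right) = \vec{f}^{\,n+1/2} + O(\Delta t^2). \]

By differencing the convection term as

\[ \nabla \vec{u}^{n+1/2}\,\vec{u}^{n+1/2,T} = \tfrac{1}{2}\nabla \vec{u}^{n+1}\,\vec{u}^{n,T} + \tfrac{1}{2}\nabla \vec{u}^{n}\,\vec{u}^{n+1,T} + O(\Delta t^2) \]

the second-order accuracy is maintained and the system for $\vec{u}^{n+1}$ is linear.

Similarly, scheme 2, i.e., (2.9), is modified by the approximation

\[ \nabla \vec{u}^{n+1}\,\vec{u}^{n+1,T} = \nabla \vec{u}^{n+1}\,\vec{u}^{n,T} + \nabla \vec{u}^{n}\,\vec{u}^{n+1,T} - \nabla \vec{u}^{n}\,\vec{u}^{n,T} + O(\Delta t^2) \]

to preserve the second-order accuracy and linearity.

Scheme 3, i.e., (2.10), is modified by the approximation

\[ \nabla \vec{u}^{n+2/3}\,\vec{u}^{n+2/3,T} = \tfrac{2}{3}\nabla \vec{u}^{n+1}\,\vec{u}^{n,T} + \tfrac{2}{3}\nabla \vec{u}^{n}\,\vec{u}^{n+1,T} - \tfrac{1}{3}\nabla \vec{u}^{n}\,\vec{u}^{n,T} + O(\Delta t^2). \]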
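
A short check of why linearizations of this kind keep second-order accuracy may be helpful. The notation $\vec{\delta} = \vec{u}^{n+1} - \vec{u}^{n} = O(\Delta t)$ is introduced here only for illustration, and the intermediate time levels are assumed to satisfy $\vec{u}^{n+1/2} = \vec{u}^{n} + \tfrac{1}{2}\vec{\delta} + O(\Delta t^2)$ and $\vec{u}^{n+2/3} = \vec{u}^{n} + \tfrac{2}{3}\vec{\delta} + O(\Delta t^2)$ for a smooth solution.

\[
\begin{aligned}
\vec{u}^{n+1/2}\vec{u}^{n+1/2,T} - \tfrac{1}{2}\left(\vec{u}^{n+1}\vec{u}^{n,T} + \vec{u}^{n}\vec{u}^{n+1,T}\right) &= \tfrac{1}{4}\,\vec{\delta}\,\vec{\delta}^{T} + O(\Delta t^2), \\
\vec{u}^{n+1}\vec{u}^{n+1,T} - \left(\vec{u}^{n+1}\vec{u}^{n,T} + \vec{u}^{n}\vec{u}^{n+1,T} - \vec{u}^{n}\vec{u}^{n,T}\right) &= \vec{\delta}\,\vec{\delta}^{T}, \\
\vec{u}^{n+2/3}\vec{u}^{n+2/3,T} - \left(\tfrac{2}{3}\vec{u}^{n+1}\vec{u}^{n,T} + \tfrac{2}{3}\vec{u}^{n}\vec{u}^{n+1,T} - \tfrac{1}{3}\vec{u}^{n}\vec{u}^{n,T}\right) &= \tfrac{4}{9}\,\vec{\delta}\,\vec{\delta}^{T} + O(\Delta t^2).
\end{aligned}
\]

In each case the neglected term is quadratic in $\vec{\delta}$ and hence $O(\Delta t^2)$, while the retained terms are linear in the unknown $\vec{u}^{n+1}$.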

5.  TEST  RESULT. 

We give the results of a test case, for which the domain is the square given by $0 \le x \le 1$
and $0 \le y \le 1$. The exact solution is given by

u  =  tf'1  cos(  ty ) 

e  =  —  if-’1  sin {ii) )  (5.1 ) 

p  =  -<  ,r  I  r  cos(  ft/  )  —  //  sin(  ft/)}  +  5. 


where $\vec{u} = (u, v)$. At each time step the finite difference equations were solved to an
accuracy of $5 \cdot 10^{-13}$. Dirichlet boundary conditions were used, i.e., (2.2), where $\vec{b}$ was the
exact data from (5.1).

As shown in Table 1 the solutions were computed to second-order accuracy. For the
results shown in Table 1 the constant $c_0$, see (3.4), was 0.2.

The method gives a substantial increase in speed over methods based on
successive-over-relaxation, see e.g., [3]. Further improvement should give more efficiency.

The finite difference schemes introduced here can be used with domain decomposition,
see [4]. Results of a computation of two-dimensional flow past a rectangular shaped
obstruction are displayed in Figure 1. The domain is decomposed into four sub-domains. The
velocities are interpolated from one subdomain to the boundaries of the other subdomains.
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Figure  1 
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1  Abstract 

A multigrid Alternating Direction Implicit scheme has been developed to
solve the compressible Navier-Stokes equations for two-dimensional problems.
The scheme is an extension of that developed by Caughey [1] to solve
the Euler equations of inviscid compressible flow.

Spatial discretization of the governing equations is done using a finite
volume approximation to provide flexibility in dealing with complicated
geometries. In order to prevent decoupling of the solution at odd- and
even-numbered points of the grid, and to prevent oscillations of the solution near
shock waves, artificial dissipation is added in the form of an adaptive blend
of second and fourth differences of the solution. The time-linearized implicit
operator is approximated as the product of two one-dimensional factors. In
order to improve computational efficiency, each of the implicit factors is
diagonalized using a local similarity transformation. This diagonalization is
possible only when the contributions of the viscous terms to the implicit
operator are approximated or eliminated altogether. But it is fairly
common to treat the viscous terms explicitly when using even non-diagonalized
ADI schemes. The resulting scheme requires the solution of four scalar
pentadiagonal systems along each line in each of the two mesh directions for
each time step. The implicit scheme is used within the framework of the
multigrid method to further accelerate convergence to a steady state.

The  motivation  for  the  development  of  the  method  is  to  improve  upon 
the  convergence  rates  of  explicit  multigrid  methods  on  the  highly  stretched 
grids  required  for  high  Reynolds  number  flows.  The  turbulence  model  used 
here  is  based  on  the  algebraic  model  developed  by  Baldwin  and  Lomax 
[2].  Results  are  presented  for  flows  past  airfoils.  Flow  field  results  are 
presented  to  confirm  the  accuracy  of  the  method,  and  convergence  rates  are 
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compared  with  other  methods  to  demonstrate  the  efficiency  of  the  implicit 
ADI  multigrid  method. 


2  Analysis 


2.1  The  Equations 

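The Reynolds Averaged Navier-Stokes equations in two dimensions can be
written as

\[ \frac{\partial w}{\partial t} + \frac{\partial f}{\partial x} + \frac{\partial g}{\partial y} = \frac{\partial f_v}{\partial x} + \frac{\partial g_v}{\partial y}, \tag{1} \]

where

\[ w = \{\rho,\ \rho u,\ \rho v,\ e\}^{T} \tag{2} \]

is the vector of conserved variables,

\[ f = \{\rho u,\ \rho u^2 + p,\ \rho u v,\ (e + p)u\}^{T}, \tag{3} \]

\[ g = \{\rho v,\ \rho u v,\ \rho v^2 + p,\ (e + p)v\}^{T}, \tag{4} \]

are the inviscid flux vectors in the x and y directions respectively, and

\[ f_v = \{0,\ \sigma_{xx},\ \sigma_{xy},\ u\sigma_{xx} + v\sigma_{xy} - q_x\}^{T}, \tag{5} \]

\[ g_v = \{0,\ \sigma_{xy},\ \sigma_{yy},\ u\sigma_{xy} + v\sigma_{yy} - q_y\}^{T}, \tag{6} \]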


are the viscous flux vectors in the x and y directions respectively. The
variables $\rho$ and $p$ are the fluid density and pressure, $u$ and $v$ are the velocity
components in the x and y directions, and $e$ is the total energy per unit
volume. The equation of state for a calorically perfect gas is used to relate
the pressure to the total energy

\[ p = (\gamma - 1)\left[e - \tfrac{1}{2}\rho\,(u^2 + v^2)\right], \tag{7} \]

where $\gamma$ is the ratio of specific heats. For air, $\gamma = 1.4$. The viscous stresses
and heat fluxes, with the assumption of Stokes' hypothesis, are given by

\[ \sigma_{xx} = 2\mu\frac{\partial u}{\partial x} - \frac{2}{3}\mu\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}\right), \tag{8} \]

\[ \sigma_{xy} = \mu\left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right), \tag{9} \]

\[ \sigma_{yy} = 2\mu\frac{\partial v}{\partial y} - \frac{2}{3}\mu\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}\right), \tag{10} \]

\[ q_x = -k\frac{\partial T}{\partial x}, \tag{11} \]

\[ q_y = -k\frac{\partial T}{\partial y}. \tag{12} \]

Here $T$ is the temperature, $\mu$ is the viscosity and $k$ is the thermal
conductivity of the fluid.
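
The relation between the conserved variables, the pressure, and the inviscid flux in the x direction can be sketched as follows. This is a minimal illustration assuming the standard calorically perfect gas relation $p = (\gamma - 1)[e - \tfrac{1}{2}\rho(u^2 + v^2)]$ with $e$ the total energy per unit volume; the function names `pressure` and `inviscid_flux_x` and the sample state are hypothetical.

```python
GAMMA = 1.4  # ratio of specific heats for air

def pressure(w, gamma=GAMMA):
    """Pressure from the conserved variables w = (rho, rho*u, rho*v, e)."""
    rho, rho_u, rho_v, e = w
    kinetic = 0.5 * (rho_u ** 2 + rho_v ** 2) / rho  # 0.5 * rho * (u^2 + v^2)
    return (gamma - 1.0) * (e - kinetic)

def inviscid_flux_x(w, gamma=GAMMA):
    """Inviscid flux vector f in the x direction for the state w."""
    rho, rho_u, rho_v, e = w
    u = rho_u / rho
    p = pressure(w, gamma)
    return (rho_u, rho_u * u + p, rho_v * u, (e + p) * u)

# Sample state: rho = 1, u = 2, v = 0, p = 1, so e = p/(gamma-1) + 0.5*rho*u^2 = 4.5.
w = (1.0, 2.0, 0.0, 4.5)
p = pressure(w)
f = inviscid_flux_x(w)
print(p, f)
```

Working with the conserved variables directly, as here, avoids storing the pressure as a separate unknown.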


2.2  Turbulence  Model 

The effects of turbulence are modeled using the eddy diffusivity concept for
the Reynolds stresses and eddy thermal conductivity for the turbulent heat
fluxes. The total diffusivities are given by

\[ \mu = \mu_{mol} + \mu_t, \tag{13} \]

\[ k = k_{mol} + k_t, \tag{14} \]

where $\mu_{mol}$ and $k_{mol}$ are the molecular quantities, and $\mu_t$ and $k_t$ are the
turbulent quantities. We obtain closure by modeling $\mu_t$ analytically using
a zero equation model and calculating $k_t$ from $Pr_t$, the turbulent Prandtl
number, which is chosen to be equal to 0.9.

The turbulence model is based on the algebraic model of Baldwin and
Lomax [2]. This is a two-layer zero-equation eddy-viscosity model. The
eddy viscosity $\mu_t$ is given by

\[ \mu_t = \begin{cases} (\mu_t)_{inner} & y \le y_{crossover} \\ (\mu_t)_{outer} & y > y_{crossover} \end{cases} \tag{15} \]

where $y$ is the distance normal to the wall and $y_{crossover}$ is the smallest value
of $y$ at which the value calculated from the inner formula exceeds the value
from the outer formula. Both the inner and outer formulas have the general
form

\[ \mu_t = \alpha\, K\, \ell\, u\, \rho, \tag{16} \]

where $\ell$ is the length scale and $u$ is the velocity scale prescribed by the
model.
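
The two-layer switch between the inner and outer formulas can be sketched as below. This is an illustration only: the function name `blend_eddy_viscosity` is hypothetical, the inner and outer values are supplied as precomputed lists along a wall-normal profile (ordered by increasing distance from the wall), and the sample numbers are made up to exercise the logic.

```python
def blend_eddy_viscosity(y, mu_inner, mu_outer):
    """Apply the two-layer switch along one wall-normal profile.

    y_crossover is taken as the smallest y at which the inner value exceeds
    the outer value; the inner value is used for y <= y_crossover and the
    outer value beyond it. Returns (y_crossover, mu_t); if the inner value
    never exceeds the outer one, y_crossover is None and the inner values
    are returned unchanged.
    """
    crossover_index = None
    for i in range(len(y)):
        if mu_inner[i] > mu_outer[i]:
            crossover_index = i
            break
    if crossover_index is None:
        return None, list(mu_inner)
    y_cross = y[crossover_index]
    mu_t = [mu_inner[i] if y[i] <= y_cross else mu_outer[i] for i in range(len(y))]
    return y_cross, mu_t

# Made-up profile: the inner formula grows with y, the outer one is constant.
y = [0.1, 0.2, 0.3, 0.4, 0.5]
mu_inner = [1.0, 2.0, 3.0, 4.0, 5.0]
mu_outer = [2.5, 2.5, 2.5, 2.5, 2.5]

y_cross, mu_t = blend_eddy_viscosity(y, mu_inner, mu_outer)
print(y_cross, mu_t)
```

Searching outward from the wall for the first crossing, rather than taking the pointwise minimum of the two formulas, keeps the outer formula in force everywhere beyond the crossover even if the inner value later drops below it.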

In the inner region the length scale is the Prandtl mixing length modified
by the van Driest damping factor. The velocity scale is calculated as the
product of the modified mixing length $\ell$ and the magnitude of the local
vorticity $|\omega|$, according to the Prandtl-van Driest formulation.

\[ \ell = \kappa\, y \left(1 - e^{-y^+/A^+}\right), \tag{17} \]

\[ u = \ell\, |\omega|, \tag{18} \]

\[ \alpha = 1.0; \quad K = 1.0, \tag{19} \]

where $y^+$ is the distance from the wall in viscous wall units, $\kappa = 0.4$ is the
von Karman constant and $A^+ = 26.0$ is an effective sub-layer thickness.

To determine the scales in the outer region Baldwin and Lomax defined
a function

\[ F(y) = y\, |\omega| \left(1 - e^{-y^+/A^+}\right). \tag{20} \]

This function $F(y)$ is used to compute the length and velocity scales in the
outer region according to

\[ \ell = 1.6\, y_{F_{max}}, \tag{21} \]

(n  (UFmax  —  Umm)  \ 

Onaxi - p -  It  (22) 

r  max  ) 

where $F_{max}$ is the maximum value of $F(y)$ that occurs in a profile, $y_{F_{max}}$ is
the y-location of that maximum, $U_{F_{max}}$ is the total velocity at that location
and $U_{min}$ is the minimum velocity in the profile. The Clauser constant $\alpha$
and the Klebanoff intermittency factor $K$ are given by


(23) 

(24) 


The model is modified slightly when applied to the wake. In the wake the
van Driest damping factor is set equal to 1, and the y-distance is measured
from the first coordinate line in the wrap-around direction of the C-grid, i.e.,
from the $\eta = 0$ line.


2.3  Finite  Volume  Formulation 

To facilitate the handling of complex geometries a finite volume formulation
is used. First the equations are transformed from the physical plane to the
computational plane through a non-singular transformation. In the new
plane the system of equations can be written as

\[ \frac{\partial W}{\partial t} + \frac{\partial F}{\partial \xi} + \frac{\partial G}{\partial \eta} = \frac{\partial F_v}{\partial \xi} + \frac{\partial G_v}{\partial \eta} \]

where

\[ W = h\,w \tag{25} \]

is the vector of transformed dependent variables and $h = x_\xi y_\eta - x_\eta y_\xi$ is
the determinant of the Jacobian of the transformation. The transformed
inviscid and viscous fluxes are given by

\[ F = f\,y_\eta - g\,x_\eta, \tag{26} \]

\[ G = -f\,y_\xi + g\,x_\xi, \tag{27} \]

\[ F_v = f_v\,y_\eta - g_v\,x_\eta, \tag{28} \]

\[ G_v = -f_v\,y_\xi + g_v\,x_\xi. \tag{29} \]

The  dependent  variables  are  taken  to  be  the  cell  average  quantities. 
Spatial  derivatives  are  approximated  by  evaluating  the  fluxes  across  the 
faces  of  each  cell.  This  requires  the  values  of  the  velocities  on  all  cell  faces, 
which  are  taken  to  be  the  average  of  the  cell  average  values  of  the  two  cells 
sharing  the  face. 
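
The face-averaging idea can be illustrated in one dimension. The sketch below is an assumption-laden simplification, not the scheme of the paper: the helper names `face_values` and `net_flux` are hypothetical, the grid is one-dimensional, and the flux is evaluated from the averaged face value.

```python
def face_values(cell_avg):
    """Value on each interior face = mean of the two cells sharing that face."""
    return [0.5 * (a + b) for a, b in zip(cell_avg[:-1], cell_avg[1:])]

def net_flux(cell_avg, flux):
    """Net flux out of each interior cell: flux at right face minus flux at left face."""
    f_face = [flux(v) for v in face_values(cell_avg)]
    return [fr - fl for fl, fr in zip(f_face[:-1], f_face[1:])]
```

On a uniform mesh with a linear flux this balance is the same as a centered difference of the cell values.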


2.4  Thin  Layer  Approximation 

The thin layer approximation neglects all diffusion processes parallel to the
body surface. In this respect it is similar to the classical boundary layer
approximation, but it is different in that no assumptions are made regarding
the pressure, and the momentum equation normal to the body surface is
retained. In the present formulation the ξ-direction is the coordinate direction
approximately parallel to the surface and the η-direction is approximately
normal to the surface. Therefore all ξ-derivatives are neglected while all
η-derivatives are retained in all the viscous terms and in the evaluation of
the cartesian derivatives in the viscous terms. In particular, the viscous flux
in the η-direction, G_v, which involves the calculation of cartesian derivatives
(Eqs. 8 - 12), is modified to a simpler form G'_v containing only η-derivatives.
The system of equations reduces to

$$ \frac{\partial W}{\partial t} + \frac{\partial F}{\partial \xi} + \frac{\partial G}{\partial \eta} = \frac{\partial G'_v}{\partial \eta}. \tag{30} $$

In most Navier-Stokes solutions for high Reynolds number turbulent
flows  the  diffusion  terms  involving  derivatives  parallel  to  the  body  surface 
have  not  been  resolved  even  when  the  appropriate  terms  have  been  retained 
in  the  equations.  This  is  due  to  the  coarseness  of  the  mesh  in  that  direction. 
In  other  words,  if  the  mesh  is  not  fine  enough  in  the  direction  parallel  to  the 
body  surface  to  resolve  these  diffusion  terms,  it  is  a  wasted  effort  to  try  to 
calculate  them. 

2.5  Artificial  Dissipation 

The finite volume scheme for the Euler equations does not contain any
dissipative terms. In order to prevent odd-even point decoupling and oscillations
near shock waves or stagnation points, artificial dissipation terms must be
added when solving the Euler equations. The Navier-Stokes equations, on the
other hand, possess dissipative properties due to the presence of the viscous
terms. However, the physical dissipation provided by these terms in regions
away from the shear layer may not be sufficient to guarantee stability. So in
order to maintain the stability and robustness of the numerical procedure
it was necessary to add artificial dissipation. The terms were constructed
as an adaptive blend of second and fourth differences with the directional
scaling of the terms suggested by Caughey [1].

The  modified  set  of  equations  is 

&W_  DF_  dG  _  dG\, 

Ot  dr]  di i 


where $\epsilon^{(2)}$ and $\epsilon^{(4)}$ are as defined in [1].
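
To make the idea of an adaptive blend concrete, a one-dimensional periodic sketch follows. It is not the definition used in [1]: the pressure-based switch, the constants `k2` and `k4`, and the function name `blended_dissipation` are assumptions chosen for illustration in the spirit of Jameson-type schemes, and the directional scaling is omitted.

```python
def blended_dissipation(w, p, k2=0.5, k4=1.0 / 64.0):
    """Blend of second and fourth differences of w on a periodic 1-D mesh."""
    n = len(w)
    out = []
    for i in range(n):
        im2, im1, ip1, ip2 = (i - 2) % n, (i - 1) % n, (i + 1) % n, (i + 2) % n
        # Normalised second difference of pressure: large near shocks,
        # small where the flow is smooth (assumed form of the switch).
        nu = abs(p[ip1] - 2.0 * p[i] + p[im1]) / (p[ip1] + 2.0 * p[i] + p[im1])
        eps2 = k2 * nu
        eps4 = max(0.0, k4 - eps2)   # fourth differences switched off near shocks
        d2 = w[ip1] - 2.0 * w[i] + w[im1]
        d4 = w[ip2] - 4.0 * w[ip1] + 6.0 * w[i] - 4.0 * w[im1] + w[im2]
        out.append(eps2 * d2 - eps4 * d4)
    return out
```

With uniform pressure the second-difference term vanishes and the fourth-difference term opposes an odd-even mode in w, which is the decoupling the dissipation is meant to suppress.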

2.6  Iterative  Scheme 

To construct an iterative scheme to solve the difference equations, the
spatial derivatives are first approximated implicitly and the changes in the flux
vectors are linearized in time. The implicit operator thus obtained is
approximated as the product of two one-dimensional factors. The development
thus far follows that of Briley and McDonald [3] and Beam and Warming
[4]. Since the artificial dissipation terms must be treated implicitly for rapid
convergence, this scheme would lead to the requirement to solve block
pentadiagonal systems for each of the two factors. An alternative suggested by
Pulliam and Chaussee [5] is to diagonalize the implicit factors using a local
similarity transformation. This yields the decoupled set of equations

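$$ \{ I + \theta\Delta t [ \Lambda_A \delta_\xi - \epsilon^{(2)} \delta_\xi^2 (1/h) + \epsilon^{(4)} \delta_\xi^4 (1/h) ] \}\, Q_A^{-1} $$

$$ \times\ Q_B \{ I + \theta\Delta t [ \Lambda_B \delta_\eta - \epsilon^{(2)} \delta_\eta^2 (1/h) + \epsilon^{(4)} \delta_\eta^4 (1/h) ] \}\, \Delta V^n $$

$$ = -\Delta t\, Q_A^{-1} \{ \delta_\xi F_{ij} + \delta_\eta G_{ij} - \delta_\eta (G'_v)_{ij} - \epsilon^{(2)} (\delta_\xi^2 + \delta_\eta^2) w_{ij} + \epsilon^{(4)} (\delta_\xi^4 + \delta_\eta^4) w_{ij} \}^n. \tag{32} $$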

Here A and B are the Jacobians of the transformed flux vectors,

$$ A = \left\{ \frac{\partial F}{\partial W} \right\}, \quad B = \left\{ \frac{\partial G}{\partial W} \right\}, \tag{33} $$

and Λ_A and Λ_B are diagonal matrices whose diagonal elements are the
eigenvalues of A and B. The modal matrices Q_A and Q_B diagonalize A and B
according to

$$ Q_A^{-1} A Q_A = \Lambda_A; \quad Q_B^{-1} B Q_B = \Lambda_B. \tag{34} $$

The correction ΔW to the solution in each computational cell is given by

$$ \Delta W^n = Q_B \Delta V^n. \tag{35} $$

The elements of the Jacobian matrices, their modal matrices and the
diagonal matrices have been given by Pulliam and Chaussee [5]. The system
of equations (Eq. 32) is solved at each time step by solving four scalar
pentadiagonal systems along each line in each of the two directions.
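
The benefit of the diagonalization can be seen on a very small example. The sketch below is a point-wise illustration only: the 2 x 2 matrix A = [[0, 1], [c^2, 0]] is a hypothetical stand-in with eigenvalues +c and -c, not one of the flux Jacobians of [5], and the scalar equations are solved at a single point rather than as pentadiagonal systems along a line.

```python
def solve_direct(a, k, b):
    """Solve (I + k A) x = b for a 2x2 matrix A by Cramer's rule."""
    m00, m01 = 1.0 + k * a[0][0], k * a[0][1]
    m10, m11 = k * a[1][0], 1.0 + k * a[1][1]
    det = m00 * m11 - m01 * m10
    return [(b[0] * m11 - m01 * b[1]) / det, (m00 * b[1] - m10 * b[0]) / det]

def solve_diagonalized(c, k, b):
    """Same solve for A = [[0, 1], [c*c, 0]] using A = Q Lambda Q^-1.

    The eigenvalues are +c and -c with eigenvectors (1, c) and (1, -c).
    """
    # v = Q^-1 b, with Q = [[1, 1], [c, -c]]
    v = [0.5 * (b[0] + b[1] / c), 0.5 * (b[0] - b[1] / c)]
    # Two independent scalar equations: (1 + k * lambda) v_new = v
    v = [v[0] / (1.0 + k * c), v[1] / (1.0 - k * c)]
    # Transform back: x = Q v
    return [v[0] + v[1], c * (v[0] - v[1])]
```

Both routines return the same vector; the second never forms or inverts a coupled block, which is what reduces the block systems to scalar ones.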


2.7  Boundary  Conditions 

2.7.1  Explicit  Boundary  Conditions 

The solutions computed to date have been for subsonic freestream Mach
numbers. For subsonic inflow the boundary conditions are based on the
Riemann invariants for the one-dimensional problem normal to the
boundary. The first Riemann invariant

$$ R_1 = \frac{x_\xi v - y_\xi u}{\sqrt{x_\xi^2 + y_\xi^2}} + \frac{2c}{\gamma - 1} \tag{36} $$

is extrapolated from the interior of the domain, while the second Riemann
invariant

$$ R_2 = \frac{x_\xi v - y_\xi u}{\sqrt{x_\xi^2 + y_\xi^2}} - \frac{2c}{\gamma - 1} \tag{37} $$

is specified, as are the entropy and the tangential velocity component. At
a subsonic outflow boundary ρu, ρv and the entropy are extrapolated from
the interior, while pressure is specified to be the freestream value.
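
Once both invariants are known at a boundary point, the normal velocity and the speed of sound follow by simple addition and subtraction. The sketch below assumes the invariants have the form R1 = q_n + 2c/(γ - 1) and R2 = q_n - 2c/(γ - 1), with q_n the velocity component normal to the boundary; the function name `boundary_state` and the default γ = 1.4 are assumptions.

```python
def boundary_state(r1, r2, gamma=1.4):
    """Recover normal velocity and sound speed from the two Riemann invariants.

    r1 = q_n + 2c/(gamma-1) is taken from the interior,
    r2 = q_n - 2c/(gamma-1) is taken from the freestream.
    """
    q_n = 0.5 * (r1 + r2)
    c = 0.25 * (gamma - 1.0) * (r1 - r2)
    return q_n, c
```

The remaining boundary values would then be completed from the specified entropy and tangential velocity.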

On  the  body  surface  the  no-slip  boundary  condition  is  applied,  i.e.,  the 
velocity  components  u  and  v  are  set  equal  to  zero.  Also,  the  surface  is 
assumed  to  be  adiabatic,  i.e.  dT/dn  =  0. 


2.7.2  Implicit  Boundary  Conditions 

In the far-field the implicit boundary conditions are treated in a manner
consistent with characteristic theory; on the body surface homogeneous
Dirichlet conditions are applied.


2.8  Multigrid  Algorithm 

The scheme is implemented within the framework of the multigrid
algorithm following Jameson [6] and Caughey [1]. The algorithm consists of the
following steps:


1. Form a coarse grid by eliminating every second line of the fine grid in
each coordinate direction.

2. Restrict the flow variables to the coarse mesh using area-weighted
averages of the values on the fine mesh.

3. Drive the corrections on the coarse mesh with the residual computed
on the fine mesh.

4. Continue until the coarsest mesh is reached.

5. Prolong corrections back to the finer meshes using bilinear interpolation.

6. Add corrections on the finest mesh to the solution.

7. Repeat cycle.

Both the body-surface and the far-field boundary conditions are updated
on coarser meshes. A fixed V-cycle is used in which the solution is advanced
one time step on each mesh as the grid is coarsened and refined. A fixed
coefficient second difference form of the dissipation is used on all but the
finest mesh.
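
The restriction step (step 2) can be sketched as follows. The function name `restrict` and the list-of-lists storage are assumptions made for illustration; each coarse cell is taken to be the union of a 2 x 2 block of fine cells, which is what removing every second grid line produces.

```python
def restrict(fine, area):
    """Area-weighted average of each 2x2 block of fine cells onto one coarse cell."""
    ni, nj = len(fine) // 2, len(fine[0]) // 2
    coarse = [[0.0] * nj for _ in range(ni)]
    for i in range(ni):
        for j in range(nj):
            cells = [(2 * i + di, 2 * j + dj) for di in (0, 1) for dj in (0, 1)]
            total_area = sum(area[a][b] for a, b in cells)
            coarse[i][j] = sum(fine[a][b] * area[a][b] for a, b in cells) / total_area
    return coarse
```

Weighting by area keeps the integral of each variable over a coarse cell equal to the sum of the integrals over its four fine cells.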

3  Results 

The  scheme  described  above  has  been  applied  to  compute  transonic  flows 
past  the  NACA  0012  airfoil.  The  following  cases  are  presented  here: 

1. M∞ = 0.7, α = 1.49°, Re_c = 9 x 10^6

2. M∞ = 0.799, α = 2.26°, Re_c = 9 x 10^6

These are cases A1 and A3 of the Viscous Transonic Airfoil Workshop of
1987 [7]. All results were calculated on C-grids containing 192 x 48 cells in
the wrap-around and body-normal directions respectively. Of the 192 points
in the wrap-around direction 120 were on the airfoil. The distance from the
airfoil to the first coordinate line was 5 x 10^-5 of a chord, which corresponds
to a y+ less than 4 for the given Reynolds number. The farfield boundaries
were about 7 chord lengths from the airfoil. The cells are highly clustered in
the η-direction near the surface of the airfoil and have large aspect ratios.
The largest aspect ratio is of the order 10^3.

The calculations were performed on an IBM 3090/600J. A typical
calculation took about 100 μs per work-unit per point as compared to 93 μs for
an Euler calculation using the same method. A work unit is the amount of
computation required for advancing one time step on the finest mesh.

Airfoil surface pressure distributions for the two cases are presented to
verify the accuracy of the scheme. The results of the two cases are compared
with the experiments of Harris [8] and the computational results from the
VTA Workshop [7]. For case (1) the flow is attached and just slightly
supersonic near the leading edge upper surface. The measured experimental angle
of attack for this case was 1.86°. This was corrected to 1.49° by Harris using
a linear method for accounting for wind tunnel wall effects. Figure 1 shows
that the computed surface pressures are in excellent agreement with
experimental data. Table 1 gives a comparison of the force coefficients. The lift
coefficient obtained using the present method is about 2% less than the
experimental value and is within the range of values obtained computationally.
Drag coefficients are difficult to calculate accurately because the pressure
integration for drag is very sensitive. Even so, the value obtained, 0.0084, is only
about 6% different from the experimental value, and is within the range of
the VTA Workshop values. Of the total computed drag about 17% is due
to skin friction and the rest is due to pressure drag.

Convergence results are presented in Figures 2 and 3. Grid sequencing
is used, i.e. multigrid solutions are first obtained on coarser grids and then
interpolated for use as initial conditions on fine grids. The error is defined
as the residual of the continuity equation averaged over all the grid cells.
The logarithm of this error is plotted against the number of work units
in Figure 2 for a single grid and for 4 levels of multigrid. We see that
with 4 levels of multigrid the error has been reduced 8 orders of magnitude
in 100 work units, whereas for the single grid it has been reduced only
3 orders of magnitude. The asymptotic rate of convergence is clearly much
improved with multigrid. The CFL number for both cases was 24, and local
time-stepping was used. Figure 3 shows that the three measures of global
convergence, namely the lift coefficient C_l, the drag coefficient C_d and the number
of cells N_sup in which the local velocity is supersonic, have converged to
within plottable accuracy of the final values within 50 work units when
using 4 levels of multigrid.

Figure 4 compares the convergence history of the implicit multigrid
scheme presented in this paper with the explicit multigrid Runge-Kutta
scheme of Martinelli and Jameson [9]. The overall convergence rate and the
asymptotic rate are improved with the present implicit scheme.

The flow conditions for case (2) are M∞ = 0.799, α = 2.26° and Re_c =
9 x 10^6. The flow field contains a shock on the airfoil upper surface at
an x/c of about 0.5. The shock is strong enough to induce a significant
boundary layer separation. The experimental data obtained by Harris [8]
are compared with the computational results in Figure 5. The computational
angle of attack (2.26°) is obtained from the measured angle of attack (2.86°)
using a linear wind tunnel wall correction procedure [8]. Our results are
generally in close agreement with other computational results [10] that use
the same turbulence model, but the shock strength and the shock position
are incorrectly predicted. The computed shock is both stronger and farther
downstream than that measured experimentally. The convergence history is
shown in Figure 6. The rate is comparable to that obtained for the simpler
case (1).

4 Conclusions


The  multigrid  diagonalized  Alternating  Direction  Implicit  scheme  developed 
by  Caughey  has  been  extended  to  solve  the  thin-layer  Navier-Stokes  equation 
for  compressible  flow.  The  Baldwin-Lomax  algebraic  turbulence  model  was 
used.  Results  for  transonic  flows  past  airfoils  were  presented.  They  show 
that  for  attached  flow  the  computed  flowfield  data  are  in  good  agreement 
with the experimental data, but for flows with strong shocks and
shock-induced separation the agreement is poor. This can be attributed to the
equilibrium  nature  of  the  turbulence  model  used.  The  convergence  rates 
obtained  using  the  implicit  method  described  above  are  better  than  those 
obtained using the explicit Runge-Kutta method.

                     Experimental Results   FL053MDI      Computational Results
                     of Harris [1981]       (this work)   VTA Workshop [1987]

Lift Coeff. (Cl)     0.2410                 0.2379        0.2350 - 0.2620

Drag Coeff. (Cd)     0.0079                 0.0084        0.0074 - 0.0100

Table 1: Comparison of Force Coefficients for case (1)

Figure 1: Comparison of computed (FL053MDI) surface pressure coefficients
with values from the Harris experiment (plot legend: Harris Exp. [1981], FL053MDI)

Figure 2: Convergence rates using a single grid and using 4 levels of multigrid
(plot legend: Rate (1) 0.9840, Rate (4) 0.9634)

Figure 3: Convergence of global measures

Figure 4: Comparison of the convergence rate of the present method with
the explicit Runge-Kutta method of Martinelli and Jameson [1988]

Figure 5: Comparison of pressure coefficients for case (2): M∞ = 0.799

Figure 6: Convergence history for case (2): M∞ = 0.799


Abstract


An Alternating Direction Implicit diagonal multigrid algorithm is presented for the solution
of the Navier-Stokes equations of viscous, compressible flow. Attention is focused on the
inclusion of viscous contributions to the implicit factors in a way that will enhance the
stability, yet not disturb the efficiency, of the diagonal algorithm. Flows past two-dimensional
airfoils are computed to demonstrate the stability and efficiency of the scheme.


I.  Introduction 


In the numerical simulation of viscous flows at high Reynolds numbers, it is
necessary to resolve the thin shear regions which develop near solid boundaries. These
thin shear regions require the use of grids with cells of very high aspect ratio, which
are known to hinder convergence for steady problems in explicit schemes. To
overcome such difficulties, Caughey has developed a diagonal implicit algorithm for the
solution of the Euler equations of inviscid, compressible flow [1]. Rapid convergence
is achieved with the use of the implicit scheme within the multigrid framework.

Here, the method is extended to solve the Navier-Stokes equations, and specifically
the thin-layer approximation to those equations. Aspects of the algorithm including
artificial dissipation, boundary conditions, multigrid, and other topics not directly
related to the implementation of the Navier-Stokes equations are not addressed
here at great length; details of those issues can be found in [1]. Instead, emphasis
is directed at methods of adding viscous contributions to the algorithm in a way
which does not disturb the overall stability and efficiency of the implicit scheme.
Since no attempt is made in the present analysis to incorporate a turbulence model,
discussion will be limited to laminar flows.


II. Governing Equations


Navier-Stokes  Equations 

The equations which govern compressible viscous flows are the Navier-Stokes
equations. In Cartesian coordinates, the Navier-Stokes equations in two dimensions can
be written

$$ \frac{\partial w}{\partial t} + \frac{\partial f_c}{\partial x} + \frac{\partial g_c}{\partial y} = \frac{\partial f_v}{\partial x} + \frac{\partial g_v}{\partial y}, \tag{1} $$

where $w = \{\rho, \rho u, \rho v, e\}^T$ is the vector of conserved dependent variables. Here, ρ
denotes the density, u and v the cartesian velocities, and e the total energy per unit
volume. The convective flux vectors in the x- and y-directions, f_c and g_c respectively,
and the viscous flux vectors f_v and g_v are given by

$$ f_c = \{\rho u,\ \rho u^2 + p,\ \rho u v,\ (e + p)u\}^T, $$

$$ g_c = \{\rho v,\ \rho u v,\ \rho v^2 + p,\ (e + p)v\}^T, $$

$$ f_v = \{0,\ \tau_{xx},\ \tau_{xy},\ (\tau \cdot u)_x - q_x\}^T, $$

$$ g_v = \{0,\ \tau_{xy},\ \tau_{yy},\ (\tau \cdot u)_y - q_y\}^T. $$

The viscous shear stresses and the heat fluxes are of the form

$$ \tau_{xx} = 2\mu u_x + \lambda (u_x + v_y), $$

$$ \tau_{yy} = 2\mu v_y + \lambda (u_x + v_y), $$

$$ \tau_{xy} = \mu (u_y + v_x), $$

$$ q_x = -k T_x, $$

$$ q_y = -k T_y, $$

where k is the coefficient of thermal conductivity and T is the temperature. The
second coefficient of viscosity λ is related to the molecular viscosity μ by Stokes'
hypothesis,

$$ \lambda = -\tfrac{2}{3}\mu. \tag{2} $$

An equation of state is needed to relate the pressure and total energy:

$$ p = (\gamma - 1)\left[ e - \tfrac{1}{2}\rho (u^2 + v^2) \right]. \tag{3} $$
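
As a small worked example, the pressure can be recovered from the conserved variables w = {ρ, ρu, ρv, e} using the perfect-gas relation p = (γ - 1)[e - ρ(u² + v²)/2]. The function name `pressure` and the default γ = 1.4 (air) are assumptions introduced for illustration.

```python
def pressure(rho, rho_u, rho_v, e, gamma=1.4):
    """Pressure from the conserved variables via the perfect-gas equation of state."""
    u, v = rho_u / rho, rho_v / rho
    kinetic = 0.5 * rho * (u * u + v * v)   # kinetic energy per unit volume
    return (gamma - 1.0) * (e - kinetic)
```

For a gas at rest with e = 2.5 and γ = 1.4 this gives p = 1; adding kinetic energy to e at the same pressure leaves the result unchanged.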


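To allow treatment of arbitrary geometries, the equations are transformed into
curvilinear coordinates and written

$$ \frac{\partial W}{\partial t} + \frac{\partial F_c}{\partial \xi} + \frac{\partial G_c}{\partial \eta} = \frac{\partial F_v}{\partial \xi} + \frac{\partial G_v}{\partial \eta}, \tag{4} $$

where $W = h\,w$ is the transformed dependent variable and

$$ F_c(W) = \begin{pmatrix} \rho h U \\ \rho h U u + y_\eta p \\ \rho h U v - x_\eta p \\ (e + p) h U \end{pmatrix}, \quad
G_c(W) = \begin{pmatrix} \rho h V \\ \rho h V u - y_\xi p \\ \rho h V v + x_\xi p \\ (e + p) h V \end{pmatrix}, $$

$$ F_v(W, W_\xi, W_\eta) = \begin{pmatrix} 0 \\ y_\eta \tau_{xx} - x_\eta \tau_{xy} \\ y_\eta \tau_{xy} - x_\eta \tau_{yy} \\ y_\eta (u\tau_{xx} + v\tau_{xy} - q_x) - x_\eta (u\tau_{xy} + v\tau_{yy} - q_y) \end{pmatrix}, $$

$$ G_v(W, W_\xi, W_\eta) = \begin{pmatrix} 0 \\ -y_\xi \tau_{xx} + x_\xi \tau_{xy} \\ -y_\xi \tau_{xy} + x_\xi \tau_{yy} \\ -y_\xi (u\tau_{xx} + v\tau_{xy} - q_x) + x_\xi (u\tau_{xy} + v\tau_{yy} - q_y) \end{pmatrix} $$

are the transformed flux vectors. The contravariant velocities U and V are related
to the cartesian velocities by

$$ \begin{pmatrix} U \\ V \end{pmatrix} = \frac{1}{h} \begin{pmatrix} y_\eta u - x_\eta v \\ -y_\xi u + x_\xi v \end{pmatrix}, $$

where $h = x_\xi y_\eta - x_\eta y_\xi$ is the determinant of the Jacobian of the transformation.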
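
The relation between cartesian and contravariant velocities is easy to check numerically. In the sketch below the function name `contravariant` and the argument names for the metric derivatives are assumptions; the formulas are U = (y_η u - x_η v)/h and V = (-y_ξ u + x_ξ v)/h with h = x_ξ y_η - x_η y_ξ.

```python
def contravariant(u, v, x_xi, y_xi, x_eta, y_eta):
    """Contravariant velocities (U, V) and the Jacobian determinant h."""
    h = x_xi * y_eta - x_eta * y_xi          # determinant of the Jacobian
    big_u = (y_eta * u - x_eta * v) / h
    big_v = (-y_xi * u + x_xi * v) / h
    return big_u, big_v, h
```

For the identity mapping (x_ξ = y_η = 1, x_η = y_ξ = 0) the contravariant and cartesian velocities coincide, as expected.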


Thin-Layer  Approximation 

Under certain conditions it is possible to neglect viscous diffusion in the mainstream
direction of the flow without adversely affecting the quality of the solution. The
validity of such a simplification requires that the flow have a predominant direction,
and be without massive separation. High Reynolds number flows over wings are one
such example. Implementation of such a model necessitates that body surfaces be
mapped onto coordinate surfaces, and that there be sufficient clustering normal to the
shear surface to allow the boundary layer to be resolved. It can be argued that even
if the full Navier-Stokes approximation is used, viscous diffusion in the streamwise
direction cannot be resolved unless the grid is sufficiently fine in that direction [2],
and for many practical flows, current computational limitations prevent the use of
grids with sufficient resolution in both the normal and streamwise directions.

The transformed viscous flux vectors can be decoupled into components which
depend only on the vector of dependent variables and its derivative in either the ξ- or
η-direction:


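$$ F_v = F_v(W, W_\xi, W_\eta) = F_v(W, W_\xi) + F_v(W, W_\eta), $$

$$ G_v = G_v(W, W_\xi, W_\eta) = G_v(W, W_\xi) + G_v(W, W_\eta). $$

The thin-layer approximation entails retaining only the surface normal- or η-derivatives
from the viscous terms in the Navier-Stokes equations (Eq. 4); that is, only the
term $G_v(W, W_\eta)$ is kept when the body surface is a line of η = constant. The
thin-layer equations are then written

$$ \frac{\partial W}{\partial t} + \frac{\partial F_c}{\partial \xi} + \frac{\partial G_c}{\partial \eta} = \frac{\partial G_v}{\partial \eta}, \tag{5} $$

where

$$ G_v(W, W_\eta) = \begin{pmatrix} 0 \\ -y_\xi \tau_{xx} + x_\xi \tau_{xy} \\ -y_\xi \tau_{xy} + x_\xi \tau_{yy} \\ -y_\xi (u\tau_{xx} + v\tau_{xy} - q_x) + x_\xi (u\tau_{xy} + v\tau_{yy} - q_y) \end{pmatrix} $$

and

$$ \tau_{xx} = -\left(\frac{2\mu + \lambda}{h}\right) y_\xi u_\eta + \frac{\lambda}{h} x_\xi v_\eta, $$

$$ \tau_{yy} = -\frac{\lambda}{h} y_\xi u_\eta + \left(\frac{2\mu + \lambda}{h}\right) x_\xi v_\eta, $$

$$ \tau_{xy} = \frac{\mu}{h} (x_\xi u_\eta - y_\xi v_\eta), $$

$$ q_x = \frac{k}{h} y_\xi T_\eta, $$

$$ q_y = -\frac{k}{h} x_\xi T_\eta. $$


III. Numerical Method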


As with the algorithm for the Euler equations, spatial derivatives are approximated
using a finite-volume formulation equivalent to a centered-difference
approximation [1]. The approximation is second-order accurate when the mesh is smooth.
Artificial dissipation consisting of an adaptive blend of second and fourth
differences of the solution is added to insure convergence to steady state and to enable
accurate shock capturing for transonic flows. Local time stepping is used to increase
the convergence rate for steady problems. To further accelerate convergence, a
recursive multigrid algorithm similar to that described by Smith and Caughey [3] is
implemented.

The difference equations have the form

$$ \frac{d}{dt} W_{i,j} + C w_{i,j} - V w_{i,j} - D w_{i,j} = 0, \tag{6} $$

where $C w_{i,j}$ and $V w_{i,j}$ represent contributions due to convection and viscous
diffusion respectively, and $D w_{i,j}$ represents the artificial dissipation defined in [1]. To
simplify the expressions, contributions from the artificial dissipation will no longer
be shown; it should be noted, however, that these terms play an important role in
the overall algorithm, and a detailed description of the terms can be found in [1, 4].

The first step in developing an ADI scheme is to approximate the spatial derivatives
as weighted averages at new and old time levels. Such an approximation to the
thin-layer equation (Eq. 5) can be written


where $\Delta W^n = W^{n+1} - W^n$ is the correction added to the solution, and θ represents
the implicitness of the scheme with 0 ≤ θ ≤ 1.

The changes in the convective flux vectors can be linearized with a local Taylor
series expansion in time to give

$$ F_c^{n+1} - F_c^n = A^n \Delta W^n + O(\Delta t^2) \tag{8} $$

and

$$ G_c^{n+1} - G_c^n = B^n \Delta W^n + O(\Delta t^2), \tag{9} $$

where $A = \{\partial F_c / \partial W\}$ and $B = \{\partial G_c / \partial W\}$ are the Jacobians of the transformed
convective flux vectors with respect to the solution. Since the transformed viscous
flux vector $G_v$ is a function of both $W$ and $W_\eta$, the appropriate linearization is

$$ G_v^{n+1} - G_v^n = M^n \Delta W^n + N^n \Delta W_\eta^n + O(\Delta t^2) \tag{10} $$

$$ = (M - N_\eta)^n \Delta W^n + \frac{\partial}{\partial \eta} (N^n \Delta W^n) + O(\Delta t^2), \tag{11} $$

where $M = \{\partial G_v / \partial W\}$ is the Jacobian of the transformed viscous flux vector
with respect to the solution and $N = \{\partial G_v / \partial W_\eta\}$ is the Jacobian with respect to
the derivative of the solution. Recognizing that $M - N_\eta = 0$ if the transport
coefficients are approximated to be locally constant [5], the linearization reduces to

$$ G_v^{n+1} - G_v^n = \frac{\partial}{\partial \eta} (N^n \Delta W^n) + O(\Delta t^2). \tag{12} $$


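The viscous flux Jacobian is

$$ N = \begin{pmatrix} 0 & 0 & 0 & 0 \\ n_{21} & n_{22} & n_{23} & 0 \\ n_{31} & n_{32} & n_{33} & 0 \\ n_{41} & n_{42} & n_{43} & n_{44} \end{pmatrix}, \tag{13} $$

where

with

$$ \alpha_1 = \frac{2\mu + \lambda}{h} y_\xi^2 + \frac{\mu}{h} x_\xi^2, \qquad \alpha_2 = -\frac{\lambda + \mu}{h} x_\xi y_\xi, $$

$$ \alpha_3 = \frac{\mu}{h} y_\xi^2 + \frac{2\mu + \lambda}{h} x_\xi^2, \qquad \alpha_4 = \frac{\gamma \mu}{Pr\, h} \left( x_\xi^2 + y_\xi^2 \right). $$
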
The matrix N is not to be confused with the viscous flux Jacobian described by
Steger [6] in which the elements of the matrix are differential operators. Introducing
the approximations of Eqs. 8, 9, and 12 into Eq. 7 results in what is commonly called
the "delta" form of the algorithm:

$$ \{ I + \theta\Delta t ( A_{ij} \delta_\xi + B_{ij} \delta_\eta - N_{ij} \delta_{\eta\eta} ) \}^n \Delta W^n_{ij} = -\Delta t \{ \delta_\xi F_{c\,ij} + \delta_\eta G_{c\,ij} - \delta_\eta G_{v\,ij} \}^n. \tag{14} $$

Approximating the left hand side of Eq. 14 as the product of two one-dimensional
factors results in a block ADI scheme and is written

$$ \{ I + \theta\Delta t A_{ij} \delta_\xi \}^n \times \{ I + \theta\Delta t ( B_{ij} \delta_\eta - N_{ij} \delta_{\eta\eta} ) \}^n \Delta W^n_{ij} = -\Delta t \{ \delta_\xi F_{c\,ij} + \delta_\eta G_{c\,ij} - \delta_\eta G_{v\,ij} \}^n. \tag{15} $$

With the addition of fourth order artificial dissipation, block pentadiagonal systems
must be solved in each factor of Eq. 15. For the Euler equations, the convective
flux Jacobians can be diagonalized with local similarity transformations as
$A = Q_A \Lambda_A Q_A^{-1}$ and $B = Q_B \Lambda_B Q_B^{-1}$, where $\Lambda_A$ and $\Lambda_B$ are diagonal matrices whose
diagonal elements are the eigenvalues of their respective Jacobians, and $Q_A$ and
$Q_B$ are the modal matrices whose elements are given in [7]. This allows the block
equations to be decoupled into equations which can be solved as scalar pentadiagonal
systems, greatly reducing the amount of computational labor needed for a solution.

For the Navier-Stokes equations, however, it is not possible to both include the
viscous terms in the implicit factor and to diagonalize the system, since the convective
and viscous Jacobians are not simultaneously diagonalizable. If viscous
contributions are neglected completely from implicit consideration, a diagonalized system
can be written

$$ \{ I + \theta\Delta t \Lambda_{A\,ij} \delta_\xi \} Q_{A\,ij}^{-1} \times Q_{B\,ij} \{ I + \theta\Delta t \Lambda_{B\,ij} \delta_\eta \} Q_{B\,ij}^{-1} \Delta W^n_{ij} = -\Delta t\, Q_{A\,ij}^{-1} \{ \delta_\xi F_{c\,ij} + \delta_\eta G_{c\,ij} - \delta_\eta G_{v\,ij} \}^n. \tag{16} $$

Neglecting the viscous terms completely from the implicit factors would jeopardize
the stability of the scheme. It is desirable to maintain the efficiency of the
diagonalized scheme without degrading its stability properties, so alternate approaches
must be explored.


Method  I 

The  first  method  consists  of  using  the  largest  eigenvalue  of  the  viscous  Jacobian 
to  add  contributions  to  the  existing  implicit  factors.  This  is  similar  to  what  was 
suggested  by  Pulliam  [8].  The  eigenvalues  of  N  are 


Ai 

A2 

A3 

A4 


(17) 


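where Pr is the Prandtl number. A scheme is constructed by adding the diagonal
approximation $\Lambda_N \approx Q_B^{-1} N Q_B$ to the appropriate implicit factor:

$$ \{ I + \theta\Delta t \Lambda_{A\,ij} \delta_\xi \} Q_{A\,ij}^{-1} \times Q_{B\,ij} \{ I + \theta\Delta t ( \Lambda_{B\,ij} \delta_\eta - \Lambda_{N\,ij} \delta_{\eta\eta} ) \} Q_{B\,ij}^{-1} \Delta W^n_{ij} = -\Delta t\, Q_{A\,ij}^{-1} \{ \delta_\xi F_{c\,ij} + \delta_\eta G_{c\,ij} - \delta_\eta G_{v\,ij} \}^n. \tag{18} $$

The diagonal approximation $\Lambda_N$ is, for example,

The number of additional operations needed to implement this scheme is negligible
since it involves only the calculation of an eigenvalue whose analytical form is known.


Method II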

Another option is to use an additional implicit operator which contains the exclusive
contributions from the viscous terms. Since the eigenvalues of the viscous Jacobian
are distinct (Eq. 17), a modal matrix $Q_N$ exists which diagonalizes N through a
similarity transformation. This results in the scheme


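$$ \{ I + \theta\Delta t \Lambda_{A\,ij} \delta_\xi \} Q_{A\,ij}^{-1} $$

$$ \times\ Q_{B\,ij} \{ I + \theta\Delta t \Lambda_{B\,ij} \delta_\eta \} Q_{B\,ij}^{-1} \tag{20} $$

$$ \times\ Q_{N\,ij} \{ I - \theta\Delta t \Lambda_{N\,ij} \delta_{\eta\eta} \} Q_{N\,ij}^{-1} \Delta W^n_{ij} $$

$$ = -\Delta t\, Q_{A\,ij}^{-1} \{ \delta_\xi F_{c\,ij} + \delta_\eta G_{c\,ij} - \delta_\eta G_{v\,ij} \}^n. $$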


The  modal  matrix  and  its  inverse  are  written 


QT1  = 


where κ = ξ or η, and


■  0 

Q.  - 

6 

0  0 

0  ky 

0  —  kx 
1  -A 

1  " 

U 

V 

£ 

p  - 

9 

£ 

*y  , 

-  -  +  U2  +  V2 
p 

U 

-u 

Ky 

ki+«i 

—  V 

—  *1 

1 

0 

*T+*y 

0 

«  r 

K.  —  — 

Kv 

/Vy  — 

> 

/"*  +  *v 

=  kzu  +  kyv. 

A  =  kx 

V  —  kyU. 

(21) 


0 

1 

0 

0 


(22) 


Although the above methods are developed for the thin-layer approximation, the
schemes can be readily modified to accommodate the full Navier-Stokes
approximation. Analogous terms must be added to account for the viscous contributions in
the ξ-direction. Also, the stability properties of these methods still remain an issue
and must be further explored.


IV. Stability Analysis


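The stability properties of these schemes are examined using von Neumann (Fourier)
analysis on a scalar model equation. The model equation for the thin-layer
approximation contains fourth order artificial dissipation and is written

$$ \frac{\partial u}{\partial t} + c \frac{\partial u}{\partial x} + c \frac{\partial u}{\partial y} + \epsilon c \left( \Delta x^3 \frac{\partial^4 u}{\partial x^4} + \Delta y^3 \frac{\partial^4 u}{\partial y^4} \right) = \nu \frac{\partial^2 u}{\partial y^2}. \tag{23} $$

Substitution of the Fourier term $u = G^n e^{i \xi x / \Delta x} e^{i \eta y / \Delta y}$ into the model and writing it
as the product of one-dimensional operators leads to

$$ (G - 1) \{ 1 + i \theta_I \lambda_x \sin \xi + 16 \theta_D \epsilon \lambda_x \sin^4 \tfrac{\xi}{2} \} $$

$$ \times \{ 1 + i \theta_I \lambda_x AR^{-1} \sin \eta + 16 \theta_D \epsilon \lambda_x AR^{-1} \sin^4 \tfrac{\eta}{2} + 4 \theta_V \lambda_x Re_x^{-1} AR^{-2} \sin^2 \tfrac{\eta}{2} \} $$

$$ = -\lambda_x \{ i (\sin \xi + AR^{-1} \sin \eta) + 16 \epsilon (\sin^4 \tfrac{\xi}{2} + AR^{-1} \sin^4 \tfrac{\eta}{2}) + 4 Re_x^{-1} AR^{-2} \sin^2 \tfrac{\eta}{2} \}. \tag{24} $$

From Eq. 24, the magnitude of G can be calculated,

$$ |G| = f(\xi, \eta;\ \lambda_x, Re_x, AR, \epsilon, \theta_I, \theta_D, \theta_V), $$

where ξ and η represent the mesh wave numbers. In addition to the Courant number
λ_x = cΔt/Δx and artificial dissipation ε, the numerical stability of the implicit
viscous equations is governed primarily by the aspect ratios (AR) of the mesh cells
and the mesh Reynolds numbers (Re_x). The expression in Eq. 24 corresponds to
what is done in Method I. Similar expressions representative of Method II can also
be derived.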

Using such a model, it is found that when viscous terms are added directly to the
convective operators, analogous to what is done with Method I, unconditional
stability is achieved. If the viscous terms are evaluated explicitly without implicit
contributions, a conditionally stable scheme results. This can be seen in Figure 1 in
which the dark areas represent regions in parameter space where von Neumann
analysis predicts an amplification factor greater than unity. This figure represents the
properties of the scheme applied to a model problem using values of the dissipation
parameters characteristic of those used in the computations, and a Courant number
of 16. The possibility, however, of obtaining a converged solution without
including viscous contributions in the implicit factors cannot be ruled out. If additional
viscous operators are added to the scheme (as is done in Method II), the solution
will remain conditionally stable, although the region of stability is increased slightly
as shown in Figure 2. The stability analysis indicates that the most promising
algorithm would be one similar to Method I. Method II should also be considered,
however, in so far as it represents less of an approximation than Method I.


V.  Results 


Both methods are implemented in a computer code to calculate transonic flows
in two dimensions. A number of test cases have been computed for flows past
a two-dimensional NACA 0012 symmetric airfoil. The case presented here is for
subcritical laminar flow (Re = 5000, $M_\infty$ = 0.5) past the airfoil at zero degrees
angle of attack. The calculation is performed
using the thin-layer approximation on a 192 x 48 cell C-grid generated using the
GRAPE code elliptic mesh generator [9]. The outer boundary of the mesh is located
about 8 chords from the body. Care is taken to ensure sufficient clustering in the
region close to the body surface where viscous effects are significant. Approximately
10 mesh points are included within the boundary layer at the airfoil trailing edge,
and the first point normal to the body surface is located at about 0.001 chords.

The surface pressure distribution, presented in Figure 3, agrees well with that
presented by Martinelli, Jameson, and Grasso [10]. The flow separates at approximately
85% of the chord, as can be seen from the contour plot of the streamwise component
of mass-flux density in Figure 4; this value is close to the values reported by both
Swanson and Turkel [11] and Jayaram and Jameson [12] for this case.

The iterative process is begun by initializing the solution to free stream values. A
plot of the convergence history is shown in Figure 5. Using five levels of multigrid
and local time stepping, the solution converged to a steady state in approximately
35 work units, which corresponds to 20 multigrid cycles. One work unit is defined
as the amount of work required to advance the solution one time step on the finest
mesh level; for the strategy used here, each multigrid cycle requires approximately
1 2/3 work units. Overall, the average residual is reduced by 6 orders of
magnitude in about 300 work units or 180 multigrid cycles. This represents a significant
improvement over rates reported by other researchers [10,11].
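
The quoted figures can be turned into average reduction factors with a few lines of arithmetic. The sketch below only restates the numbers given in the text (6 orders of magnitude, 300 work units, 180 cycles); the variable names are illustrative.

```python
orders = 6          # orders of magnitude of residual reduction
work_units = 300    # work units needed for that reduction
cycles = 180        # multigrid cycles needed for that reduction

# Average factor by which the residual shrinks per work unit and per cycle.
rate_per_work_unit = 10 ** (-orders / work_units)
rate_per_cycle = 10 ** (-orders / cycles)
# Work units spent per multigrid cycle (the text quotes about 1 2/3).
work_per_cycle = work_units / cycles

print(round(rate_per_work_unit, 4), round(rate_per_cycle, 4), work_per_cycle)
```

An average factor below one per work unit indicates convergence; the annotation "Rate 1.0220" that survives from the plot for Figure 7 is consistent with a run that does not converge.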

This solution is computed at a Courant number of 16 using Method I. At this
Courant number, both Methods I and II produce converged solutions, as illustrated
in Figures 5 and 6. However, a converged solution is not attainable if viscous terms
are omitted from the implicit factors, as is evident in Figure 7. This demonstrates
the importance of maintaining an implicit viscous contribution to the numerical
scheme. Converged solutions from a completely explicit viscous scheme have also
been obtained, but at the expense of a lower Courant number and, hence, a lower rate
of convergence.

The importance of including viscous contributions in the implicit operator has been
demonstrated. Although several options are available for implicit inclusion, the
addition of approximate terms to the existing operators seems the most effective.
The laminar solutions obtained are in good agreement with results reported by other
researchers [10,11,12], while significant improvements in rates of convergence are
achieved. Work is underway to extend these methods to three dimensions, and to
incorporate a turbulence model so that engineering flows can be studied.

Figure 1: Explicit Viscous Treatment Only

Figure 2: Additional Viscous Operator

Figure 3: Surface pressure distribution: Re = 5000, $M_\infty$ = 0.50, $\alpha$ = 0.0

Figure 4: Streamwise mass-flux density: Re = 5000, $M_\infty$ = 0.50, $\alpha$ = 0.0

Figure 5: Residual convergence history - Method I: Re = 5000, $M_\infty$ = 0.50, $\alpha$ = 0.0

Figure 6: Residual convergence history - Method II: Re = 5000, $M_\infty$ = 0.50, $\alpha$ = 0.0


[Plot of log R; run annotations: Mach 0.500, Alpha 0.000, Re 5000, CFL 16.00, Grid 192x48, Work 174.73, Rate 1.0220]

Figure 7: Residual convergence history - Explicit Viscous: Re = 5000, $M_\infty$ = 0.50, $\alpha$ = 0.0

Extremum Control:

The  Effects  of  Artificial  Viscosity 


Culbert  B.  Laney,  Center  for  Applied  Mathematics 
David  A.  Caughey,  Sibley  School  of  Mechanical  and  Aerospace  Engineering 

Cornell University, Ithaca, NY 14850


This paper concerns numerical approximation to discontinuous solutions of
conservation laws. First order spatially accurate methods routinely capture
discontinuities smoothly. The best of these capture grid-aligned steady shocks with only
one transition point. Unfortunately, formally higher-order accurate methods
include terms which may become large near discontinuities, resulting in spurious
oscillations and overshoots. The goal is to design higher-order methods with the
shock capturing abilities of first order methods. The 1980s saw the introduction
of several successful approaches [1,2,3,4]. We propose to evaluate such
numerical methods in terms of their effect on the growth and creation of extrema. The
model problem is a one-dimensional scalar semidiscrete approximation. The
results for semidiscrete approximations apply immediately to steady state
solutions, since in this case time discretization affects only convergence rate. We
briefly consider extensions to multidimensional problems.


1  Basic  Theory 


Consider the following scalar one-dimensional initial value problem on an
unbounded domain:

\[
\frac{\partial u}{\partial t} + \frac{\partial f(u)}{\partial x} = 0, \qquad
u(x,0) = \phi(x), \qquad \frac{df}{du} = a(u). \tag{1}
\]

The solution u is constant along straight-line characteristics given by dx/dt =
a(u). If two characteristics intersect then no continuous solution exists; 'weak'
solutions containing jump discontinuities must be allowed. Weak solutions to
equation (1) have the following interesting properties [2,5]:

E1 Maxima do not increase in time. Minima do not decrease.

E2 No new extrema are created.

Consider the semidiscrete finite-difference approximation to equation (1):

\[ \frac{du_k(t)}{dt} = \frac{1}{\Delta x_k}\, H_k(t) \tag{2} \]

\[ H_k(t) = H[u_{k+K}(t), \ldots, u_{k-K}(t)] \tag{3} \]

where the difference stencil is $2K + 1$ points wide, $K \ge 1$; the grid spacing
is $\Delta x_k = x_{k+1/2} - x_{k-1/2}$; the cell boundaries occur at $x_{k \pm 1/2}$; and $u_k(t)$
approximates $u(x_k, t)$. When will $u_k(t)$ inherit properties E1 and E2 of the exact
solution?

Theorem 1: The solution to equation (2) has property E1 if

$H_k(t) \le 0$ for $u_k(t)$ maximum

$H_k(t) \ge 0$ for $u_k(t)$ minimum

Theorem 2: Property E1 implies property E2.

The conditions of Theorem 1 are necessary and sufficient if $du_k/dt$ exists.
Otherwise, the conditions hold in the limit from the right and/or left. To prove
Theorem 2, note that continuous extrema must start out infinitely small, at
which point E1 acts to prevent further growth. Otherwise, replace discontinuous
extrema by continuous extrema with slope approaching infinity.

Enforcing E1 leads to ‘clipping’ error at moving continuous extrema. For
example, suppose the exact solution contains a maximum M moving to the left.
In general, the maximum falls somewhere between two grid points j and j + 1.
At some time, j should become a maximum on the grid, with a value less than
M. E1 prevents further increase at j, and the top of the maximum is ‘clipped’
off as time progresses. This process limits accuracy at unsteady extrema to
roughly second order, but steeply peaked extrema are obviously more affected
than flattish extrema. As a matter of fact, many methods uniformly reduce
extrema, which affects steady as well as unsteady extrema, e.g., [2]. Because of
these problems, one might reasonably wish to enforce only condition E2. While
this is possible for fully-discrete approximations, it is as yet unclear how this
could be accomplished in semidiscrete cases.

Consider the following three forms for $H_k(t)$:

Conservation Form

\[ H_k = -(h_{k+1/2} - h_{k-1/2}) \]

where $h_{k+1/2} = h(u_{k-K+1}, \ldots, u_{k+K})$ is a Lipschitz continuous function
consistent with $f(u)$ in the sense that $h(u, \ldots, u) = f(u)$.

Viscosity Form

\[ H_k = -\tfrac{1}{2}\left[ f(u_{k+1}) - f(u_{k-1}) - q_{k+1/2}\Delta_{k+1/2}u + q_{k-1/2}\Delta_{k-1/2}u \right] \]

where $q_{k+1/2} = q(u_{k-K+1}, \ldots, u_{k+K})$ is Lipschitz continuous and $\Delta_{k+1/2}u =
u_{k+1} - u_k$. Viscosity form is related to conservation form by:

\[ h_{k+1/2} = \tfrac{1}{2}\left[ f(u_{k+1}) + f(u_k) - q_{k+1/2}\Delta_{k+1/2}u \right] \]
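
The relation between the viscosity and conservation forms can be checked numerically. The following is a minimal sketch, assuming the Burgers flux f(u) = u^2/2 and the viscosity coefficient q = |a| with a = (u_k + u_{k+1})/2; both choices, and all function names, are made here only for illustration. It computes H_k both ways, with the conservative flux taken as h_{k+1/2} = [f(u_{k+1}) + f(u_k) - q_{k+1/2}(u_{k+1} - u_k)]/2.

```python
def f(u):
    """Burgers flux, chosen only for illustration."""
    return 0.5 * u * u

def q_roe(ul, ur):
    """Viscosity coefficient q = |a|, a = (f(ur) - f(ul)) / (ur - ul)."""
    return abs(0.5 * (ul + ur))

def h_flux(u, k, q):
    """Conservative flux h_{k+1/2} built from the viscosity coefficient q."""
    du = u[k + 1] - u[k]
    return 0.5 * (f(u[k + 1]) + f(u[k]) - q(u[k], u[k + 1]) * du)

def H_conservation(u, k, q):
    """H_k as a difference of conservative fluxes."""
    return -(h_flux(u, k, q) - h_flux(u, k - 1, q))

def H_viscosity(u, k, q):
    """H_k written directly in viscosity form."""
    dp = u[k + 1] - u[k]
    dm = u[k] - u[k - 1]
    return -0.5 * (f(u[k + 1]) - f(u[k - 1])
                   - q(u[k], u[k + 1]) * dp + q(u[k - 1], u[k]) * dm)

u = [0.0, 1.0, 0.5, -0.5, -1.0, 0.2]
diffs = [abs(H_conservation(u, k, q_roe) - H_viscosity(u, k, q_roe))
         for k in range(1, len(u) - 1)]
print(max(diffs))
```

The two evaluations agree to rounding error at every interior point, since the f(u_k) contributions of neighbouring fluxes cancel.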

Incremental Form

\[ H_k = C^{+}_{k+1/2}\Delta_{k+1/2}u - C^{-}_{k-1/2}\Delta_{k-1/2}u \]

where $C^{\pm}_{k+1/2} = C^{\pm}(u_{k-K+1}, \ldots, u_{k+K})$ are Lipschitz continuous functions such
that the following conservation condition holds:

\[ C^{-}_{k+1/2} - C^{+}_{k+1/2} = a_{k+1/2} \tag{4} \]

\[
a_{k+1/2} =
\begin{cases}
\dfrac{f(u_{k+1}) - f(u_k)}{u_{k+1} - u_k} & \text{if } u_{k+1} \ne u_k \\[1ex]
a(u_k) & \text{otherwise}
\end{cases}
\]

Incremental form relates to conservation form via:

\[ h_{k+1/2} = -C^{+}_{k+1/2}\Delta_{k+1/2}u + f(u_k) = -C^{-}_{k+1/2}\Delta_{k+1/2}u + f(u_{k+1}) \]

Incremental form relates to artificial viscosity form via:

\[
q_{k+1/2} = C^{+}_{k+1/2} + C^{-}_{k+1/2}, \qquad
C^{+}_{k+1/2} = \tfrac{1}{2}(q_{k+1/2} - a_{k+1/2}), \qquad
C^{-}_{k+1/2} = \tfrac{1}{2}(q_{k+1/2} + a_{k+1/2}) \tag{6}
\]

Examples of the incremental and viscosity forms include:

Central Differences: $H_k = -(f(u_{k+1}) - f(u_{k-1}))/2$

\[ C^{+}_{k+1/2} = -\tfrac{1}{2}a_{k+1/2}, \qquad C^{-}_{k+1/2} = \tfrac{1}{2}a_{k+1/2}, \qquad q_{k+1/2} = 0 \]

Roe’s Method (First order upwind) [6]

(The scalar version is also known as the Cole-Murman method [7].)

\[ C^{+}_{k+1/2} = \max(0, -a_{k+1/2}), \qquad C^{-}_{k+1/2} = \max(0, a_{k+1/2}), \qquad q_{k+1/2} = |a_{k+1/2}| \]
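
The two examples can be exercised in a short sketch. It assumes the Burgers flux f(u) = u^2/2 (an illustrative choice) and uses hypothetical helper names. It checks that the coefficient pairs C+ = -a/2, C- = a/2 (central) and C+ = max(0, -a), C- = max(0, a) (Roe) satisfy the conservation condition C- minus C+ equals a, and that the incremental form H_k = C+_{k+1/2}(u_{k+1} - u_k) - C-_{k-1/2}(u_k - u_{k-1}) reproduces the corresponding flux differences.

```python
def f(u):
    """Burgers flux, chosen only for illustration."""
    return 0.5 * u * u

def a_half(ul, ur):
    """Interface speed a_{k+1/2}; a(u) = u for the Burgers flux."""
    return (f(ur) - f(ul)) / (ur - ul) if ur != ul else ul

def central(a):
    """(C+, C-) for central differences."""
    return (-0.5 * a, 0.5 * a)

def roe(a):
    """(C+, C-) for Roe's first order upwind method."""
    return (max(0.0, -a), max(0.0, a))

def H_incremental(u, k, coeffs):
    """H_k = C+_{k+1/2} (u_{k+1} - u_k) - C-_{k-1/2} (u_k - u_{k-1})."""
    cp, _ = coeffs(a_half(u[k], u[k + 1]))
    _, cm = coeffs(a_half(u[k - 1], u[k]))
    return cp * (u[k + 1] - u[k]) - cm * (u[k] - u[k - 1])

def h_roe(u, k):
    """Roe flux h_{k+1/2} in viscosity form with q = |a|."""
    a = a_half(u[k], u[k + 1])
    return 0.5 * (f(u[k + 1]) + f(u[k]) - abs(a) * (u[k + 1] - u[k]))

u = [0.0, 1.0, 0.5, -0.5, -1.0, 0.2]
inner = range(1, len(u) - 1)
central_err = max(abs(H_incremental(u, k, central)
                      + 0.5 * (f(u[k + 1]) - f(u[k - 1]))) for k in inner)
roe_err = max(abs(H_incremental(u, k, roe)
                  + (h_roe(u, k) - h_roe(u, k - 1))) for k in inner)
print(central_err, roe_err)
```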

Consider the following simple corollary to Theorem 1:

Corollary 3: The incremental form has property E1 if

$C^{+}_{k+1/2}\Delta_{k+1/2}u \le C^{-}_{k-1/2}\Delta_{k-1/2}u$ for $u_k$ max ($\Delta_{k+1/2}u \le 0$ and $\Delta_{k-1/2}u \ge 0$)

$C^{+}_{k+1/2}\Delta_{k+1/2}u \ge C^{-}_{k-1/2}\Delta_{k-1/2}u$ for $u_k$ min ($\Delta_{k+1/2}u \ge 0$ and $\Delta_{k-1/2}u \le 0$)

This implies that increasing $C^{\pm}$ tends to enforce E1; in particular, E1 holds if
$C^{\pm} \ge 0$. Harten [2] introduced this popular ‘positivity’ condition; when true,
incremental form becomes a scalar version of flux vector splitting [8]. Clearly,
Roe’s method is uniformly positive while the central difference method is not.
We examine the positivity condition further in [9].
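
A small example makes the contrast between the two methods concrete. The sketch below assumes the Burgers flux and a three-point data set chosen by hand; the helper `violates_E1` is a hypothetical name. In incremental form, Corollary 3 at a maximum is the statement H_k <= 0 (and H_k >= 0 at a minimum), so the check simply evaluates the sign of H_k at the extremum.

```python
def f(u):
    """Burgers flux, chosen only for illustration."""
    return 0.5 * u * u

def a_half(ul, ur):
    """Interface speed a_{k+1/2}; a(u) = u for the Burgers flux."""
    return (f(ur) - f(ul)) / (ur - ul) if ur != ul else ul

def central(a):
    return (-0.5 * a, 0.5 * a)             # (C+, C-)

def roe(a):
    return (max(0.0, -a), max(0.0, a))     # (C+, C-)

def H_incremental(u, k, coeffs):
    cp, _ = coeffs(a_half(u[k], u[k + 1]))
    _, cm = coeffs(a_half(u[k - 1], u[k]))
    return cp * (u[k + 1] - u[k]) - cm * (u[k] - u[k - 1])

def violates_E1(u, k, coeffs):
    """True if u_k is a maximum that grows or a minimum that shrinks."""
    dp, dm = u[k + 1] - u[k], u[k] - u[k - 1]
    H = H_incremental(u, k, coeffs)
    is_max = dp <= 0 <= dm
    is_min = dp >= 0 >= dm
    return (is_max and H > 0) or (is_min and H < 0)

u = [0.8, 1.0, 0.0]   # u[1] is a local maximum
print(violates_E1(u, 1, central), violates_E1(u, 1, roe))
```

For this data the central difference method gives H_k = 0.16 at the maximum, so the maximum would grow, while Roe's method gives H_k = -0.18.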

2  Second  Order  Artificial  Viscosity

We may enforce Corollary 3 by adding second order artificial viscosity, thereby
increasing $C^{+}$ and $C^{-}$ at extrema:

\[ H_k = (C^{+}_{k+1/2} + \tfrac{1}{2}\epsilon_{k+1/2})\Delta_{k+1/2}u - (C^{-}_{k-1/2} + \tfrac{1}{2}\epsilon_{k-1/2})\Delta_{k-1/2}u \]

where $\epsilon_{k+1/2} \ge 0$ is the coefficient of second order artificial viscosity and $C^{\pm}$
belongs to some higher-order method. Suppose the higher-order method violates
Corollary 3 at point k. Choose $\epsilon_{k \pm 1/2} \ge 0$ so that:

\[ (C^{+}_{k+1/2} + \tfrac{1}{2}\epsilon_{k+1/2})\Delta_{k+1/2}u = (C^{-}_{k-1/2} + \tfrac{1}{2}\epsilon_{k-1/2})\Delta_{k-1/2}u \tag{8} \]

Corollary 3 now holds with the least possible deviation from the higher-order
method. If $k \pm 1$ is also an extremum, set $\epsilon_{k \pm 1/2}$ large to damp the $2\Delta x$
component. If neither $k + 1$ nor $k - 1$ is an extremum, equation (8) has one
degree of freedom. Consider the following:

•  If $C^{+}_{k+1/2} < 0$ and $C^{-}_{k-1/2} > 0$ then:

\[ \epsilon_{k+1/2} = 2\left( -C^{+}_{k+1/2} + C^{-}_{k-1/2}\,\frac{\Delta_{k-1/2}u}{\Delta_{k+1/2}u} \right), \qquad \epsilon_{k-1/2} = 0 \]

•  If $C^{+}_{k+1/2} > 0$ and $C^{-}_{k-1/2} < 0$ then:

\[ \epsilon_{k-1/2} = 2\left( -C^{-}_{k-1/2} + C^{+}_{k+1/2}\,\frac{\Delta_{k+1/2}u}{\Delta_{k-1/2}u} \right), \qquad \epsilon_{k+1/2} = 0 \]

Various strategies may be employed when both coefficients are negative,
which occurs only near sonic points (where a(u) = 0). We do not wish to
address the, as yet, undeveloped art of sonic point capturing here.

Increasing the negative incremental coefficient corresponds to reducing the
downwind contribution. In the most extreme case, such as near shocks, the
method becomes fully upwind and first order. For example, if the basic
higher-order method is central differences:

\[ a_{k+1/2} \ge 0,\ a_{k-1/2} \ge 0: \qquad \epsilon_{k+1/2} = a_{k+1/2} + a_{k-1/2}\,\frac{\Delta_{k-1/2}u}{\Delta_{k+1/2}u} \le a_{k+1/2} \]

\[ a_{k+1/2} \le 0,\ a_{k-1/2} \le 0: \qquad \epsilon_{k-1/2} = -a_{k-1/2} - a_{k+1/2}\,\frac{\Delta_{k+1/2}u}{\Delta_{k-1/2}u} \le -a_{k-1/2} \]

(If these give $\epsilon < 0$, Corollary 3 is not violated and no viscosity is required.) If
$a_{k \pm 1/2}$ are non-negative, $\epsilon_{k+1/2} = a_{k+1/2} - a_{k-1/2} = O(\Delta x)$ when $\Delta_{k+1/2}u =
-\Delta_{k-1/2}u$. Thus, for a symmetric or nearly symmetric extremum moving to
the right, $\epsilon_{k+1/2}\Delta_{k+1/2}u = O(\Delta x^{2})$ and second order accuracy is retained.
Artificial viscosity increases as $|\Delta_{k+1/2}u|$ grows larger relative to $|\Delta_{k-1/2}u|$
but does not exceed $a_{k+1/2}$, the viscosity of Roe’s method. (A similar analysis
holds if $a_{k+1/2}$ and $a_{k-1/2}$ are non-positive. Similar conclusions also hold for
other choices of $\epsilon_{k \pm 1/2}$ in equation (8).)
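
The choice of minimal viscosity can be illustrated for central differences. This is a sketch under stated assumptions: the Burgers flux, a strict extremum (both differences nonzero), no sonic point, and a hypothetical function name. It applies the two one-sided choices eps_{k+1/2} = 2(-C+ + C- dm/dp) or eps_{k-1/2} = 2(-C- + C+ dp/dm), where dp and dm are the forward and backward differences, and then evaluates the augmented H_k, which should vanish when the viscosity is active.

```python
def f(u):
    """Burgers flux, chosen only for illustration."""
    return 0.5 * u * u

def a_half(ul, ur):
    """Interface speed a_{k+1/2}; a(u) = u for the Burgers flux."""
    return (f(ur) - f(ul)) / (ur - ul) if ur != ul else ul

def min_second_order_viscosity(u, k):
    """Return (eps_{k+1/2}, eps_{k-1/2}, H_k) for central differences.

    Assumes u_k is a strict extremum away from sonic points.
    """
    dp = u[k + 1] - u[k]                     # Delta_{k+1/2} u
    dm = u[k] - u[k - 1]                     # Delta_{k-1/2} u
    cp = -0.5 * a_half(u[k], u[k + 1])       # central C+_{k+1/2}
    cm = 0.5 * a_half(u[k - 1], u[k])        # central C-_{k-1/2}
    eps_p = eps_m = 0.0
    if cp < 0 and cm >= 0:
        eps_p = max(0.0, 2.0 * (-cp + cm * dm / dp))
    elif cp >= 0 and cm < 0:
        eps_m = max(0.0, 2.0 * (-cm + cp * dp / dm))
    # Sonic case (both negative) is deliberately left untreated here.
    H = (cp + 0.5 * eps_p) * dp - (cm + 0.5 * eps_m) * dm
    return eps_p, eps_m, H

u = [0.8, 1.0, 0.0]                          # maximum at k = 1, a > 0
eps_p, eps_m, H = min_second_order_viscosity(u, 1)
print(eps_p, eps_m, H)
```

For this data eps_{k+1/2} = 0.5 + 0.9 (0.2 / -1) = 0.32, which is below a_{k+1/2} = 0.5, and the augmented H_k is zero, so the maximum is held rather than increased.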

The above represents a lower bound on artificial viscosity. For a steady
state method to converge stably, one also needs to add viscosity in monotone
regions, and perhaps increase viscosity at extrema. We need to know $C^{\pm}$ and
the time-stepping method to say anything more specific.

3  Fourth  Order  Artificial  Viscosity 

Incremental form plus fourth order artificial viscosity gives:

\[
\begin{aligned}
H_k = {} & -\epsilon_{k+1/2}\Delta_{k+3/2}u + (C^{+}_{k+1/2} + 2\epsilon_{k+1/2} + \epsilon_{k-1/2})\Delta_{k+1/2}u \\
 & + \epsilon_{k-1/2}\Delta_{k-3/2}u - (C^{-}_{k-1/2} + 2\epsilon_{k-1/2} + \epsilon_{k+1/2})\Delta_{k-1/2}u
\end{aligned}
\]

where $\epsilon_{k+1/2} \ge 0$ is the coefficient of fourth order artificial viscosity. Fourth
order artificial viscosity increases the coefficients of $\Delta_{k \pm 1/2}u$, which tends to
enforce E1. However if, for example, $\Delta_{k+3/2}u$ is very large and negative, the
first term will overwhelm the others, causing $u_k$ to overshoot, violating E1
or E2. Since $u_k$ is too large, $u_{k-1}$ will tend to be too small; if $u_{k-1}$ is too
small, $u_{k-2}$ will tend to be too large; etc. In this way, the overshoot at k
causes oscillations to the left. This validates the common wisdom that fourth
order artificial viscosity should not be used near shocks. On the other hand,
fourth order viscosity works quite well in smooth regions, where it will tend to
enforce E1/E2 with less accuracy penalty than second order viscosity. It also
strongly damps $2\Delta x$ waves: if $u_k$ is a max in a $2\Delta x$ wave, all terms in $H_k$ are
negative, resulting in a decrease in $u_k$ (and similarly minima are increased). By
the Nyquist sampling theorem, $2\Delta x$ waves should be eliminated, since a grid
cannot accurately represent wavelengths shorter than $4\Delta x$ [9].
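
The damping of a 2Δx wave can be verified term by term. The sketch below evaluates the four terms of H_k for a sawtooth, taking the fourth order contribution in the form -eps_{k+1/2} Δ_{k+3/2}u + (C+ + 2 eps_{k+1/2} + eps_{k-1/2}) Δ_{k+1/2}u + eps_{k-1/2} Δ_{k-3/2}u - (C- + 2 eps_{k-1/2} + eps_{k+1/2}) Δ_{k-1/2}u. The coefficients C+ = 0, C- = 1 (first order upwinding for linear advection with unit speed), the constant viscosity 0.1 and the function name are assumptions made only for this illustration.

```python
def fourth_order_terms(u, k, cp, cm, eps_p, eps_m):
    """Four terms of H_k with fourth order artificial viscosity.

    cp = C+_{k+1/2}, cm = C-_{k-1/2},
    eps_p = eps_{k+1/2}, eps_m = eps_{k-1/2}.
    """
    def d(j):
        # d(j) = Delta_{j+1/2} u = u_{j+1} - u_j
        return u[j + 1] - u[j]
    return [
        -eps_p * d(k + 1),                      # Delta_{k+3/2} term
        (cp + 2 * eps_p + eps_m) * d(k),        # Delta_{k+1/2} term
        eps_m * d(k - 2),                       # Delta_{k-3/2} term
        -(cm + 2 * eps_m + eps_p) * d(k - 1),   # Delta_{k-1/2} term
    ]

u = [1.0, -1.0, 1.0, -1.0, 1.0]   # 2*dx sawtooth with a maximum at k = 2
terms = fourth_order_terms(u, 2, cp=0.0, cm=1.0, eps_p=0.1, eps_m=0.1)
H = sum(terms)
print(terms, H)
```

Every term is negative at the maximum, so the peak of the sawtooth is pulled down.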

It seems sensible to use a blend of second and fourth order artificial viscosity.
Consider the ‘self-adjusting hybrid method’ [10] in conservation form:

\[ h_{k+1/2} = \tilde{h}_{k+1/2} - \epsilon_{k+1/2}\left[ \theta_{k+1/2}(u_{k+1} - u_k) - (1 - \theta_{k+1/2})(u_{k+2} - 3u_{k+1} + 3u_k - u_{k-1}) \right] \]

where $\theta_{k+1/2}$ varies between 0 and 1: $\theta_{k+1/2}$ should be 1 near shocks and near
0 elsewhere and will formally depend on the choice of $\tilde{h}_{k+1/2}$, the conservative
flux of a higher-order method. In practice, the modified blending

\[ h_{k+1/2} = \tilde{h}_{k+1/2} - \epsilon_{k+1/2}\left[ \theta_{k+1/2}(u_{k+1} - u_k) - \max(0, \delta - \theta_{k+1/2})(u_{k+2} - 3u_{k+1} + 3u_k - u_{k-1}) \right] \]

allows greater flexibility in choosing $\theta_{k+1/2}$. The parameter $\delta$ is chosen so that
$\theta_{k+1/2}$ is greater than $\delta$ near shocks. Note that this leads to an adaptive width
stencil of fixed center: near shocks the stencil width shrinks from five points to
three. (Contrast this with the ENO stencil, which has fixed width and adaptive
centering [4].) Jameson’s method, a self-adjusting hybrid, has proven highly
successful for solving the steady Euler equations of a perfect gas [1]. In the
closest scalar equivalent, $\tilde{h}_{k+1/2}$ is central differences; $\epsilon_{k+1/2} = \kappa|a_{k+1/2}|$; and
$\theta$ is a normalized second difference of $u_k$. (In the original vector version, $\theta$ is
a normalized second difference of pressure; also, replace $|a_{k+1/2}|$ by $\rho(A_{k+1/2})$,
the spectral radius of some average of the Jacobian matrices $A(u_{k+1})$ and $A(u_k)$.
Recall that the spectral radius of a matrix with eigenvalues $\lambda_j$, $j = 1, \ldots, N$,
is defined as $\max(|\lambda_j|)$. The algorithm is applied to each component of the
vector $u_k$.) If $\kappa = 1/2$, $\delta = 1$, and $\theta_{k+1/2} = 1$, then the scalar method
becomes Roe’s method, which is optimal near steady shocks. However, usually
one would choose $\kappa$ somewhat less than 1/2, trading accuracy at shocks for
accuracy in smooth regions. In general, $\delta$ and $\kappa$ are chosen by trial-and-error to
yield the best accuracy compromise in each particular situation. Despite its
success for steady Euler equations, it may not be easy or even possible to discover
a satisfactory $\theta$ for use with other equations or for unsteady problems. Also,
while intuitively appealing, it is unclear how to choose $\theta$ to guarantee rigorously
E1/E2, or any other sufficient oscillation control condition.
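
The reduction to Roe's method can be confirmed with a short sketch of the scalar blended flux: central flux minus eps [theta (u_{k+1} - u_k) - max(0, delta - theta)(u_{k+2} - 3u_{k+1} + 3u_k - u_{k-1})] with eps = kappa |a_{k+1/2}|. The Burgers flux, the function names, and the treatment of theta as a plain number supplied by the caller (rather than a normalized second difference) are assumptions made for illustration.

```python
def f(u):
    """Burgers flux, chosen only for illustration."""
    return 0.5 * u * u

def a_half(ul, ur):
    """Interface speed a_{k+1/2}; a(u) = u for the Burgers flux."""
    return (f(ur) - f(ul)) / (ur - ul) if ur != ul else ul

def hybrid_flux(u, k, theta, kappa, delta):
    """Scalar self-adjusting hybrid flux h_{k+1/2} (modified blending)."""
    eps = kappa * abs(a_half(u[k], u[k + 1]))
    central = 0.5 * (f(u[k + 1]) + f(u[k]))
    first_diff = u[k + 1] - u[k]
    third_diff = u[k + 2] - 3 * u[k + 1] + 3 * u[k] - u[k - 1]
    return central - eps * (theta * first_diff
                            - max(0.0, delta - theta) * third_diff)

def roe_flux(u, k):
    """Roe flux h_{k+1/2} in viscosity form with q = |a|."""
    a = a_half(u[k], u[k + 1])
    return 0.5 * (f(u[k + 1]) + f(u[k]) - abs(a) * (u[k + 1] - u[k]))

u = [0.0, 1.0, 0.5, -0.5, -1.0, 0.2]
gaps = [abs(hybrid_flux(u, k, theta=1.0, kappa=0.5, delta=1.0) - roe_flux(u, k))
        for k in range(1, len(u) - 2)]
print(max(gaps))
```

With theta = 1 and delta = 1 the third-difference term is switched off, and kappa = 1/2 turns the remaining second order viscosity into |a|/2 times the first difference, which is the Roe flux.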


4  Multi-dimensional  Equations 


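Our theory easily extends to multi-dimensional scalar equations. Consider the
two-dimensional equation:

\[ \frac{\partial u}{\partial t} + \frac{\partial f(u)}{\partial x} + \frac{\partial g(u)}{\partial y} = 0 \tag{9} \]

When will the semi-discrete, finite-difference approximation $u_k(t)$ given by:

\[ \frac{du_k}{dt} = \frac{1}{\Delta x_k}\,H_k(t) + \frac{1}{\Delta y_k}\,G_k(t) \tag{10} \]

have properties E1 and E2? Theorem 2 is unchanged while Theorem 1 becomes

Theorem 4: Equation (10) has property E1 if

\[ \frac{1}{\Delta x_k}\,H_k(t) + \frac{1}{\Delta y_k}\,G_k(t) \le 0 \quad \text{for } u_k(t) \text{ maximum} \]

\[ \frac{1}{\Delta x_k}\,H_k(t) + \frac{1}{\Delta y_k}\,G_k(t) \ge 0 \quad \text{for } u_k(t) \text{ minimum} \]

Corollary 5: Equation (10) has property E1 if

$H_k(t) \le 0$ and $G_k(t) \le 0$ for $u_k(t)$ max

$H_k(t) \ge 0$ and $G_k(t) \ge 0$ for $u_k(t)$ min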

This corollary justifies the common approach of adding artificial viscosity on
a dimension-by-dimension basis, at least in the scalar case. For one-dimensional
hyperbolic systems of conservation laws, the characteristic variables will have
properties E1 and E2; unfortunately, in more than one dimension, E1/E2 may
not hold for all characteristic variables in all regions of flow [9]. For nonlinear
vector problems, the characteristic variables can be found only
approximately; thus, it is not yet clear how one might rigorously guarantee E1/E2
when appropriate.

5  Conclusions 

Artificial  viscosity,  TVD,  UNO,  and  ENO,  among  other  recent  methods,  can 
successfully  combat  the  spurious  oscillations  commonly  found  in  higher-order 
approximations  to  weak  solutions  of  conservation  laws.  Here  we  have  presented 
a  new  framework  for  understanding  the  action  of  artificial  viscosity  and,  in  a 
subsequent paper, we use the same approach to elucidate TVD, UNO, and ENO
conditions [9].

We end with a brief discussion of the potential practical implications of this
work. In the common ‘method-of-lines’ approach, the fully discrete method
derives from a semidiscrete method. Popular time-discretizations include
forward Euler, Lax-Wendroff (more generally, Cauchy-Kowalewski), and
Runge-Kutta. The time-discretization may or, more likely, may not fully preserve
extremum control properties. For steady problems, extremum control at
intermediate times matters only to the extent that it affects the rate of
convergence. For unsteady solutions, one can sometimes tolerate the effects of the
time-discretization with good results, particularly if the semidiscrete method is
overdamped, e.g., [11]. However, rigorous enforcement of E1/E2 in unsteady
problems requires consideration of the time-discretization along with the spatial
discretization. We currently are engaged in an analysis of the forward Euler and
Lax-Wendroff time-discretizations. We hope that success in the simplest cases
will pave the way to understanding more efficient and accurate time-stepping
algorithms, including Runge-Kutta.
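
As one concrete instance of the method-of-lines approach, the sketch below advances Roe's first order method with forward Euler on a periodic grid and records the extreme values after each step. The Burgers flux, the grid size, the ratio of time step to grid spacing (0.4) and the sine-wave initial data are assumptions chosen for illustration; this is not the analysis referred to in the text. With non-negative coefficients and a small enough time step, each new value is a convex combination of its neighbours, so the maximum cannot grow and the minimum cannot shrink.

```python
import math

def step(u, nu):
    """One forward Euler step of Roe's method for the Burgers flux.

    nu is the ratio of time step to grid spacing; the grid is periodic.
    """
    n = len(u)
    new = []
    for k in range(n):
        up, um = u[(k + 1) % n], u[k - 1]
        cp = max(0.0, -0.5 * (u[k] + up))   # C+_{k+1/2}
        cm = max(0.0, 0.5 * (um + u[k]))    # C-_{k-1/2}
        new.append(u[k] + nu * (cp * (up - u[k]) - cm * (u[k] - um)))
    return new

n, nu = 50, 0.4
u = [math.sin(2 * math.pi * k / n) for k in range(n)]
history = [(max(u), min(u))]
for _ in range(100):
    u = step(u, nu)
    history.append((max(u), min(u)))
print(history[0], history[-1])
```

Here |u| never exceeds 1, so nu (C+ + C-) stays below 0.8 and the convexity argument applies at every step.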

ABSTRACT. An approach to solve a pressing practical problem,
aerodynamic heating of hypervelocity projectiles, is discussed. Since huge
amounts of supercomputer time are needed for simulation, a thorough and
detailed investigation is initiated on numerical methods for transient
three dimensional heat transfer around hypervelocity projectiles and for
establishing critical time steps. A methodology is established for
comparing various numerical methods, in particular, finite-element, finite
difference and the method of weighted-residuals (MWR). Starting with the
variational principle, an equivalent finite-element equation with 27
nodal temperatures of a three-dimensional element is established.

Similar equations are formulated for finite-difference and MWR, in
particular for collocation and Galerkin techniques. The critical
time-steps are derived by the von Neumann method for various numerical methods and
include 1-D, 2-D, and 3-D transient heat conduction problems. A
comparison shows a drastic need for robust and efficient codes due to not
only an increase in bandwidth, number of nodes (by several orders of
magnitude), etc. but also a decrease in critical time-step (by more than an
order of magnitude) for multi-dimensional problems.

INTRODUCTION. The Army Research Office recognized aerodynamic heating
of hypervelocity projectiles as a very complicated problem with many
facets, each of which offers formidable challenges. This project was
initiated in FY 90. The motivation for this paper comes from the needs of
this project. The Department of Defense prepared a list of twenty-two
critical technologies for Congress. The following are pertinent to this
project: Hypervelocity Projectiles, where the USSR is ahead; Computational
Fluid Dynamics (CFD); Parallel Computers; Simulation and Modeling;
Software Producibility; and Composite Materials. The ultimate goal to be
accomplished is to prepare a self-contained simulation package covering
the time from when a weapon is fired until the hypervelocity projectile
reaches the target.

*County College of Morris/Rutgers University, College of Engineering,

The thermal package involves convection, conduction and radiation in
addition to other complications such as unsteady, three dimensional and
hypervelocity (viscous-inviscid interaction and non-equilibrium flow)
effects. For example, convective heat transfer may be modeled from a CFD
package which is governed by the full three dimensional Navier-Stokes
equations. A typical computer run may take more than 20 hours on a Cray
supercomputer in order to simulate one supersonic flow field around a
projectile: Transient 3-D Navier-Stokes Solver, 500K nodes, 10 seconds
per iteration, 8000 iterations.
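
The run-time estimate follows directly from the quoted figures, as the short calculation below shows; the variable names are illustrative.

```python
seconds_per_iteration = 10
iterations = 8000

total_seconds = seconds_per_iteration * iterations
total_hours = total_seconds / 3600
print(total_seconds, round(total_hours, 1))
```

Eighty thousand seconds is a little over 22 hours, consistent with the statement that a typical run takes more than 20 hours.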

The transient three dimensional heat conduction model will provide a
means to determine the temperature distribution as a function of location
(3 dimensions) and time for any given initial and boundary conditions.

The boundary conditions are usually obtained from CFD. There are
occasions where there is a strong coupling between convective flow and
conduction. No matter what method is utilized, all require great computer
storage and large amounts of computer time. The solution process is not
only subject to these restrictions but also bound to blow up in the
middle if accuracy, stability, and nonoscillation characteristics are not
taken into account by proper selection of numerical techniques. If a
combined convection and conduction problem is attempted in one step,
failure in one area can lead to a losing proposition in both areas.
Therefore, a search is initiated to find an accurate, robust and efficient
numerical scheme for the solution of transient three dimensional heat
conduction problems.

Various numerical methods are in use today. The most popular methods
are finite-element (FE), finite-difference (FD), and weighted residuals
(MWR). It is not an easy task to single out the best one. In any case,
one has to bring all these methods into the same format in order to make
any meaningful comparison.

FINITE ELEMENT DIFFERENCE EXPRESSION

Wilson and Nickell, following Gurtin's discussion of variational
principles for linear initial value problems, confirmed that the function
T(x,y,z,t) which leads to an extremum of the functional

\[
\Omega = \frac{1}{2}\int_V \left\{ \rho C_p\,T * T + \nabla T * K * \nabla T - 2\rho C_p\,T_0 * T \right\} dV
- \int_S \hat{Q}_i n_i * T \, dS
\]

is the solution of the following transient heat-conduction equation:

\[ (K * T_{,i})_{,i} - \rho C_p\,T + \rho C_p\,T_0 = 0 \]

with the boundary condition $K * T_{,i} - \hat{Q}_i = 0$,

where

T(x,y,z,t) = temperature at the spatial point (x,y,z) and at time t
$T_0$ = initial temperatures
$\nabla T$ = gradient of T with respect to spatial coordinates
K = thermal conductivity
$\rho$ = material density
$C_p$ = heat capacity of the material per unit mass
$\hat{Q}_i(x,y,z,t) = \int_0^t Q_i(x,y,z,\tau)\,d\tau$
V = volume
* = convolution symbol defined as:

\[ T * T = \int_0^t T(x,y,z,t-\tau)\,T(x,y,z,\tau)\,d\tau \]

\[
\nabla T * \nabla T = \frac{\partial T}{\partial x} * \frac{\partial T}{\partial x}
+ \frac{\partial T}{\partial y} * \frac{\partial T}{\partial y}
+ \frac{\partial T}{\partial z} * \frac{\partial T}{\partial z}
\]


Divide the three dimensional solid body into I axial elements (nodes 0 to
I), J transverse elements (nodes 0 to J) and K normal elements (nodes 0 to
K) such that step sizes are the same in all three directions. This
restriction is introduced to simplify the algebraic manipulations involved in
the analysis. Instead, a unit step size can be assumed without any loss of
generality and generalized later.


Consider the nodal point (i,j,k), in the range 0 < i < I, 0 < j < J, 0
< k < K, as shown in Figure 1. The temperature of the nodal point will
vary as a function of time, t. The temperature distribution in a
subregion is a function of spatial coordinates (x,y,z) and surrounding
nodal point temperatures. For simplicity, linearity and the same
functional distribution are assumed for all elements. The functions
U ,  and  *8 are functions of nodal point temperatures.
These are determined by substitution of the coordinates of nodal points
into the equation and by solving the resulting simultaneous equations.

The results for region XL are as follows:


A similar set of equations can be derived for the remaining seven regions of
the 3 dimensional finite element discussed above. It is now time to
substitute all equations derived into all three terms of the functional
(the governing equation), integrate over the volume occupied by region U,
and take the first variation with respect to $T_{ijk}$ at the new time in order
to obtain the extremum of the functional. Obtain similar results for the
other regions and sum them up. The procedures for the first and third
terms of the governing equation are the same. However, a double convolution symbol
is involved in the second term. Here the evaluation of the second term in
the governing equation involves integration not only over the volume (and
the use of the first variation with respect to the nodal temperature at
the new time) but also over the time-step, due to the additional
convolution symbol. Towards this goal, a linear nodal point temperature
variation is assumed within each time-step. Summing up the results of all
three terms produces an equivalent finite-element equation in a form
familiar to the finite-difference community.

The details of the derivation can be found in the Journal of Heat
Transfer, Transactions of ASME, for a two-dimensional case. It is beyond
the scope of this paper to provide all those tedious derivations and long
equations; they may be found in a journal to be published soon for the three
dimensional case. The equivalent finite-element equation contains all 27
nodal temperatures of a three-dimensional element, shown above, at both old
and new times.


FINITE  DIFFERENCE  EXPRESSION 

There are various versions of finite-difference approximations to the
transient heat conduction problems. However, all these schemes can be
classified as either explicit or implicit type. In the case of an explicit
scheme, the unknowns are determined, one at a time, without the
simultaneous solution of the entire set of algebraic equations. This, in
turn, produces tremendous savings in computer time, almost a one to three
ratio over the implicit counterparts. However, the explicit schemes are
usually conditionally stable and demand small time-steps.

If an implicit finite-difference scheme is chosen, one equation is
generated for each node in the entire body, and finally the simultaneous
solution of all these algebraic equations is required. Numerous difference
formulas can be formulated for first and second order derivatives and also
for the three dimensional Laplacian term, depending upon the number of nodes
[...] those nodes. Usually, the difference formulas are constructed for first
and second order derivatives by expansion of the function in a Taylor
series and algebraic manipulation. The three dimensional Laplacian is
constructed by use of the central difference formulas for the second order
derivatives.

The simplest 7-point approximation for the three dimensional Laplacian can
be written as:

$$\frac{T_{i+jk} + T_{i-jk} + T_{ij+k} + T_{ij-k} + T_{ijk+} + T_{ijk-} - 6\,T_{ijk}}{\Delta L^2}$$

This is $O(\Delta L^2)$. One can also write a 19-point approximation, which is
$O(\Delta L^4)$, for the three dimensional Laplacian. Similarly, one can also
prepare a 27-point approximation, which is $O(\Delta L^6)$, for the three
dimensional Laplacian. There are many more possibilities for a 3-D Laplacian.
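The 7-point formula can be illustrated with a short sketch. The function names, the nested-list storage, and the test field T = x^2 + y^2 + z^2 are assumptions made for this illustration and do not come from the paper. For a quadratic field the central differences are exact, so the result should equal the analytical Laplacian, 6.

```python
def laplacian_7pt(T, i, j, k, dL):
    """Seven-point central-difference Laplacian at interior node (i, j, k)."""
    return (T[i + 1][j][k] + T[i - 1][j][k]
            + T[i][j + 1][k] + T[i][j - 1][k]
            + T[i][j][k + 1] + T[i][j][k - 1]
            - 6.0 * T[i][j][k]) / dL ** 2


def sample_field(n, dL):
    """Tabulate T = x^2 + y^2 + z^2 on an n x n x n grid with spacing dL."""
    return [[[(i * dL) ** 2 + (j * dL) ** 2 + (k * dL) ** 2
              for k in range(n)]
             for j in range(n)]
            for i in range(n)]


dL = 0.5
T = sample_field(5, dL)
# Evaluate at the central node; the exact Laplacian of the test field is 6.
value = laplacian_7pt(T, 2, 2, 2, dL)
```

The same stencil is applied at every interior node; boundary nodes need separate treatment, which is not shown here.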

However, this suffices to unify or compare the various numerical methods for
the time being. Remember, apples can't be compared to oranges or vice
versa. They all have to be brought to the same format before a meaningful
comparison can be made.

THE METHOD OF WEIGHTED-RESIDUALS (MWR) EXPRESSIONS

The method of weighted residuals unifies many approximate methods of
solution of differential equations that are in use today. Variational
principles proposed by several authors are all applications of the MWR.

In the literature, this technique is commonly called the error distribution
principle. The choice of approximating function, in an assumed solution
form, is crucial in applying the MWR. No way presently seems to be
available to select the approximating functions systematically for all
problems. The variation between results obtained by applying different
weighting functions to the same approximating solution is much less
significant than the variations that can result from the choice of
different approximate solution forms. Sometimes, one can obtain the exact
solution by use of the MWR if the right choice is made in the selection of
the approximate solution form.

The objective of applying the method of weighted residuals is to
minimize the error by distributing it over the interval with the help
of a weighting function in such a way that the net error is zero. There
are many variations of it. The most popular ones are the method of
collocation, the method of moments, the Galerkin method, and the method of
least squares. However, this study is limited to the method of collocation
and the Galerkin method. The Dirac delta function is the weighting function
in the method of collocation. In the Galerkin method, the weighting function
is the same as the distribution function (in the approximate solution form).
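Mathematically,

$$\iiint R(x,y,z)\, W(x,y,z)\, dx\, dy\, dz = 0$$

where W is the weighting function and R, the residual, can be written as

$$R(x,y,z) = \phi \left( \frac{\partial^2 T}{\partial x^2} + \frac{\partial^2 T}{\partial y^2} + \frac{\partial^2 T}{\partial z^2} \right)^{+} + (1-\phi) \left( \frac{\partial^2 T}{\partial x^2} + \frac{\partial^2 T}{\partial y^2} + \frac{\partial^2 T}{\partial z^2} \right) - \frac{T^{+} - T}{\alpha\, \Delta t}$$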


The parameter $\phi$ allows a weighted average of the three second order
spatial derivatives at two discrete times. For compactness, the commas
between subscripts are omitted. The algebraic sign following a subscript
or superscript indicates an increment (+) or decrement (-) in the
corresponding step-size. A linear temperature distribution is assumed
between adjacent discrete points in order to apply the Galerkin
method. Substituting the residual, weighting functions and 3-D Laplacian
over 8 regions and performing the integration yields equations
similar to those of the finite element and finite difference methods.
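The effect of Galerkin weighting with a linear distribution can be seen in a one-dimensional sketch. This is a simplification for illustration only: the paper integrates over 8 regions in 3-D, whereas the code below assumes a single coordinate, unit node spacing, and the two linear shape functions 1 - s and s on one interval. All names are hypothetical.

```python
from fractions import Fraction


def integrate_product(p, q):
    """Integrate p(s) * q(s) over [0, 1].

    Polynomials are coefficient lists with the constant term first.
    """
    product = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            product[i + j] += Fraction(a) * Fraction(b)
    # The integral of s^n over [0, 1] is 1 / (n + 1).
    return sum(c / (n + 1) for n, c in enumerate(product))


N_left = [1, -1]   # shape function 1 - s
N_right = [0, 1]   # shape function s

self_weight = integrate_product(N_left, N_left)     # same node, one interval
cross_weight = integrate_product(N_left, N_right)   # neighbouring node

# An interior node receives the self weight from both adjacent intervals.
node_row = (cross_weight, 2 * self_weight, cross_weight)
```

Under these assumptions the weighted time-derivative term is spread over three neighbouring nodes with weights 1/6, 2/3, 1/6, instead of being lumped at the central node as in the usual finite-difference form.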


CRITICAL TIME STEPS

Various numerical methods, from classical to modern approaches, have been
introduced for transient 3-D heat conduction problems. However, all
require some guidelines in order to obtain successful numerical solutions
subject to various initial and boundary conditions. We are forced to
select finite step sizes, both in time and in space, to satisfy practical
considerations. The temperatures and material properties change
continuously within each time step. However, it is not uncommon to
integrate with respect to time based on the information known only at the
beginning of the time-step (Euler) or by using data equally from both
the beginning and the end of the time-step (Crank-Nicolson). Other
variations are possible. Since the techniques vary considerably, the
errors can either grow or decay. The errors also vary not only with
respect to time but also from location to location. It is possible that
a less accurate scheme may be more accurate in one location than a more
accurate scheme, and vice versa.

Numerous authors discuss accuracy, and it is more or less understood in the
following way: the RMS error and the absolute maximum error can be
computed from the difference between the analytical and the numerical
solution and include error contributions from every grid point. Double
precision arithmetic is usually used to minimize the effects of roundoff
error. Since the RMS error and the absolute maximum error behave
similarly, the RMS error is used in determining the accuracy of a scheme.
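The two error measures can be written down in a few lines. The sketch assumes that the numerical and analytical temperatures are stored as flat lists over all grid points and that the RMS error is the square root of the mean squared difference; the paper does not state its normalization, and the function names are illustrative.

```python
import math


def rms_error(numerical, analytical):
    """Root-mean-square difference over all grid points."""
    diffs = [n - a for n, a in zip(numerical, analytical)]
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


def max_abs_error(numerical, analytical):
    """Largest absolute difference over all grid points."""
    return max(abs(n - a) for n, a in zip(numerical, analytical))


# Three grid points, with an error of 2 at the last one only.
rms = rms_error([1.0, 2.0, 5.0], [1.0, 2.0, 3.0])
worst = max_abs_error([1.0, 2.0, 5.0], [1.0, 2.0, 3.0])
```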

In general, the error is a function of time, time-step, and spatial
step-sizes. If sufficient time is allowed to communicate the influence of
the boundary conditions to the interior points, the error decreases as time
increases thereafter. Therefore, the steady state error is smaller than
the transient errors, and the selection of numerical scheme may not be that
critical for steady state cases.

It is well known that selection of a large time-step can lead to a
meaningless oscillatory numerical solution. The largest time-step for
which an Euler solution will be stable is called the critical time-step.
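The existence of a critical time-step is easy to observe numerically. The sketch below assumes the simplest setting, one-dimensional explicit (Euler) finite differences with fixed zero temperatures at both ends and a unit temperature spike at the middle node; the grid size and step count are arbitrary choices for illustration. For this scheme the stability limit is a Fourier number of 1/2.

```python
def euler_step(T, F):
    """One explicit (Euler) step; end nodes keep their boundary values."""
    new = list(T)
    for i in range(1, len(T) - 1):
        new[i] = T[i] + F * (T[i + 1] - 2.0 * T[i] + T[i - 1])
    return new


def peak_after(F, steps=100, nodes=11):
    """Largest |T| after `steps` steps, starting from a unit spike."""
    T = [0.0] * nodes
    T[nodes // 2] = 1.0
    for _ in range(steps):
        T = euler_step(T, F)
    return max(abs(t) for t in T)


stable_peak = peak_after(0.4)    # Fourier number below 1/2: spike decays
unstable_peak = peak_after(0.6)  # Fourier number above 1/2: oscillations grow
```

With the smaller Fourier number the peak never exceeds its initial value, while with the larger one the node-to-node oscillation is amplified at every step.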

Of course, if the Euler solution is stable, the Crank-Nicolson solution is
nonoscillatory for the same critical time-step. The critical time-step can
be derived either by the Von Neumann method or by other techniques. The
following table provides information for selection of critical time-steps
for various numerical methods and heat conduction problems.

MULTI-DIMENSIONAL EFFECTS

FOURIER NUMBER (F) = α Δt / ΔL²


                 FOURIER NUMBER

DIMENSION(S)     FD        FE

1-D              1/2       1/6

2-D              1/4       1/12

3-D              1/8       1/24

CONCLUSIONS. The critical time-steps shown above form the guiding light
on the use of numerical methods. They help to avoid wastage of precious
supercomputer time. The table also demonstrates that the finite-element
techniques require smaller time-steps than their finite-difference
counterparts. Another interesting fact is the drastic reduction in time-step
size as the number of dimensions increases. Overall, a reduction in time-step
of more than an order of magnitude is not uncommon, depending upon the chosen
numerical method and multidimensional effects. Remember that the number
of nodes increases by several orders of magnitude for multi-dimensional
problems over their one-dimensional counterparts. As far as band-width is
concerned, the typical one-dimensional problem may contain three non-zero
diagonal terms whereas a three dimensional one may have 27 non-zero diagonal
terms. All these adverse effects point to a drastic need for development
of robust and efficient codes.

In summary, a practical hypervelocity projectile heating problem is
analyzed. The principal subproblem, the solution of transient
three-dimensional heat conduction by various numerical methods, is
investigated in detail. A methodology is established for comparison of
several numerical methods, in particular, the finite-element,
finite-difference, and weighted-residual methods. The critical time-steps
are derived by the Von Neumann method for several cases.

A comparison shows a drastic need for robust and efficient codes, due
not only to an increase in band-width and number of nodes by several
orders of magnitude but also to a decrease in critical time-steps by more
than an order of magnitude for multi-dimensional problems. Even if the
time-step shown above is satisfactory for many problems, a further reduction
is in order for problems subjected to convection and radiation boundary
conditions, as suggested by Professor Regers of The University of
Wisconsin-Madison, based on analysis of 1-D and 2-D problems.

Abstract

This paper is a survey on Symbolic Computation. We first propose a new
definition of the term “Symbolic Computation”: Symbolic Computation is
computation with objects that are “symbolic” in the sense that they are
finite representations (symbols) for infinite, abstract objects in the
domains of pure mathematics. In the second part of the paper we summarize
the facilities available in modern symbolic computation software systems.
In the third part we give some examples of recent mathematical research
results on which improved symbolic computation algorithms can be based.
Finally, we compile the most important literature and software systems
references for encouraging newcomers to access these powerful new problem
solving techniques.

The  main  message  of  this  paper  is  twofold: 

• Applied mathematicians are encouraged to experiment with the available
Symbolic Computation software systems and, by doing so, will experience an
enormous expansion of their problem solving potential.

• Pure mathematicians are encouraged to consider the “algorithmization” of
their respective field of interest and will experience the enormous
intellectual and mathematical challenge of such a project that often goes
far beyond the degree of difficulty found in “traditional” pure mathematics.


1  A  New  Definition  of  Symbolic  Computation 

Pragmatically, one could “define” Symbolic Computation (symbolic mathematics,
computer algebra, formula manipulation) to encompass everything that is
available in present day symbolic computation systems like MACSYMA, Maple,
Mathematica or Scratchpad. Of course, a definition by characterizing
properties of the field is more desirable.

Sometimes, Symbolic Computation is simply characterized as “Non-Numerical
Computation”. This characterization, however, is too wide. Many non-numerical
algorithms, for example, graph theoretical algorithms, are not considered to
be symbolic. Similarly, a definition that characterizes Symbolic Computation
simply as “computation with symbols” is too wide because, for example, word
processing considers symbols without having the flavor of symbolic
computation.

In an attempt to characterize the flavor of computations in existing
“Symbolic Computation” software systems with a view to predict and challenge
what should and could be added to these systems in the future, we believe
that the fundamental feature of Symbolic Computation is that


Symbolic  Computation  = 
computation  with  finite  (concrete)  objects 
having  infinite  (abstract)  semantics. 

Example: The string “sin(2x)” is a finite object. It represents an infinite
object, namely, a mathematical function consisting, in terms of set theory,
of infinitely many pairs of real numbers. In other words, the symbolic,
finite, concrete expression “sin(2x)” has an infinite semantics (meaning) in
an abstract domain of mathematics.
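The distinction between the finite object and its infinite semantics can be made concrete in a few lines. The nested-tuple encoding and the names `EXPR` and `evaluate` are choices made for this sketch, not a representation used by any of the systems mentioned. The tuple is a finite data structure, while `evaluate` gives access to the function it denotes at any real argument.

```python
import math

# Finite, concrete object: a nested tuple standing for sin(2 * x).
EXPR = ("sin", ("mul", ("const", 2.0), ("var", "x")))


def evaluate(expr, x):
    """Map the finite expression to the value of the function it denotes."""
    tag = expr[0]
    if tag == "const":
        return expr[1]
    if tag == "var":
        return x
    if tag == "mul":
        return evaluate(expr[1], x) * evaluate(expr[2], x)
    if tag == "sin":
        return math.sin(evaluate(expr[1], x))
    raise ValueError("unknown expression tag: %r" % (tag,))


# One of the infinitely many (argument, value) pairs of the function.
value = evaluate(EXPR, math.pi / 4)
```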

Example: The string “((x, y), (28, 33))” is a finite object that may be
considered as a concise representation of a very complex (“practically
infinite”) abstract mathematical object, namely, a finite group with 6912
elements and a 6912 x 6912 multiplication table.

Example: A formula of the theory of real closed fields, for example,
$\forall a_0, a_1, a_2\ \exists x\,(x^3 + a_2 x^2 + a_1 x + a_0 = 0)$, is a
finite string object. It says something about infinitely many abstract
objects, namely, real numbers. If, by finite manipulation on such finite
formula objects, the formula can be proven, it says something about an
infinite abstract domain, i.e. the formula has an infinite semantics.

Problem  solving  in  Symbolic  Computation,  then,  proceeds  in  the  following  steps: 

• We want to solve a problem in a domain of infinite, or complex finite,
“abstract” objects (a domain of “pure” mathematics).

• We represent (a subset of) these abstract objects by finite representations
(“symbolic” representations).

• We try to solve the problem for the abstract domain by solving, by an
algorithm, the corresponding problem for the finite representations.

Example:  The  famous  problem  of  “symbolic  integration”  is  specified  as  follows:  Given 
a  symbol  string  s  find  a  symbol  string  t  such  that  the  function  represented  by  t  is  the  anti- 
derivative  of  the  function  represented  by  s.  Note  that  the  problem  specification  necessarily 
involves  the  semantics  of  the  string  objects,  i.e.  the  notion  “the  function  represented  by 
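For a very small class of functions the specification can be met directly. The sketch below restricts attention to polynomials, represented as coefficient lists with the constant term first; this representation and the function names are assumptions for illustration, and general symbolic integration requires far deeper algorithms than this. The check at the end uses the semantics: the derivative of the function represented by t is the function represented by s.

```python
from fractions import Fraction


def antiderivative(coeffs):
    """Coefficient list of an antiderivative (integration constant 0)."""
    return [Fraction(0)] + [Fraction(c) / (n + 1) for n, c in enumerate(coeffs)]


def derivative(coeffs):
    """Coefficient list of the derivative."""
    return [n * Fraction(c) for n, c in enumerate(coeffs)][1:]


s = [1, 0, 3]          # finite representation of 1 + 3 x^2
t = antiderivative(s)  # finite representation of x + x^3
roundtrip = derivative(t)
```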

As a pun, we could also say that

Symbolic Computation =

Pure  Computer  Mathematics 

with two possible parsings: Symbolic Computation is computerization of pure
mathematics (i.e. mathematics in abstract domains) and Symbolic Computation
is computer mathematics in its purest form.

It  is  a  common  misunderstanding  that 

Symbolic  Computation  = 

Trivial  Mathematics 
Repeated  in  Loops  on  a  Computer 

and therefore is the domain of “mathematicians” who want to escape the
intrinsic difficulties of inventing proofs by, instead, “experimenting” with
well-known existing mathematics on examples using computers.

I  dare  to  assert  that,  rather, 

Pure  Computer  Mathematics  typically  needs  more  sophisticated  mathematics 
(i.e.  more  sophisticated  proofs)  than  Pure  Mathematics  ! 

This  is  so  because  problem  solving  in  mathematics  essentially  proceeds  by  proving 
theorems  that  show  how  a  problem  can  be  reduced  to  other  (hopefully  easier)  problems. 

• In Pure Mathematics the problem reduction (proof of the theorem) may involve
very powerful, non-algorithmic operators (e.g. “choose an x such that P(x)”).

• In Pure Computer Mathematics (Symbolic Computation) the problem reduction
must involve only algorithmic operators (e.g. “while P(x) do x := f(x)”).

Pure Computer Mathematics, therefore, often is more demanding than Pure
Mathematics because fewer tools are available in the algorithmic reduction
methods of constructive proofs (that can be translated, one-to-one, to
computer algorithms).
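A classical instance of such an algorithmic reduction is Euclid's algorithm, chosen here as an illustration; it is not an example from the paper. The underlying theorem, gcd(a, b) = gcd(b, a mod b), reduces the problem to an easier one, and the proof translates one-to-one into a loop of the form “while P(x) do x := f(x)”, where x is the pair (a, b).

```python
def gcd_by_reduction(a, b):
    """Greatest common divisor by repeated problem reduction.

    State x = (a, b); condition P(x): b != 0; step f(x) = (b, a mod b).
    """
    while b != 0:
        a, b = b, a % b
    return abs(a)


result = gcd_by_reduction(1071, 462)
```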

(On the other hand, one can also argue that Pure Mathematics is more demanding
than Pure Computer Mathematics because, by results of algorithm theory, Pure
Computer Mathematics can sometimes represent only relatively modest subdomains
of the abstract general domains of Pure Mathematics and, thus, the problem
reductions of Pure Mathematics are more general and therefore sometimes more
difficult.)

Summarizing, 

• Symbolic Computation is an exciting and challenging future area for pure
mathematicians from all areas (because more sophisticated proofs are needed).

•  Symbolic  Computation  becomes  more  and  more  powerful  for  all  application  areas 
(because  the  more  sophisticated  proofs  result  in  “better”  problem  solving  methods). 

• Symbolic Computation combines the elegance and “insight” of pure mathematics
with the practical efficiency of computer mathematics.


2 Symbolic Computation at Work

In  order  to  be  able  to  assess  the  practical  problem  solving  power  of  recent  symbolic 
computation  software  systems  it  is  best  to  experiment  with  at  least  one  of  them,  typically 
in  interactive  mode  on  a  workstation.  In  the  final  section  of  the  paper  we  compile  the 
information  necessary  to  obtain  these  systems  from  the  academic  or  professional  vendors. 
Also,  it  may  help  to  read  the  papers  (Arney  et  al.  1990)  and  (Wang  1990)  in  the  same 
proceedings. 

In  this  section  we  can  only  enumerate  the  most  important  facilities  these  systems 
typically  provide. 

Arithmetic  in  Basic  Domains: 

Symbolic  computation  software  systems  provide 

•  “long”  integers  and  rationals, 

•  high  precision  floating  point  numbers, 

•  other  number  domains  (e.g.  finite  fields,  algebraic  number  fields), 

•  basic  number  theoretic  functions  (e.g.  Extended  GCD,  Moebius  function  etc.), 

• basic combinatorial functions (e.g. Stirling numbers, Bernoulli numbers etc.),

• polynomials and rational functions,

• classes of orthogonal polynomials (e.g. Legendre etc.),

•  elementary  transcendental  functions  and  mathematical  constants, 

•  special  functions  (e.g.  Bessel,  Gamma,  exponential  integral  etc.), 

•  simplification  of  expressions  involving  all  of  the  above, 

•  coercion  and  transformation  between  different  number  domains. 
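A few of these facilities can be imitated with the Python standard library, which gives a feel for what exact arithmetic means. This is only a sketch: the helper `extended_gcd` and the variable names are written for this illustration and are not taken from any of the systems discussed.

```python
from fractions import Fraction


def extended_gcd(a, b):
    """Return (g, u, v) with g = gcd(a, b) = u*a + v*b, for a, b >= 0."""
    old_r, r = a, b
    old_u, u = 1, 0
    old_v, v = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_u, u = u, old_u - q * u
        old_v, v = v, old_v - q * v
    return old_r, old_u, old_v


big = 2 ** 200 + 1                       # "long" integer, no overflow
half = Fraction(1, 3) + Fraction(1, 6)   # exact rational arithmetic
g, u, v = extended_gcd(240, 46)          # Extended GCD with cofactors
```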

Algebraic  Systems: 

Symbolic  computation  software  systems  provide 

•  exact  solution  of  algebraic  equations  (with  symbolic  coefficients,  multivariate,  higher 
degree), 

•  use  of  equations  as  “algebraic  rules”, 

•  transformation  between  various  representations  of  curves  and  surfaces, 

•  complete  solution  to  systems  of  multivariate,  non-linear  inequalities, 

• topologically correct decomposition of n-space w.r.t. polynomial inequalities.

Example: The command Reduce[a x^2 + b x + c == 0, x] in Mathematica will
result in a complete symbolic analysis of the various possible solutions
dependent on the values of the parameters a, b, c.
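The kind of case distinction involved can be sketched as follows. The function below is not how Mathematica works internally and it takes concrete rational coefficients rather than symbols; it only shows which cases a complete analysis over the reals has to separate. The function name and the wording of the results are illustrative.

```python
from fractions import Fraction


def quadratic_cases(a, b, c):
    """Classify the solutions of a*x^2 + b*x + c = 0 for rational a, b, c."""
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    if a != 0:
        disc = b * b - 4 * a * c
        if disc > 0:
            return "two distinct real roots"
        if disc == 0:
            return "one double root x = %s" % (-b / (2 * a))
        return "two complex conjugate roots"
    if b != 0:
        # Degenerate case: the equation is linear.
        return "one root x = %s" % (-c / b)
    if c == 0:
        return "every x is a solution"
    return "no solution"
```

A symbolic system returns all of these branches at once, each guarded by the corresponding condition on a, b, c.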

Example: The command AlgebraicRules[{a == x + y, b == x y}, {x, y, a, b}]
in Mathematica will result in the set of rewrite rules {-y^2 -> b - a y,
-x -> -a + y} that completely rewrites any polynomial in x, y in terms of
the polynomials a and b, if this is possible, or decides that this is not
possible at all. This powerful simplification mechanism is based on the
author’s Gröbner bases method, see (Buchberger 1985).

Example: The command gbasis([c1 * c2 - cf * ct * cp + sf * sp, ...],
[sp, st, sf, cp, ct, cf, py, s2, s1, c2, c1]), where [c1 * c2 - cf * ct *
cp + sf * sp, ...] is the system of algebraic equations that describes the
inverse kinematics of a certain class of robots, produces a fully
triangularized version of the system


ci  +  pzi-2  lPvpz-ii+ll  ~  0’ 

c,  +  LpktS  .  px  .  =  0. 

2  _  pz2-2ljpz+px2-ll+l2  _  n 
1  pz2-2-lipz-l^+lf  ’ 

^2-^=0, 


.  PZ1  -2-li-pz—tl+l}  n 

py  + 2 - pi  2  -1  •  ci  •  =  o, 

*2 


•  -  px 

C2  _  pz2  -2-li  -pz+px2  -l2+t2  _  ~ 

J  pz2 ’ 

Ct  =  0, 

sf  +  i;  ■  ci  -  ■  cf  = 

pz *  -4-li-pz3  -2  l^  pz2  +6  l2  pz2  +4-<i  l2  pz-4  l^  -2  l2  I2  +i, 

P  '  /j  px  pz2  —  21 1  h  ■ px  pz+l j  px3  —li  px+li  li  px 


where the dependent quantities c1, c2, s1, s2, py, cf, ct, cp, sf, st, sp are
decoupled and expressed in terms of the independent quantities px, pz and
symbolic parameters l1, l2 (the lengths of the robot arms). For a detailed
description of this example see (Buchberger 1987).

Example: Again, a call to a Gröbner basis package (which is meanwhile
available in most symbolic computation software systems) with a parameter
presentation [x - r t, y - r t^2, z - r^2] of a surface in 3D space as input
will automatically produce the implicit presentation x^4 - y^2 z of the same
surface, see (Buchberger 1987).
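The result of this implicitization can be checked independently without a Gröbner basis package. The sketch below only verifies the stated answer, using exact rational arithmetic at sample parameter values chosen arbitrarily for illustration; it does not compute the implicit equation. Every point x = r t, y = r t^2, z = r^2 should satisfy x^4 - y^2 z = 0.

```python
from fractions import Fraction


def surface_point(r, t):
    """Point of the parametrized surface x = r t, y = r t^2, z = r^2."""
    return (r * t, r * t ** 2, r ** 2)


def implicit_form(x, y, z):
    """Left-hand side of the implicit presentation x^4 - y^2 z."""
    return x ** 4 - y ** 2 * z


samples = [(Fraction(r), Fraction(t, 3))
           for r in range(-3, 4)
           for t in range(-5, 6)]
residuals = [implicit_form(*surface_point(r, t)) for r, t in samples]
```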

Computer  Analysis 

Symbolic  computation  software  systems  provide 


• limits of symbolic expressions,

• calculations with finite and infinite power series,

• derivatives of symbolic expressions,

• indefinite and definite integrals of symbolic expressions,

• decision about existence of integrals in certain domains,

• Laplace transforms etc.,

• symbolic solution of certain classes of differential equations,

• symbolic simplification of sums and products.


Example:  The  command  powerseries(  log(sin(x)/x) ,  x,  0)  in  MACSYMA  will 
produce  the  infinite  symbolic  sum  presentation  of  the  input  function: 


OO 


-(£ 


22ibern(2i) 

(2i)!  " 


)log(x)  -  log(x) 


Example: The commands eq1: 3 * 'diff(f(x),x,2) - 2 * 'diff(g(x),x) = sin(x)
and eq2: a * 'diff(g(x),x,2) + 'diff(f(x),x) = a * cos(x); in MACSYMA
define two ordinary second order differential equations eq1 and eq2. The
commands eq1: laplace(eq1,x,s) and eq2: laplace(eq2,x,s) then compute the
Laplace transform of eq1 and eq2. A call of linsolve will solve the linear
equations in the unknowns laplace(f(x),x,s) and laplace(g(x),x,s) and a call
of ilt (inverse Laplace transform) will finally give the symbolic result


f(x) 


27as/23in(^|) 

v/6(3a-2) 


3a 


3a  sin(x) 
3a  -  2 


+  a  -f-  f  (0) 


g(x) 


9a3/2»in<7Sr>  , 

"76  +  •  ~6T-4 

3a 


(3a  -I-  l)cos(x) 
3a  -  2 


+  1/2. 


See  (Fateman  81)  for  more  details  on  this  example. 


Linear  Algebra,  Tensor  Calculus 

Symbolic  computation  software  systems  provide 


•  operations  on  matrices  with  symbolic  entries, 

•  inversion,  linear  systems,  nullspace,  eigenvalues,  ... 

•  operations  on  tensors. 


Numerical  Mathematics,  Statistics 

Symbolic  computation  software  systems  provide  routines 

• for numerical computation (curve fitting, Fourier transform, Newton root
finding, numerical integration, Runge-Kutta DE solution, ...), and

• for statistics.


Automated  Theorem  Proving 

Symbolic  computation  software  systems  provide 

•  automated  proofs  of  geometrical  theorems, 

•  automated  proofs  of  arbitrary  first-order-formulae  on  the  reals. 

•  proofs  of  first  order  equations  w.r.t.  equational  axiom  systems, 

•  general  first  order  theorem  proving. 

Example: Apollonios’ Circle Theorem: “The altitude pedal of the hypotenuse of
a right-angled triangle and the midpoints of the three sides of the triangle
lie on a circle.” After introducing coordinates, a possible algebraic
formulation of this problem is as follows:

for all coordinates $a_1, \ldots, a_{10} \in \mathbb{R}$:

if $h_1(a_1, \ldots, a_{10}) = 0, \ldots, h_8(a_1, \ldots, a_{10}) = 0$,
then $c(a_1, \ldots, a_{10}) = 0$,

where the $h_i$ are the polynomials describing the hypotheses of the theorem
and c is the polynomial describing the conclusion of the theorem. The
algebraic formulation is what is called the radical membership problem
“$c \in \mathrm{Radical}(\{h_1, \ldots, h_m\})$?”. Arbitrary such questions
can be decided by deciding
“$1 \in \text{Gröbner-Basis}(\{h_1, \ldots, h_m, z \cdot c - 1\})$?”,
where z must be a new indeterminate. More details on this example are
contained in (Buchberger 1987). In the system (Kutzler 1988) and similar
systems, proofs of the above kind can be carried out in a few seconds.

Algebraic  Geometry: 

Symbolic  computation  software  systems  provide 

• analysis of and computation in residue class rings modulo polynomial ideals
(by Gröbner bases),

• free resolution of polynomial ideals (determination of the sequence of
syzygy modules by Gröbner bases),

• analysis of and computation in commutative and non-commutative associative
algebras (Lie, Weyl algebras), group algebras etc.,

• analysis of and computation in groups given in various representations.

Example: In the CAYLEY system the following sequence of interactive commands:

g: free(a,b);
g.relations: a ^ 8, b ^ 7, (a * b) ^ 2, (a ^ -1 * b) ^ 3;
h = < a ^ 2, a ^ -1 * b >;
i = todd coxeter(g,h);
print i;

effects the enumeration of all 448 cosets of the subgroup < a^2, a^-1 b > of
< a, b | a^8, b^7, (ab)^2, (a^-1 b)^3 > by the famous Todd-Coxeter algorithm.

Group  Theory  Zoomed: 

Any of the above areas could be “zoomed”, revealing a wealth of problems and
solution techniques available in present symbolic computation systems. For
example, in computational group theory the following can be computed:

•  test  for  nilpotency,  commutativity,  solvability  etc. 

•  coset  enumeration 

•  normalizers 

•  centralizers 

•  central  chains 

•  series 

•  lattice  of  (normal)  subgroups 

• classes of conjugate subgroups

•  orbits 

•  test  of  imprimitivity 

•  test  of  isomorphism 

•  word  problems 

•  automorphism  groups. 
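As one small illustration from this list, the orbit of a point under a permutation group can be computed from the generators alone. The sketch assumes that permutations of the points 0..n-1 are given as tuples of images; the function name and the two sample generators are hypothetical and are not taken from CAYLEY.

```python
def orbit(point, generators):
    """Orbit of `point` under the group generated by the given permutations.

    Each permutation g is a tuple with g[p] the image of point p.
    """
    seen = {point}
    frontier = [point]
    while frontier:
        p = frontier.pop()
        for g in generators:
            q = g[p]
            if q not in seen:
                seen.add(q)
                frontier.append(q)
    return seen


rotate = (1, 2, 3, 0, 4, 5)  # 4-cycle on the points 0, 1, 2, 3
swap = (0, 1, 2, 3, 5, 4)    # transposition of the points 4 and 5

orbit_of_0 = orbit(0, [rotate, swap])
orbit_of_4 = orbit(4, [rotate, swap])
```

Because the group is finite, applying the generators repeatedly already reaches every point of the orbit; inverses need not be listed separately.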

Different  representations  of  the  group  theoretical  objects  lead  to  different  solution 
algorithms  for  the  above  problems  with  drastically  different  efficiency.  Therefore  the  study 
of  representations  and  the  conversion  between  different  representations  is  an  important 
subarea  of  computational  group  theory.  The  main  representation  methods  are: 

•  generators  and  relations 

• permutations, block permutations

• power commutators

•  bases 

•  strong  generating  sets 

•  matrices  over  Z,  Galois  fields  and  algebraic  number  fields, 

•  Cayley  graphs 

•  character  tables. 

“Icons”  of  Other  Symbolic  Computation  Areas 

Other important and evolving subareas of Symbolic Computation can only be
mentioned as “icons” here:

•  Symbolic  Computation  based  computational  geometry 

•  automatic  programming 

•  computational  number  theory 

•  computational  topology 

•  ... 

Interfaces: 

Symbolic  computation  software  systems  provide 

•  advanced  2D  and  3D  graphics  and  animation, 

• interface to text processing,

•  output  in  FORTRAN,  C  etc.  syntax  (symbolic  computation  as  preprocessing!) 

Programming  facilities: 

Symbolic  computation  software  systems  provide 

•  functional  and  procedural  programming, 

•  “generic”  programming, 

•  “rewrite  rules”  style  programming  using  pattern  matching, 

Example: Mathematica, for example, provides a style of programming in the
form of “rewrite rules” that is particularly convenient for mathematicians,
who are used to presenting mathematical knowledge in the form of problem
reduction rules (“theorems”). For example, an “algorithm” for limit (“Limes”)
computations could be quickly assembled by formulating Mathematica rules of
the following style: Limes[a_ + b_] := Limes[a] + Limes[b]. For a given
symbolic input expression, Mathematica would check in its rule base whether
an expression matching the pattern a_ + b_ occurs as a subexpression and
would effect the corresponding transformation.
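The mechanism can be imitated in a few lines to show what such a rule does to an expression tree. This is a sketch of the idea only, not of Mathematica's pattern matcher: expressions are assumed to be nested tuples with the head in first position, and the single rule Limes[a + b] -> Limes[a] + Limes[b] is hard-coded in a function whose name is chosen for this illustration.

```python
def limes(expr):
    """Apply Limes[a + b] -> Limes[a] + Limes[b] wherever the pattern matches."""
    if isinstance(expr, tuple) and expr[0] == "Limes":
        arg = expr[1]
        if isinstance(arg, tuple) and arg[0] == "Plus":
            # The pattern matches: distribute Limes over the summands.
            return ("Plus",) + tuple(limes(("Limes", term)) for term in arg[1:])
        return expr
    if isinstance(expr, tuple):
        # No match at the top: look for matching subexpressions.
        return (expr[0],) + tuple(limes(sub) for sub in expr[1:])
    return expr


rewritten = limes(("Limes", ("Plus", "f", "g")))
```

Further rules (for products, constants and so on) would be added in the same style, each one corresponding to a theorem about limits.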
3 Examples of Recent Theoretical Results

In  this  section  we  give  some  samples  of  recent  theoretical  research  results  in  the  area  of 
symbolic  computation.  These  results  are  taken  from  papers  that  appeared  or  will  appear 
in  one  of  the  special  issues  of  the  Journal  of  Symbolic  Computation.  A  complete  list  of 
all  special  issues  of  this  journal  that  appeared  in  the  last  five  years  and  will  appear  in  the 
next  two  years  is  contained  in  Section  4.8. 

These  samples  of  theoretical  results  should  give  a  flavor  of  the  breadth  of  ongoing 
foundational  research  in  symbolic  computation  and  should  also  demonstrate  the  depth 
of  mathematics  necessary  to  come  up  with  algorithmic  solutions  to  symbolic  computa¬ 
tion  problems.  Each  of  these  results  also  gives  rise  to  many  important  and  challenging 
open  problems  that  could  attract  the  attention  of  mathematicians  who  are  interested  in 
embarking  on  new  directions  off  the  beaten  track. 

3.1 Proofs of Combinatorial Identities

In  (Zeilberger,  Takayama  1992)  a  new  approach  to  the  automated  proof  of  combinatorial 
identities  and  the  determination  of  definite  integrals  and  sums  is  developed.  It  is  based 
on  the  notion  of  “holonomic  functions”.  Roughly,  a  holonomic  function  is  a  function  / 
for  which  an  ideal  A  in  the  Weyl  algebra  of  differential  operators  exists  such  that  Af  =  0 
(and  A  satisfies  some  other  properties).  The  class  of  holonomic  functions  is  huge  and 
includes  most  of  the  practically  interesting  functions  like  polynomials,  rational  functions, 
algebraic  functions,  trigonometric  functions,  exponential  and  logarithm,  hypergeometric 
functions,  binomial  coefficients,  Bessel  functions,  Legendre  functions  etc. 

It is shown how, given an annihilating ideal A for f(x,t), an annihilating ideal B for
g = ∫ f(x,t) dt can be constructed. Hence, the integral g can be obtained as a solution
to the equation Bg = 0. An essential subalgorithm in the construction of B, used for the
elimination of variables and for other purposes, is a generalized version of the Gröbner
bases algorithm (Buchberger 1985).
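To give a feeling for what a machine-checkable proof of a combinatorial identity looks like, the following sketch verifies the identity sum_k C(n,k) = 2^n by means of a telescoping certificate. It uses the closely related Wilf-Zeilberger style of certificate rather than the holonomic construction of the paper, and the functions `F`, `G` and `certificate_holds` are hypothetical names chosen for this illustration; the certificate `G` was written down by hand, not computed.

```python
from fractions import Fraction
from math import comb


def binom(n: int, k: int) -> int:
    """Binomial coefficient, taken to be 0 outside 0 <= k <= n."""
    return comb(n, k) if 0 <= k <= n else 0


def F(n: int, k: int) -> Fraction:
    """Summand of the normalized identity sum_k F(n, k) = 1."""
    return Fraction(binom(n, k), 2 ** n)


def G(n: int, k: int) -> Fraction:
    """Hand-written certificate for this example (an assumption of the sketch)."""
    return Fraction(-binom(n, k - 1), 2 ** (n + 1))


def certificate_holds(n_max: int) -> bool:
    """Check F(n+1,k) - F(n,k) == G(n,k+1) - G(n,k) on a finite grid.

    Summing this relation over all k makes the right-hand side telescope
    to zero, so sum_k F(n, k) does not depend on n.
    """
    return all(
        F(n + 1, k) - F(n, k) == G(n, k + 1) - G(n, k)
        for n in range(n_max)
        for k in range(-1, n + 3)
    )
```

A finite grid check is of course not a proof; in the certificate approach the relation is a rational function identity that can be verified symbolically once and for all.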

3.2  Automatic  Discovery  of  Geometric  Theorems 

In (Sturmfels, Whiteley 1991) “Cayley factorization” of expressions in the Cayley algebra
is discussed. The mathematical problem consists in devising an algorithm that takes an
arbitrary expression in the bracket algebra, for example, an expression of the form

$$-[abc][ade][bdf][cef] + [abd][ace][bcf][def] \tag{1}$$

and generates an equivalent expression in the Cayley algebra, for example, the expression

$$(ab \wedge de) \vee (bc \wedge ef) \vee (cd \wedge fa). \tag{2}$$

The problem is open for the general case. A solution is given in the above paper for the
multilinear case.

This solution, however, suffices to “automatically generate” a lot of nontrivial
geometrical theorems or to automatically prove geometrical conjectures. For example, the
question of finding a necessary and sufficient condition for 6 points a, b, c, d, e, f in the
plane to lie on a quadric, which is equivalent to the statement that

$$\det \begin{pmatrix}
a_1^2 & a_2^2 & a_3^2 & a_1 a_2 & a_1 a_3 & a_2 a_3 \\
\vdots & \vdots & \vdots & \vdots & \vdots & \vdots \\
f_1^2 & f_2^2 & f_3^2 & f_1 f_2 & f_1 f_3 & f_2 f_3
\end{pmatrix} = 0 \tag{3}$$



can be answered by transforming (by the well known “straightening algorithm”) condition
(3) into the equivalent condition that (1) is zero and then, by the new Cayley
factorization algorithm, into the equivalent condition that (2) is zero. The latter
condition, however, has the immediate geometrical interpretation that the 6 points lie on
a quadric if and only if certain intersection points of the lines connecting the points
lie on one common straight line (Pascal's Theorem). This means that the theorem was
automatically produced.
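The bracket condition can be checked numerically. The sketch below evaluates the bracket expression [abd][ace][bcf][def] - [abc][ade][bdf][cef] for six points given in homogeneous coordinates, where a bracket is the 3x3 determinant of its three points. The helper names and the choice of sample points on the parabola y = x^2 are assumptions made for this illustration only.

```python
def bracket(p, q, r):
    """Bracket [pqr]: 3x3 determinant of three points in homogeneous coordinates."""
    return (p[0] * (q[1] * r[2] - q[2] * r[1])
            - p[1] * (q[0] * r[2] - q[2] * r[0])
            + p[2] * (q[0] * r[1] - q[1] * r[0]))


def conic_expression(a, b, c, d, e, f):
    """Value of -[abc][ade][bdf][cef] + [abd][ace][bcf][def]."""
    return (-bracket(a, b, c) * bracket(a, d, e) * bracket(b, d, f) * bracket(c, e, f)
            + bracket(a, b, d) * bracket(a, c, e) * bracket(b, c, f) * bracket(d, e, f))


def on_parabola(t):
    """Point (t, t^2) on the conic y = x^2, in homogeneous coordinates."""
    return (t, t * t, 1)


# Six points on one conic, and a variant with the last point moved off it.
POINTS_ON_CONIC = [on_parabola(t) for t in range(6)]
POINTS_OFF_CONIC = POINTS_ON_CONIC[:5] + [(5, 26, 1)]
```

With integer coordinates the arithmetic is exact, so the expression vanishes exactly for the first point set and is nonzero for the second.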


3.3 Point Location Problems Solved With Cylindrical Algebraic Decomposition

In  (Chazelle,  Sharir  1990)  it  is  shown  how  a  very  general  class  of  point  location  problems 
can  be  solved  by  using  Collins’  well  known  algorithm  for  producing  “cylindrical  algebraic 
decompositions”  of  the  n-space. 

Given  n  polynomials  in  m  variables,  the  m-space  is  naturally  partitioned  into  a  finite 
set  of  “cells”.  Given  a  point  in  m-space  the  point  location  problem  consists  in  locating 
the  “cell”  in  which  the  point  lies.  (Collins  1975)  showed  how  the  decomposition  of  the 
m-space  can  effectively  be  computed  by  algebraic  algorithms.  In  (Chazelle,  Sharir  1990) 
it  is  shown  how  Collins’  decomposition  can  be  used  to  solve  the  general  point  location 
problem  in  O(logn)  time  after  0{n2d)  preprocessing  operations  for  building  up  a  suitable 
data  structure. 
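The simplest instance of this problem is the one-dimensional one, which may help to fix ideas. The real roots of the given polynomials cut the line into cells (the roots themselves and the open intervals between them), and a sorted list of roots allows a query point to be located by binary search in logarithmic time. The sketch below is only this toy case, not the data structure of Chazelle and Sharir; the function `locate` and the cell numbering are assumptions of the illustration, and the roots are assumed to have been computed and sorted beforehand.

```python
from bisect import bisect_left


def locate(sorted_roots, x):
    """Return the index of the cell of the real line that contains x.

    Cells are numbered from left to right: even indices are the open
    intervals between consecutive roots, odd indices are the roots.
    """
    i = bisect_left(sorted_roots, x)
    if i < len(sorted_roots) and sorted_roots[i] == x:
        return 2 * i + 1  # x coincides with the i-th root
    return 2 * i          # x lies in the open interval left of the i-th root
```

For the roots -1, 0, 2 there are seven cells; for example, `locate([-1, 0, 2], 1)` falls into the interval between 0 and 2.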


3.4  Black  Box  Algorithms 

In (Kaltofen, Trager 1990) a new type of algorithm is introduced: “black box algorithms”
for problems having polynomials as input and output. Their method starts from the
observation that for a polynomial to be “known” it suffices to have a method that produces,
for any argument, the corresponding value of the polynomial. It is not actually necessary
that we know the coefficients of the polynomial.

Let us consider, for example, the factorization problem for multivariate polynomials
over the integers. Given a polynomial $f$ (i.e. given some method to compute
$f(a_1, \ldots, a_n)$ for arbitrary arguments), the factorization problem is solved if we
know a method that, given any $(x_1, \ldots, x_n)$, produces the values
$h_1(x_1, \ldots, x_n), \ldots, h_m(x_1, \ldots, x_n)$ of the factors $h_1, \ldots, h_m$
of $f$. This method may use calls of $f$ for various $(a_1, \ldots, a_n)$ as “oracles”. In
the paper, such a method is developed for the factorization problem and related problems,
and it is shown that polynomial time complexity can be achieved.
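The black box point of view can be illustrated with a 3x3 Vandermonde determinant, which is cheap to evaluate at any point although its expanded form grows quickly with the matrix size. In the sketch below the polynomial and its three linear factors are each represented only by an evaluation procedure. This shows the input and output convention only; the factor boxes are written down by hand and are not computed by the Kaltofen-Trager method, and all names are hypothetical.

```python
import random


def f_black_box(x1, x2, x3):
    """Evaluate det [[1, x1, x1^2], [1, x2, x2^2], [1, x3, x3^2]] at a point."""
    return ((x2 * x3 ** 2 - x3 * x2 ** 2)
            - x1 * (x3 ** 2 - x2 ** 2)
            + x1 ** 2 * (x3 - x2))


# Black boxes for the factors, known in closed form for this example.
FACTOR_BOXES = (
    lambda x1, x2, x3: x2 - x1,
    lambda x1, x2, x3: x3 - x1,
    lambda x1, x2, x3: x3 - x2,
)


def factors_agree(trials=100, seed=0):
    """Compare the product of the factor values with f at random integer points."""
    rng = random.Random(seed)
    for _ in range(trials):
        point = [rng.randint(-50, 50) for _ in range(3)]
        product = 1
        for h in FACTOR_BOXES:
            product *= h(*point)
        if product != f_black_box(*point):
            return False
    return True
```

Neither the coefficients of the polynomial nor those of its factors are ever written down; every object is accessed through evaluation alone.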

3.5 Polycyclic Quotient Algorithm

In (Sims 1990) an important problem of computational group theory is treated: the
computation of polycyclic quotient groups for a group given by a finite presentation. The
paper starts from an earlier method by Baumslag-Cannonito-Miller that did not fully cover
all the algorithmic subproblems. (Sims 1990) fills the gaps by giving a method that
completely reveals the structure of two residue class rings that appear in the construction.
This new method is a modification of the Gröbner basis technique.

3.6  Linear  Diophantine  Equations  and  Unification 

The  problem  of  unification  can  be  considered  to  be  the  most  general  formulation  of  the 
problem  of  solving  equations  in  arbitrary  first  order  theories.  It  is  well  known  that 
the  unification  problem  in  “AC”-theories  (theories  involving  only  equational  axioms  for 
associative-commutative function symbols) can be reduced to the problem of solving
systems of linear diophantine equations over the natural numbers.

Earlier solutions of the linear diophantine equation problem relied on systematic
enumeration.  In  (Clausen,  Fortenbacher  1989)  a  new  method  is  presented  that  uses 
certain  “completion”  steps  that  can  be  interpreted  as  “walks”  in  certain  labeled  digraphs. 
Significant  speed-up  can  be  achieved  by  this  new  approach. 
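For comparison, the older enumeration approach can be sketched in a few lines. The function below searches all vectors with components up to a given bound for solutions of a single homogeneous equation a_1 x_1 + ... = b_1 y_1 + ... over the natural numbers and keeps those that are minimal in the componentwise order. It is not the completion method of Clausen and Fortenbacher; the function name and the bound parameter are assumptions of this illustration, and minimality is judged only relative to the solutions found within the bound.

```python
from itertools import product


def minimal_solutions(a, b, bound):
    """Minimal nonzero solutions of sum(a[i]*x[i]) == sum(b[j]*y[j]).

    Every component of a candidate vector (x, y) ranges over 0..bound.
    """
    n_x = len(a)
    solutions = [
        v for v in product(range(bound + 1), repeat=n_x + len(b))
        if any(v)
        and sum(ai * xi for ai, xi in zip(a, v[:n_x]))
        == sum(bj * yj for bj, yj in zip(b, v[n_x:]))
    ]

    def lies_below(u, v):
        """True if u is a different vector with u <= v componentwise."""
        return u != v and all(ui <= vi for ui, vi in zip(u, v))

    return sorted(v for v in solutions
                  if not any(lies_below(u, v) for u in solutions))
```

For the equation 2 x_1 + x_2 = 2 y_1 and bound 2, the search space already has 27 candidates for only two minimal solutions, which hints at why enumeration scales poorly.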

3.7 Gröbner Fans of Ideals

The method of Gröbner bases has numerous applications in polynomial ideal theory and
related areas (algebraic geometry, geometric modeling). Gröbner bases depend on an
underlying “admissible” ordering of power products. There are infinitely many “admissible”
orderings. In (Mora, Robbiano 1988) it is shown that, for a given polynomial ideal, the
infinitely many admissible orderings fall into finitely many classes. Each class corresponds
to exactly one Gröbner basis. Furthermore, it is possible to compute, for a given
polynomial ideal, one “universal” Gröbner basis, i.e. a set of polynomials that is a Gröbner
basis for the given ideal under all possible admissible orderings.
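The dependence on the ordering already shows up in the leading power product of a single polynomial. The sketch below represents power products as exponent tuples and compares two common admissible orderings, the lexicographic and the graded lexicographic one; the function names are chosen for this illustration and no Gröbner basis is computed.

```python
def lex_key(exponents):
    """Lexicographic ordering: compare exponents variable by variable."""
    return exponents


def grlex_key(exponents):
    """Graded lexicographic ordering: total degree first, then lexicographic."""
    return (sum(exponents), exponents)


def leading_power_product(power_products, key):
    """Largest power product of a polynomial under the given ordering."""
    return max(power_products, key=key)


# The polynomial x + y^2 in the variables (x, y), as exponent tuples.
POLY = [(1, 0), (0, 2)]
```

Under the lexicographic ordering with x > y the leading power product of x + y^2 is x, while under the graded ordering it is y^2, so reduction steps, and hence the resulting Gröbner bases, may differ.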

3.8  Cluster-Based  Cylindrical  Algebraic  Decomposition 

As explained above, Collins’ algorithm constructs, for n polynomials in m variables, a
decomposition of the m-space into “sign-invariant cells”. The computation proceeds by,
first, reducing the problem for the m-space to the problem for the (m − 1)-space and then
showing how to construct a solution for the m-space from a solution for the (m − 1)-space.
The latter construction proceeds by building a “stack” of m-cells over each (m − 1)-cell.

In (Arnon 1988) it is shown how the cells of the (m − 1)-space can be automatically
clustered into bigger sign-invariant blocks. In Collins' algorithm it then suffices to erect
stacks over these bigger, and fewer, blocks. By recursion, this may save enormous time
in the construction of the m-space decomposition.

(Mora, Robbiano 1988) T. Mora, L. Robbiano,

The Gröbner Fan of an Ideal, Journal of Symbolic Computation 6/2-3, October/December
1988.

(Sims 1990) C. Sims,

Implementing the Baumslag-Cannonito-Miller Polycyclic Quotient Algorithm, Journal
of Symbolic Computation 9/5-6, May/June 1990.

(Sturmfels, Whiteley 1991) B. Sturmfels, W. Whiteley,

On the Synthetic Factorization of Projectively Invariant Polynomials, Journal of Symbolic
Computation 12/4-5, October 1991.

(Wang 1990) Paul S. Wang,

Advances in Integrating Symbolic, Numeric and Graphics Computing, this conference.

(Zeilberger, Takayama 1992) D. Zeilberger, N. Takayama,

Computerized Proofs of Combinatorial and Special Function Identities, Journal of
Symbolic Computation 13/5-6, June 1992.


4.2  Symbolic  Computation  Software  Systems 

Some  of  the  more  important  symbolic  computation  software  systems  are  listed  here  in 
alphabetic  order  with  some  characterizing  information  in  the  order 

•  developed  since  which  year 

•  address  for  ordering  the  system 

•  available  on  which  machines 

•  some  characteristic  features. 

CAYLEY  V4 


• since the early 1970’s, steadily expanded and improved

• Prof. John Cannon
Department of Pure Mathematics
University of Sydney
Sydney NSW 2006

• Workstations of Apollo, DEC, IBM, SUN.
IBM machines running VM/CMS and VAX machines running VMS.

• Cayley V4 is a system designed for solving problems in algebra, number theory
and algebraic combinatorics, with strong emphasis on structural questions. A user
language based on the concepts of set, mapping and algebraic structure provides a
natural notation for algorithms in this area.


DERIVE 


•  1990  (Version  1.62) 

•  Soft  Warehouse,  Inc. 

3615  Harding  Avenue,  Suite  505 
Honolulu,  HI  96816 
Phone: (808) 734-5801

•  IBM  PC  compatible  machines  under  MS-DOS:  approximately  200  $. 

•  The  computer  algebra  system  Derive  is  designed  for  use  on  microcomputers.  A  very 
compact  system.  A  menu-driven  interface  makes  Derive  easy  and  natural  to  use. 
Derive  has  great  potential  as  a  teaching  aid. 


MACSYMA 


•  1988  (Version  412.6) 

•  Computer  Aided  Mathematics  Group 
Symbolics  Inc. 

New  England  Executive  Park  East 
Burlington,  MA  01803 
Phone:  (617)  221-1250 
Fax:  (617)  221-1099 

• 386-based MS-DOS systems.

Workstations  of  Apollo,  DEC,  Symbolics,  SUN. 

•  Macsyma  is  a  general  system  for  numerical,  symbolic  and  graphical  computation. 
It  offers  over  1300  documented  commands  for  solving  a  wide  range  of  mathematical 
problems. 

MAPLE


•  1989  (Version  4.3) 

•  Waterloo  Maple  Software 
160  Columbia  Street  West 
Waterloo,  Ontario,  Canada 
N2L  3L3 

Phone:  (519)  747-2373 
Fax:  (519)  747-5284 

• 386-based: approximately 680 $. Atari ST, Apple MacII, MacSE, MacPlus:
approximately 380.00 $.

Workstations  of  Apollo,  DEC,  HP,  IBM,  MIPS,  Silicon  Graphics,  SUN  and  many 
others:  approximately  2400  $. 

•  Maple  is  a  powerful  and  user-friendly  system  for  numerical,  symbolic  and  graphical 
computation  that  incorporates  a  high-level  programming  language.  It  delivers  a 
large  library  of  about  2000  mathematical  functions.  Maple  requires  remarkably 
little  memory  and  is  therefore  an  ideal  multi-user  system. 


MATHEMATICA 


•  1989  (Version  1.2) 

•  Wolfram  Research,  Inc. 

P.O.  Box  6059 
Champaign,  Illinois  61821 
Phone:  217-398-0700 
Fax:  217-398-0747 

•  386-based  MS-DOS  systems:  approx.  695  $. 

Apple  MacII,  MacSE,  MacPlus:  approx. 495  $. 

Workstations  of  Apollo,  DEC,  HP,  IBM,  MIPS,  Silicon  Graphics,  SUN  and  many 
others:  approx.  2,250  $. 

•  Mathematica  is  a  general  system  for  numerical,  symbolic  and  graphical  computation. 
The  system  produces  excellent  2D  and  3D  PostScript  graphics  and  incorporates  a 
modern  high-level  programming  language  with  pattern-match  and  rewriting  style 
programming.  On  the  Macintosh,  Mathematica  has  a  sophisticated  user  interface 
which  supports  interactive  textbooks. 

REDUCE


•  1988  (Version  3.3) 

•  Dr.  A.C.  Hearn 

The  RAND  Corporation 
P.O. Box 2138

Santa  Monica,  CA  90406-2138 

•  Reduce  has  been  implemented  on  many  different  computers  ranging  in  power  from 
the  IBM  PC  to  the  Cray  X-MP. 

• Reduce is a general-purpose computer algebra system designed for physicists,
mathematicians and engineers. Since, at present, Reduce is the most widely used
computer algebra system in a number of countries, many application packages are
available. Users with access to any of the major research computer networks can
obtain newly released packages and other material from a digital library.


SAC-2 


• Since the early 1970’s, continuously expanding and improving.

•  Prof.  George  Collins 
Computer  Science  Department 
Ohio  State  University 
Columbus,  Ohio 

•  Available  on  all  machines  having  a  FORTRAN  or  C  compiler. 

• SAC-2 is a collection of carefully designed algebraic algorithms, mainly for
polynomials, including Collins’ decision algorithm for the logical theory of
real closed fields. The algorithms are the basis for many implementations in other
systems. ALDES, the programming language of the system, is compiled into
FORTRAN or C and, thus, makes the system widely available and fast. All algorithms
are available in source code and are scientifically documented.


SCRATCHPAD 


• Prototype

•  Richard  D.  Jenks 
Computer  Algebra  Group 
IBM  Research  Division 
T.J. Watson Research Center
P.O.  Box  218 

Yorktown  Heights,  NY  10598 

•  IBM  mainframes  under  VM/CMS. 

RT/PC  and  PS/2  under  AIX. 

•  Scratchpad  is  a  general  computer  tool  for  mathematics.  Its  design  is  based  upon 
abstract  datatypes  which  are  organised  into  programmable  algebraic  hierarchies. 
Scratchpad’s  modular  library  has  over  200  datatypes.  In  addition,  there  is  a  growing 
library  of  algebraic  functions.  Scratchpad  has  an  interactive  language  for  easy  access 
to  library  facilities  and  simple  programming  tasks. 

4.3  Text  Books  and  Survey  Books  on  Computer  Algebra 

Akritas,  A.G.  (1989). 

Elements  of  Computer  Algebra  with  Applications. 

John  Wiley  and  Sons. 

Buchberger,  B.  (ed.)  (1985). 

Computer  Algebra. 

Proceedings  of  EUROCAL  85,  Vol.  1  (Invited  Lectures).  Springer  LNCS  203. 

(A  collection  of  survey  lectures  on  some  main  topics  in  computer  algebra.) 

Buchberger, B., Collins, G.E., Loos, R. (Eds.) (1982).

Computer  Algebra:  Symbolic  and  Algebraic  Computation. 

Springer  Verlag,  Wien  -  New  York. 

(This  is  not  really  a  text  book  but  a  collection  of  survey  articles  on  the  main  subareas 
of  computer  algebra.  Some  of  the  articles,  in  addition,  contain  details  of  theorems  and 
algorithms  not  available  in  other  text  books.) 

Davenport,  J.H.,  Siret,  Y.,  Tournier,  E.  (1988). 

Computer  Algebra:  Systems  and  Algorithms  for  Algebraic  Computation. 

Academic  Press,  London. 

(The  first  real  text  book  on  computer  algebra.  For  some  of  the  subjects  treated,  no  proof 
details  are  given.  Instead,  references  to  the  original  literature  are  provided.) 

Della Dora, J., Fitch, J. (1989).

Computer  Algebra  and  Parallelism. 

Academic  Press. 

Knuth,  D.E.  (1981). 

The Art of Computer Programming. Vol. 2: Seminumerical Algorithms.

Addison-Wesley,  Reading. 
(The algorithms, together with their mathematical foundation, for arithmetic in the most
important algebraic domains are given in this book.)

Janßen, R. (1988).

Trends in Computer Algebra.

Springer.

(A collection of survey lectures at an international symposium in Bad Neuenahr, May
1987.)

Lipson,  J.D.  (1981). 

Elements  of  Algebra  and  Algebraic  Computing. 

(This text book is a systematic introduction to basic areas of algebra together with
algorithms for the algebraic problems discussed. However, it covers only a part of what
today is computer algebra.)

Mignotte,  M.  (1989). 

Mathématiques pour le calcul formel.

Presses  Universitaires  de  France. 


4.4  Survey  Articles  on  Computer  Algebra: 


Buchberger,  B.,  Kutzler,  B.  (1986). 

Computer-Algebra für den Ingenieur.

In: Rechnerorientierte Verfahren (Buchberger et al. eds.). Teubner, Stuttgart,
pp. 11-64.


Caviness,  B.F.  (1985). 

Computer  Algebra:  Past  and  Future. 

J  of  Symbolic  Computation  2/3,  217-236. 

Kaltofen, E. (1987).

Computer  Algebra  Algorithms. 

In:  Annual  Review  of  Computer  Science  Vol.2,  91-118  (J.F.  Traub  ed.).  Annual  Review 
Inc.,  Palo  Alto,  California. 

Yun, D.Y.Y., Stoutemyer, R.D. (1980).

Symbolic  Mathematical  Computation. 

In:  Encyclopedia  of  Computer  Science  and  Technology  (  J.  Belzer  et  al.  eds.),  Vol.  15, 
235-310,  M.  Dekker,  New  York  -  Basel. 

4.5 Books on Subareas of Computer Algebra:


Davenport,  J.H.  (1981). 

On  the  Integration  of  Algebraic  Functions. 

Springer  LNCS  102. 

(Published  version  of  a  Ph.D.  thesis.) 

Atkinson,  M.D.  (ed.)  (1984). 

Computational  Group  Theory. 

Academic  Press,  London. 

(Proceedings of the 1983 LMS Symposium in Durham. There is no text book on
computational group theory. These proceedings may serve as a possible starting point for
obtaining an overview on the subject.)

Stauffer,  D.,  Hehl,  F.W.,  Winkelmann,  V.,  Zabolitzky,  J.G.  (1988). 

Computer  Simulation  and  Computer  Algebra. 

Springer. 

Tournier,  E.  (1989). 

Computer  Algebra  and  Differential  Equations. 

Academic  Press. 

4.6  Books  on  Applications  of  Computer  Algebra: 


Caviness,  B.F.,  Gilbert,  R.P.,  Shtokhamer,  R.  (19??). 

An Introduction to Applied Symbolic Computation Using MACSYMA.

(In  preparation). 

Howard,  J.C.  (1979). 

Practical  Applications  of  Symbolic  Computation. 

IPC  Science  and  Technology  Press,  Guildford,  England. 

(Some  application  examples,  using  FORMAC,  are  given  in  much  detail.  Somewhat  out 
of  date.) 

Klimov,  D.M.,  Rudenko,  V.M.  (1989). 

Computer  Algebra  Methods  for  Mechanics  Problems. 

Moskva. 

Pavelle,  R.  (Ed.)  (1985). 

Applications  of  Computer  Algebra. 

Kluwer  Academic  Publisher,  Boston  -  Dordrecht  -  Lancaster. 

(A  collection  of  application  papers  using  MACSYMA). 

Rand, R.H. (1984).

Computer Algebra in Applied Mathematics: An Introduction to MACSYMA.
Pitman Publishing Inc., Marshfield MA.

(Examples of using MACSYMA. Not so much a primer of MACSYMA.)

Rand, R.H., Armbruster, D. (1987).

Perturbation  Methods,  Bifurcation  Theory  and  Computer  Algebra. 

Springer. 

(Application  of  computer  algebra  in  a  special  area.) 

4.7  Books  on  Specific  Computer  Algebra  Systems: 

Rayna,  G.  (1987). 

REDUCE:  Software  for  Algebraic  Computation.  Springer,  New  York. 
(Introduction  to  REDUCE  with  case  studies.) 

Symbolics  Inc.  (1987). 

MACSYMA  User’s  Guide. 

(A  primer  for  getting  acquainted  with  MACSYMA.) 

Wooff,  C.,  Hodgkinson,  D.  (1987). 
muMATH:  A  Microcomputer  Algebra  System. 

Academic  Press,  London. 

(A  tutorial  on  muMATH.) 

4.8  Journals  on  Computer  Algebra: 


[JSC]  Journal  of  Symbolic  Computation  (B.  Buchberger  et  al.  eds.). 

Academic  Press  London,  1985-  . 

(The first refereed journal on computer algebra and all other areas of symbolic
computation. In addition to the regular issues it frequently publishes special issues on
certain subareas of computer algebra. For example,

• Vol. 3/1-2 (Feb 1987). Rewriting Techniques and Applications. (J.P. Jouannaud
ed.)

• Vol. 4/1 (August 1987). Algorithmic Methods in Algebra and Number Theory. (M.
Pohst ed.)

• Vol. 5/1-2 (April 1988). Algorithms in Real Algebraic Geometry. (D. Arnon and B.
Buchberger eds.)

• Vol. 6/2-3 (Oct 1988). Computational Aspects of Commutative Algebra. (L.
Robbiano ed.)

• Vol. 7/3-4 (March - April 1989). Unification: Part 1. (C. Kirchner ed.)

• Vol. 8/1-2 (July - Aug. 1989). Unification: Part 2. (C. Kirchner ed.)

•  Vol.  9/3  (March  1990).  Complexity  of  Algebraic  Algorithms.  (E.  Kaltofen  ed.) 

•  Vol.  9/5-6  (May  -  June  1990).  Computational  Group  Theory.  (J.  Cannon  ed.) 

•  Vol.  10/3-4  (Sept.  -  Oct.  1990).  Computational  Geometry.  (E.  Welzl  and  R.  Seidel 
eds.) 

•  Vol.  11/1-2  (Jan.  -  Feb.  1991).  Rewriting  Techniques  for  Theorem  Proving.  (J. 
Hsiang  and  L.  Bachmair  eds.) 

• Vol. 12/4-5 (Oct. - Nov. 1991). Invariant Theoretic Algorithms in Geometry. (B.
Sturmfels and N. White eds.)

•  Vol.  ?  (June  -  July  1992).  Symbolic  Algorithms  for  Combinatoric  Identities.  (P. 
Paule  and  D.  Zeilberger  eds.)) 


Journal  of  Applicable  Algebra  and  Error  Correcting  Codes. 

Springer,  New  York-Heidelberg,  1990-  . 

(A  recent  journal  emphasizing  the  application  of  Computer  Algebra  to  coding  theory  and 
related  subjects.) 

Journal  of  Automated  Reasoning. 

Reidel Publishing Company, 1985- .

(This journal specializes in Symbolic Computation algorithms in the area of automated
theorem proving.)

ACM  SIGSAM  Bulletin. 

Distributed  by  the  ACM  Special  Interest  Group  on  Symbolic  and  Algebraic  Manipulation. 
(This is a non-refereed informal bulletin for the fast dissemination of papers,
implementation notes, announcements, bibliographies etc. in the area of computer algebra.)

Acknowledgement: This paper was written in the frame of the project “Gröbner
Bases” sponsored by the “Österreichisches Ministerium für Wissenschaft und Forschung”.
I am grateful to my student W. Windsteiger who helped in the production of the
manuscript.

Phase Transitions and Maximally Dissipative Dynamic Solutions
in the Riemann Problem for Impact*


Thomas J. Pence**

Dept. of Metallurgy, Mechanics and Materials Science
Michigan  State  University 
East  Lansing,  MI  48824-1226 


Introduction 

Nonlinearly elastic materials whose strain energy density is a nonconvex
function of strain can undergo phase transitions in which displacement
gradients are discontinuous across internal surfaces. In a dynamic setting,
this gives rise to both conventional shock waves and travelling phase
boundaries. Physically, such phase boundaries separate states which involve
different microstructure, even though they are the same compositionally.

These types of phase transitions can be induced by the mechanical impact of
solid bodies [1]-[4] or even by the shock due to an intense laser pulse [5].

The one-dimensional theory of fully dynamical isothermal phase transitions has
been utilized in the investigation of phase boundary propagation [6]-[8]. In
this setting the material response mirrors the anomalous pressure-volume
relation of a van der Waals fluid, and corresponding mathematical issues arise
[9]-[16]. In particular, it is often the case that the equations of
mechanical motion are not sufficient to ensure a unique solution to various
boundary value problems. A number of criteria for selecting physically
admissible solutions have been proposed [9]-[16], and many of their solid
mechanical analogues are discussed in [6].

We consider an impact problem for the case where one material (that of the
target) admits the possibility of phase transitions and the other material
(that of the impactor) does not. The goal is to determine the nature of the
wave packet that is generated upon impact at the interface of contact for
various values of impact velocity v. Attention is restricted to longitudinal
compressive waves; interactions with additional surfaces of the target or
impactor are not considered. As this study is motivated by considerations of
high velocity impact, the impact velocity v may be large. Thus a mathematical
treatment cannot utilize techniques based on small initial data, as for
example utilized in [12] in the parallel setting of a van der Waals fluid.

The purpose of this communication is to outline a method for treating this
problem. This treatment relies on a mapping from candidate solutions for both
impactor and target to a plane (the $(\sigma^*, v^*)$-plane) of contact interface
stress $\sigma^*$ vs. contact interface velocity $v^*$. For certain values of impact
velocity v we find that the problem under consideration yields a unique
solution on the basis of the equations of motion alone. However, for values of
impact velocity v within a finite interval we find that uniqueness
does not hold. In this case we seek solutions that are maximally dissipative.
The maximum dissipation criterion was proposed by Dafermos [16] for resolving
uniqueness, although in [16] it is called the entropy rate admissibility
criterion. Here we find that this criterion delivers uniqueness for a certain
proper subinterval of this finite interval. In addition we find that this
uniqueness will extend to the whole interval, and hence all impact velocities
v > 0, if certain additional constitutive restrictions are assumed of the
two-phase material.

The  purpose  of  this  communication  is  not  to  give  an  exhaustive  treatment  of 
these  results.  Instead,  a  number  of  theorems  are  presented,  many  without 
proof,  which  summarize  the  basic  methodology  used  in  arriving  at  these 
conclusions.  Some  of  the  proofs  rely  on  fairly  elaborate  algebraic 
calculations,  which  have  been  performed  with  the  aid  of  MACSYMA.  A  complete 
exposition  is  given  in  [17]. 


Preliminaries 


We consider the impact between an initially stationary target (occupying
$x > 0$) and a moving impactor (which occupies $x < 0$ at the instant of contact
$t = 0$). Since impact will give rise to compressive stress, it will be
convenient to define compressive strain $\gamma$ and compressive stress $\sigma$ as
positive. The constitutive response is given by

$$\sigma = \begin{cases} f(\gamma), & \text{in } x < 0, \\ r(\gamma), & \text{in } x > 0, \end{cases} \tag{1}$$



where $f(\gamma)$, $r(\gamma)$ are nonlinear elastic (compressive) stress response functions
for the impactor and target respectively. By definition, the compressive
strain $\gamma$ is confined to the interval $0 \le \gamma < 1$, and the stress diverges as
$\gamma \to 1$. For the purpose of the pilot problem presented here it will be
convenient to assume that $\gamma$ can take on all values obeying $\gamma \ge 0$ and that
$f(\gamma)$ and $r(\gamma)$ are defined for all $\gamma \ge 0$. The modifications to the
argument to be given here appropriate for the case $0 \le \gamma < 1$ can be found in [17].


In a Lagrangian frame, the one-dimensional propagation of longitudinal
compression waves in an isothermal setting is described by the equations

$$\begin{aligned}
\gamma_t + v_x &= 0, & v_t + f'(\gamma)\,\gamma_x &= 0, && \text{in } x < 0, \\
\gamma_t + v_x &= 0, & v_t + r'(\gamma)\,\gamma_x &= 0, && \text{in } x > 0,
\end{aligned} \tag{2}$$

and discontinuity conditions

$$\begin{aligned}
\dot{s}\,[|\gamma|] - [|v|] &= 0, & \dot{s}\,[|v|] - [|f|] &= 0, && \text{in } x < 0, \\
[|v|] &= 0, & f &= r, && \text{on } x = 0, \\
\dot{s}\,[|\gamma|] - [|v|] &= 0, & \dot{s}\,[|v|] - [|r|] &= 0, && \text{in } x > 0,
\end{aligned} \tag{3}$$

where $\dot{s}$ represents the time derivative of a generic discontinuity curve
$x = s(t)$, and $[|\cdot|]$ denotes the jump in the enclosed quantity across $x = s(t)$.
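As a short worked step, the velocity jump can be eliminated between the two discontinuity conditions that hold in the target. The display below is a sketch written in the notation used above (target response $r$, jump bracket $[|\cdot|]$, interface speed $\dot{s}$); the same computation applies in the impactor with $r$ replaced by $f$.

```latex
% Conditions in x > 0:  \dot{s}[|\gamma|] = [|v|]  and  \dot{s}[|v|] = [|r|].
% Substituting the first into the second:
\dot{s}\,\bigl(\dot{s}\,[|\gamma|]\bigr) = [|r|]
\quad\Longrightarrow\quad
\dot{s}^{2} = \frac{[|r|]}{[|\gamma|]} .
```

The interface speed is thus fixed, up to sign, by the slope of the chord joining the two states on the stress response curve, and a discontinuity is possible only between states for which this slope is nonnegative.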

Let v denote the initial velocity of the impactor. Both impactor and target
are assumed to be initially stress free. In view of the absence of length and
time scales in the problem formulation, we introduce the similarity variable
$\lambda = x/t$ and assume that

$$v(x,t) = v(\lambda), \qquad \gamma(x,t) = \gamma(\lambda). \tag{4}$$

The initial conditions then give

$$\begin{aligned}
v(-\infty) &= V, & \gamma(-\infty) &= 0, \\
v(\infty) &= 0, & \gamma(\infty) &= 0.
\end{aligned} \tag{5}$$



In view of (4), the equations of motion (2) now become ordinary differential
equations, while the discontinuity conditions (3) become a set of algebraic
restrictions across any rays of discontinuity $\lambda = \lambda_i$.
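The reduction to ordinary differential equations can be made explicit. The following sketch assumes the similarity form $v = v(\lambda)$, $\gamma = \gamma(\lambda)$ with $\lambda = x/t$ and works in the target; a prime on $v$ or $\gamma$ denotes differentiation with respect to $\lambda$, and the impactor case follows with $r$ replaced by $f$.

```latex
% Chain rule for functions of \lambda = x/t:
%   \partial_t = -(\lambda/t)\, d/d\lambda ,   \partial_x = (1/t)\, d/d\lambda .
\begin{aligned}
\gamma_t + v_x = 0
  &\;\Longrightarrow\; -\lambda\,\gamma' + v' = 0, \\
v_t + r'(\gamma)\,\gamma_x = 0
  &\;\Longrightarrow\; -\lambda\,v' + r'(\gamma)\,\gamma' = 0, \\
\text{hence}\quad
  &\bigl(r'(\gamma) - \lambda^{2}\bigr)\,\gamma' = 0 .
\end{aligned}
```

On any interval of smoothness the solution is therefore either a constant state ($\gamma' = 0$, and then $v' = 0$) or a centered fan along which $\lambda^2 = r'(\gamma)$, which requires $r'(\gamma) \ge 0$.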


As is well known, the motion of a discontinuity interface will in general give
rise to a change in the total mechanical energy of the dynamical fields. The
rate of mechanical energy change is given as $-\sum_i d_i$, where the summation is
over the total number of discontinuity interfaces and $d_i$ is the dissipation
rate of the i-th discontinuity interface. Consider the i-th discontinuity
interface and let $\gamma_-$ and $\gamma_+$ obeying $0 \le \gamma_- \le \gamma_+$ be the strains adjacent
to this interface. Then

$$d_i = D(\gamma_-, \gamma_+), \tag{6}$$



where $D: [0,\infty) \times [0,\infty) \to (-\infty,\infty)$ is the dissipation function. If the
interface in question occurs in the target, i.e. $x > 0$, then this dissipation
function $D$ is given by

$$D(y,z) = |S(y,z)|\, A(y,z), \tag{7}$$

where

$$S(y,z) = \sqrt{\frac{r(z) - r(y)}{z - y}}, \qquad
A(y,z) = \frac{\bigl(r(z) + r(y)\bigr)(z - y)}{2} - \int_y^z r(q)\, dq. \tag{8}$$

The function $S$ gives the velocity of the discontinuity interface. If the
interface in question occurs in the impactor, i.e. $x < 0$, then (7), (8)
continue to hold with $r$ replaced by $f$. If the interface in question is the
interface of contact $x = 0$, then $D = 0$ since this interface is stationary in a
Lagrangian frame.
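The dissipation function is easy to evaluate numerically for a given response function. The sketch below assumes that A(y,z) is the area between the chord joining (y, r(y)) and (z, r(z)) and the response curve, i.e. the trapezoid area (r(z) + r(y))(z - y)/2 minus the integral of r from y to z, and that S(y,z) is the square root of the chord slope. The integral is approximated by a composite Simpson rule; the function names and the sample response functions are assumptions of this illustration.

```python
from math import sqrt


def simpson(g, lo, hi, panels=200):
    """Composite Simpson rule on [lo, hi] with an even number of panels."""
    h = (hi - lo) / panels
    total = g(lo) + g(hi)
    for i in range(1, panels):
        total += (4 if i % 2 else 2) * g(lo + i * h)
    return total * h / 3


def dissipation(r, y, z):
    """D(y, z) = |S(y, z)| * A(y, z) for a response function r and y != z.

    Requires a nonnegative chord slope (r(z) - r(y)) / (z - y).
    """
    speed = sqrt((r(z) - r(y)) / (z - y))
    area = (r(z) + r(y)) * (z - y) / 2 - simpson(r, y, z)
    return abs(speed) * area
```

For a linear response the chord coincides with the curve and the dissipation vanishes, while for the convex sample response r(q) = q^2 between strains 0 and 1 the value is 1/6.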


A pair of piecewise smooth functions $\{v_L(\lambda), \gamma_L(\lambda)\}$ defined on $\lambda \le 0$ which
satisfy $(2)_1$, $(3)_1$, (4) and in addition obey $(5)_1$ will be called a candidate
dynamical state for the impactor. A pair of piecewise smooth functions
$\{v_R(\lambda), \gamma_R(\lambda)\}$ defined on $\lambda \ge 0$ which satisfy $(2)_2$, $(3)_3$, (4) and in addition
obey $(5)_2$ will be called a candidate dynamical state for the target. For each
candidate dynamical state for the impactor, denote the values of strain,
velocity and stress on the interface of contact by $\gamma_L^*$, $v_L^*$ and $f^*$, that is

$$\gamma_L^* = \gamma_L(0), \qquad v_L^* = v_L(0), \qquad f^* = f(\gamma_L^*). \tag{9}$$

For each candidate dynamical state in the target denote the corresponding
contact interface strain, velocity and stress values by

$$\gamma_R^* = \gamma_R(0), \qquad v_R^* = v_R(0), \qquad r^* = r(\gamma_R^*). \tag{10}$$

A pair of candidate dynamical states, $\{v_L(\lambda), \gamma_L(\lambda)\}$, $\{v_R(\lambda), \gamma_R(\lambda)\}$, will
be said to be a dynamical state solution for the impact problem if

$$v_R^* = v_L^* = v^*, \qquad f^* = r^* = \sigma^*, \tag{11}$$

where we have defined common interface values $v^*$ and $\sigma^*$ for the velocity and
stress. Condition (11) follows from the conditions $(3)_2$ on $x = 0$. A dynamical
state solution gives rise to a pair of functions $\{v(\lambda), \gamma(\lambda)\}$ defined on
$-\infty < \lambda < \infty$ that satisfy all the conditions to be a solution to the impact
problem.

It shall henceforth be assumed that the target admits the possibility of phase transitions in compression and that the impactor does not. In particular, the response function $f(\gamma)$ for the single phase impactor material is assumed to obey:

HI1) $f(0) = 0$, and $f(\gamma) > 0$, $f'(\gamma) > 0$, $f''(\gamma) < 0$ for $\gamma > 0$.

The response function $\tau(\gamma)$ for the two phase target material is assumed to obey

HT1) $\tau(0) = 0$, and $\tau(\gamma) > 0$ for $\gamma > 0$.


In addition it is assumed that there exist two distinguished values of compressive strain $\gamma_M$ and $\gamma_m$ obeying $0 < \gamma_M < \gamma_m$ such that $\tau'(\gamma_M) = \tau'(\gamma_m) = 0$, and

HT2) $\tau'(\gamma) > 0$, $\tau''(\gamma) < 0$, for $0 < \gamma < \gamma_M$,

HT3) $\tau'(\gamma) < 0$, for $\gamma_M < \gamma < \gamma_m$,

HT4) $\tau'(\gamma) > 0$, $\tau''(\gamma) \ge 0$, for $\gamma > \gamma_m$,

HT5) $\lim_{\gamma\to\infty} \tau'(\gamma) > \tau'(0)$.

The strain regions $(0, \gamma_M)$ and $(\gamma_m, \infty)$ will respectively be called the low-strain or I-phase and the high-strain or II-phase. The interval $(\gamma_M, \gamma_m)$ will be called the unstable phase by virtue of the well known instability properties connected with such an interval in either an equilibrium or a dynamic setting.

We shall restrict attention to candidate dynamical states, and hence dynamical state solutions, which obey two admissibility conditions, both of which involve additional requirements on discontinuity interfaces. The first admissibility condition that we require of all discontinuity interfaces is:

A1) $D(\gamma_-, \gamma_+) \ge 0$,

corresponding to the idea that no discontinuity interface should ever be a source of mechanical energy. The second admissibility condition that we require pertains only to phase boundaries. It is required that all phase boundaries connect phase-I to phase-II. This has the physical consequence of precluding the existence of dynamical states that involve constant strain and velocity regions with strains corresponding to the unstable phase. We write this condition thus:

A2) For phase boundaries, $0 \le \gamma_- \le \gamma_M$, and $\gamma_+ \ge \gamma_m$.


Admissible Candidate Dynamical States for the Impactor and the associated Locus of Contact Values

It can be shown that (A1) forbids discontinuity interfaces in admissible candidate dynamical states for the impactor. Consequently all such dynamical states in the (x,t)-quarter plane {x<0, t>0} consist of a continuous compression wave bounded on each side by a constant strain and velocity region. Moreover it is found that the value $\gamma_L^*$ completely parametrizes all of these admissible candidate dynamical states in the impactor. The values of $f^*$ and $v_L^*$ for this parametrization are given by

$$f^* = f(\gamma_L^*), \qquad v_L^* = v - \int_0^{\gamma_L^*} \left(f'(z)\right)^{1/2} dz. \tag{12}$$

Let $\Gamma_L$ denote the locus of (admissible) contact values for the impactor generated by (12) in a $(\sigma^*, v^*)$-plane. The following result is immediate from (12) and (HI1):


Theorem 1: $\Gamma_L$ is a connected semi-infinite curve which is concave down and monotonically decreasing from the initial data point $(\sigma^*, v^*) = (0, v)$. Changing the value of v merely translates $\Gamma_L$ vertically. There is a one-to-one correspondence between points on this locus and admissible candidate dynamical states for the impactor.
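The properties stated in Theorem 1 can be checked numerically for a particular impactor material. The sketch below is illustrative only: it assumes the hypothetical response f(γ) = log(1 + γ), which satisfies (HI1), and it assumes that the integrand in (12) is the square root of f′ (the wave speed at unit reference density), for which the integral has the closed form 2(√(1+γ) − 1). The function name `impactor_locus` is not from the paper.

```python
import math


def impactor_locus(impact_velocity, strains):
    """Contact values (sigma*, v*) from (12) for the hypothetical f(g) = log(1 + g)."""
    points = []
    for g in strains:
        sigma = math.log1p(g)                              # f(gamma_L*)
        wave_integral = 2.0 * (math.sqrt(1.0 + g) - 1.0)   # integral of sqrt(f'(z)) on [0, g]
        points.append((sigma, impact_velocity - wave_integral))
    return points


strains = [0.1 * k for k in range(31)]
locus = impactor_locus(1.0, strains)

# Chord slopes between successive locus points in the (sigma*, v*)-plane.
slopes = [(v2 - v1) / (s2 - s1)
          for (s1, v1), (s2, v2) in zip(locus, locus[1:])]

print(locus[0])
print(all(s < 0.0 for s in slopes))                          # monotonically decreasing
print(all(b < a for a, b in zip(slopes, slopes[1:])))        # concave down
```

Evaluating `impactor_locus` at a second impact velocity shifts every point by the same vertical amount, which is the translation property stated in the theorem.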


Admissible  Candidate  Dynamical  States  for  the  Target  and  the  associated  Locus 
of  Contact  Values 

It can be shown that the only candidate dynamical states for the target that obey (A1), (A2) fall into one of two possible families. The first family, the i-family, involves dynamical states in the (x,t)-quarter plane {x>0, t>0} that consist of a continuous compression wave bounded on each side by a constant strain and velocity region, all of which are in the I-phase. This one parameter family can be parametrized by $\gamma_R^*$ on the interval $[0, \gamma_M]$. Let $\Gamma_{R,i}$ denote the locus of admissible contact value pairs $(\tau^*, v_R^*)$ generated by the i-family in the $(\sigma^*, v^*)$-plane. The main features of the locus $\Gamma_{R,i}$ are summarized in the following theorem.

Theorem 2: $\Gamma_{R,i}$ is a connected curve of finite length that passes through the initial data point $(\sigma^*, v^*) = (0,0)$ and which subsequently is monotonically increasing. Moreover this locus is concave up in the $(\sigma^*, v^*)$-plane. There is a one-to-one correspondence between points on $\Gamma_{R,i}$ and the i-family of admissible candidate dynamical states for the target.


The second family, the ii-family, gives rise to dynamical states in the (x,t)-quarter plane {x>0, t>0} that involve at most five distinct sectors: S1 - a constant strain and velocity region in the I-phase, S2 - a continuous compression wave in the I-phase, S3 - a constant strain and velocity region in the I-phase, S4 - a phase boundary separating the I and II-phases, and S5 - a constant strain and velocity region in the II-phase. The sectors S1 and S4 are present in all members of this family, whereas sectors S2, S3 and S5 may or may not be present. The absence of S5 indicates that the phase boundary is on the interface of contact x = 0 and hence stationary.

For this family $\gamma_R^*$ is also the value of the strain adjacent to the phase boundary in the II-phase. Let $\gamma_s$ denote the value of the strain adjacent to the phase boundary in the I-phase. It can then be shown that pairs $(\gamma_s, \gamma_R^*)$ parametrize the ii-family of admissible candidate dynamical states for the target.

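In order to characterize this parametrization we introduce the distinguished strain values $\gamma_q$, $\gamma_a$, $\gamma_b$, $\gamma_f$, $\gamma_o$ as follows (see also Figure 1). The value $\gamma_q$ is given as the unique solution of

$$\tau(\gamma_q) = \tau(\gamma_M), \qquad \gamma_q > \gamma_m. \tag{13}$$

Let $\gamma_a$, $\gamma_b$ denote the well known phase-I and phase-II values of Maxwell strain. That is, $\gamma_a$, $\gamma_b$ are the unique roots of

$$\tau(\gamma_a) = \tau(\gamma_b), \qquad D(\gamma_a, \gamma_b) = 0, \qquad 0 < \gamma_a < \gamma_M < \gamma_m < \gamma_b. \tag{14}$$

Let $\gamma_f$ denote the solution of

$$\tau'(0)\,\gamma_f = \tau(\gamma_f), \qquad \gamma_f > \gamma_m. \tag{15}$$

The existence and uniqueness of $\gamma_f$ is ensured by (HT5). Finally let $\gamma_o$ denote the unique solution of

$$D(0, \gamma_o) = 0, \qquad \gamma_o > \gamma_m. \tag{16}$$

It is easily seen that

$$\gamma_b < \gamma_q < \gamma_f, \qquad \gamma_b < \gamma_o < \gamma_f. \tag{17}$$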
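The distinguished strains in (13)-(16) are defined only implicitly, so a worked numerical example may help. The sketch below is an illustration under stated assumptions and is not taken from the paper: it uses a hypothetical piecewise polynomial response with $\gamma_M = 1$, $\gamma_m = 2$ that satisfies (HT1)-(HT5), takes the area term of the dissipation function to be the chord area minus the area under the curve, and locates each root by bisection. All function names are invented for the example.

```python
import math

GAMMA_M, GAMMA_m = 1.0, 2.0   # hypothetical local maximum / minimum strains
TAU_PRIME_0 = 1.0             # slope of the model response at zero strain


def tau(g):
    """Hypothetical two-phase response obeying (HT1)-(HT5)."""
    if g <= GAMMA_M:
        return g - 0.5 * g * g
    if g <= GAMMA_m:
        u = g - GAMMA_M
        return 0.5 - 1.5 * (u * u / 2.0 - u ** 3 / 3.0)
    w = g - GAMMA_m
    return 0.25 + 0.5 * w * w


def tau_integral(g):
    """Closed-form integral of tau from 0 to g."""
    if g <= GAMMA_M:
        return g * g / 2.0 - g ** 3 / 6.0
    if g <= GAMMA_m:
        u = g - GAMMA_M
        return 1.0 / 3.0 + 0.5 * u - 1.5 * (u ** 3 / 6.0 - u ** 4 / 12.0)
    w = g - GAMMA_m
    return 17.0 / 24.0 + 0.25 * w + w ** 3 / 6.0


def area_term(y, z):
    """Chord area minus area under tau; the dissipation vanishes when this does."""
    return 0.5 * (tau(z) + tau(y)) * (z - y) - (tau_integral(z) - tau_integral(y))


def bisect(func, lo, hi, tol=1e-12):
    """Plain bisection for a sign change of func on [lo, hi]."""
    f_lo, f_hi = func(lo), func(hi)
    if f_lo == 0.0:
        return lo
    if f_hi == 0.0:
        return hi
    if (f_lo > 0.0) == (f_hi > 0.0):
        raise ValueError("root is not bracketed")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if f_mid == 0.0:
            return mid
        if (f_mid > 0.0) == (f_hi > 0.0):
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def low_strain_partner(g_high):
    """I-phase strain carrying the same stress as the II-phase strain g_high."""
    target = min(tau(g_high), tau(GAMMA_M))
    return bisect(lambda g: tau(g) - target, 0.0, GAMMA_M)


gamma_q = bisect(lambda g: tau(g) - tau(GAMMA_M), GAMMA_m, 10.0)               # (13)
gamma_b = bisect(lambda g: area_term(low_strain_partner(g), g), GAMMA_m, gamma_q)  # (14)
gamma_a = low_strain_partner(gamma_b)
gamma_f = bisect(lambda g: TAU_PRIME_0 * g - tau(g), GAMMA_m, 20.0)            # (15)
gamma_o = bisect(lambda g: area_term(0.0, g), GAMMA_m, 20.0)                   # (16)

print(gamma_a, gamma_b, gamma_q, gamma_o, gamma_f)
```

For this particular model the Maxwell stress works out to 3/8, and the computed strains respect the two orderings in (17).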


Figure 1: The stress response function $\tau(\gamma)$ for the multi-phase target material and the associated distinguished strain values.


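We shall let $T$ denote the domain of the ordered pairs $(\gamma_s, \gamma_R^*)$ for the parametrization of the ii-family. It can then be shown that

$$T = \{(\gamma_s, \gamma_R^*) \mid \gamma_R^* \ge \gamma_m,\ \gamma_s \in [m_s(\gamma_R^*),\, M^s(\gamma_R^*)]\}, \tag{18}$$

where the functions $M^s$ and $m_s$ are each $C^0: [\gamma_m, \infty) \to [0, \gamma_M]$ (see also Figure 2).

The function $M^s$ is uniquely determined via:

$$\begin{aligned}
\tau(M^s(\gamma)) &= \tau(\gamma), && \text{for } \gamma_m \le \gamma \le \gamma_q,\\
\tau'(M^s(\gamma))\,(\gamma - M^s(\gamma)) &= \tau(\gamma) - \tau(M^s(\gamma)), && \text{for } \gamma_q < \gamma < \gamma_f,\\
M^s(\gamma) &= 0, && \text{for } \gamma \ge \gamma_f.
\end{aligned} \tag{19}$$

The function $m_s$ is uniquely determined via:

$$\begin{aligned}
\tau(m_s(\gamma)) &= \tau(\gamma), && \text{for } \gamma_m \le \gamma \le \gamma_b,\\
D(m_s(\gamma), \gamma) &= 0, && \text{for } \gamma_b < \gamma < \gamma_o,\\
m_s(\gamma) &= 0, && \text{for } \gamma \ge \gamma_o.
\end{aligned} \tag{20}$$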


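Let $\Gamma_{R,ii}$ denote the locus of (admissible) contact value pairs $(\tau^*, v_R^*)$ generated by the ii-family in the $(\sigma^*, v^*)$-plane. Then the mapping from $T$ to $\Gamma_{R,ii}$ that takes $(\gamma_s, \gamma_R^*) \to (\tau^*, v_R^*)$ will be denoted by $F$. It is given by

$$\begin{aligned}
\tau^* &= F_1(\gamma_R^*) = \tau(\gamma_R^*),\\
v_R^* &= F_2(\gamma_s, \gamma_R^*) = \int_0^{\gamma_s} \left(\tau'(z)\right)^{1/2} dz + \left\{\left(\tau(\gamma_R^*) - \tau(\gamma_s)\right)\left(\gamma_R^* - \gamma_s\right)\right\}^{1/2}.
\end{aligned} \tag{21}$$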
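To make the mapping F in (21) concrete, the sketch below evaluates it for a hypothetical two-phase response (local maximum at strain 1, local minimum at strain 2). Everything here is an assumption made for illustration: the model response, the closed form used for the wave integral (valid because the model has τ′(z) = 1 − z on the I-phase), and the function names. The two sample points use strain values that can be verified by hand for this model: the Maxwell pair (0.5, 2.5), and the pair (0, γ_f) with γ_f = 3 + √4.5 solving τ′(0)γ_f = τ(γ_f).

```python
import math

GAMMA_M, GAMMA_m = 1.0, 2.0   # hypothetical local maximum / minimum strains


def tau(g):
    """Hypothetical two-phase stress response (illustration only)."""
    if g <= GAMMA_M:
        return g - 0.5 * g * g
    if g <= GAMMA_m:
        u = g - GAMMA_M
        return 0.5 - 1.5 * (u * u / 2.0 - u ** 3 / 3.0)
    w = g - GAMMA_m
    return 0.25 + 0.5 * w * w


def wave_integral(gamma_s):
    """Integral of sqrt(tau'(z)) over [0, gamma_s], with tau'(z) = 1 - z in the I-phase."""
    return (2.0 / 3.0) * (1.0 - (1.0 - gamma_s) ** 1.5)


def contact_values(gamma_s, gamma_r):
    """F(gamma_s, gamma_R*) = (sigma*, v*) as in (21)."""
    if not (0.0 <= gamma_s <= GAMMA_M and gamma_r >= GAMMA_m):
        raise ValueError("need gamma_s in the I-phase and gamma_r in the II-phase")
    stress_jump = tau(gamma_r) - tau(gamma_s)
    if stress_jump < -1e-12:
        raise ValueError("stress must not decrease across the phase boundary")
    jump_term = math.sqrt(max(stress_jump, 0.0) * (gamma_r - gamma_s))
    return tau(gamma_r), wave_integral(gamma_s) + jump_term


# Equal-stress (Maxwell) pair of the model: the jump term vanishes.
print(contact_values(0.5, 2.5))

# Pair (0, gamma_f): both contact values equal gamma_f for this model.
gamma_f = 3.0 + math.sqrt(4.5)
print(contact_values(0.0, gamma_f))

# At fixed gamma_R*, increasing gamma_s lowers the contact velocity.
print(contact_values(0.2, 3.0)[1] > contact_values(0.6, 3.0)[1])
```

At an equal-stress pair the jump term is zero, so F reduces to the contact values of a purely I-phase compression wave with strain γ_s; this is the sense in which a stationary phase boundary leaves the rest of the dynamical state unchanged.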


Figure 2: The locus $T$. The functions $M^s(\gamma)$ and $m_s(\gamma)$ defined in (19) & (20) can each be constructed graphically with the aid of the stress response function $\tau(\gamma)$.

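The locus $\Gamma_{R,ii}$ will consist of curves for $\gamma_m \le \gamma_R^* \le \gamma_b$ and $\gamma_R^* \ge \gamma_f$. These curves will be denoted by $\Gamma^A_{R,ii}$ and $\Gamma^C_{R,ii}$ respectively. For $\gamma_b < \gamma_R^* < \gamma_f$, the locus $\Gamma_{R,ii}$ will consist of a region which will be denoted by $\Gamma^B_{R,ii}$. The boundary of this region is given by curves $\varphi_s$ and $\varphi^s$ where

$$\begin{aligned}
\varphi_s &= \{(\sigma^*, v^*) \mid (\sigma^*, v^*) = F(m_s(\gamma_R^*), \gamma_R^*),\ \gamma_R^* \in (\gamma_b, \gamma_f)\},\\
\varphi^s &= \{(\sigma^*, v^*) \mid (\sigma^*, v^*) = F(M^s(\gamma_R^*), \gamma_R^*),\ \gamma_R^* \in (\gamma_b, \gamma_f)\}.
\end{aligned} \tag{22}$$

It will also be convenient to define the following distinguished points in the $(\sigma^*, v^*)$-plane:

$$\begin{aligned}
&P_1 = F(M^s(\gamma_m), \gamma_m), \quad P_2 = F(M^s(\gamma_b), \gamma_b), \quad P_3 = F(M^s(\gamma_q), \gamma_q),\\
&P_4 = F(m_s(\gamma_o), \gamma_o), \quad P_5 = F(M^s(\gamma_f), \gamma_f).
\end{aligned} \tag{23}$$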


Salient properties of $\Gamma_{R,ii}$ in the $(\sigma^*, v^*)$-plane are given in the following theorem.

Theorem 3: $\Gamma_{R,ii}$ is a connected locus consisting of the union of the finite curve $\Gamma^A_{R,ii}$, the bounded region $\Gamma^B_{R,ii}$ and the semi-infinite curve $\Gamma^C_{R,ii}$. The former curve is monotonically increasing from $P_1$ to $P_2$. The latter curve is concave up and monotonically increasing from $P_5$ to $(\infty, \infty)$. The boundary of $\Gamma^B_{R,ii}$ is given by $\varphi_s \cup \varphi^s$. Each of the curves $\varphi_s$ and $\varphi^s$ originates at $P_2$ and terminates at $P_5$. Between these points both $\varphi_s$ and $\varphi^s$ are monotonically increasing, with $\varphi_s$ strictly above $\varphi^s$ in the $(\sigma^*, v^*)$-plane. There is a one-to-one correspondence between points on the locus $\Gamma_{R,ii}$ and admissible candidate dynamical states for the target from the ii-family.


We note that this theorem ensures that $F$ possesses a unique inverse on $\Gamma_{R,ii}$, which we shall subsequently denote as $F^{-1}$. It is found that $\Gamma_{R,i} \cap \Gamma_{R,ii} = \Gamma^A_{R,ii} \cup \varphi^s|_{[P_2,P_3]}$. The candidate dynamical states of $\Gamma_{R,ii}$ corresponding to this intersection involve a phase boundary which is stationary, i.e. confined to the interface of contact, and thus dissipation free. Other than this stationary phase boundary, these dynamical states are identical to the corresponding candidate dynamical state of $\Gamma_{R,i}$. In what follows these corresponding candidate dynamical states will be regarded as identical. Thus, with this view, there is a one-to-one correspondence between (admissible) candidate dynamical states in the target and points in the locus $\Gamma_R = \Gamma_{R,i} \cup \Gamma_{R,ii}$.

Admissible  Dynamical  State  Solutions  of  the  Riemann  Problem  for  Impact 

From the immediately preceding remark, Theorem 1 and (11), it follows that there is a one-to-one mapping from admissible dynamical state solutions for the impact problem to the set $\Gamma_L \cap \Gamma_R \equiv \Pi(v)$. The notation $\Pi(v)$ acknowledges the dependence of this set on the impact velocity v, which in this treatment enters through $\Gamma_L$ (viz. Theorem 1). We now turn to examine this dependence in more detail. Suppose that P is an arbitrary point in the first quadrant of the $(\sigma^*, v^*)$-plane. Theorem 1 then ensures that there is exactly one value of v such that $P \in \Gamma_L$. Let $\theta$ denote this mapping, i.e. $v = \theta(P)$. We shall use this mapping to define two distinguished values of impact velocity $v_1$ and $v_2$:

$$v_1 = \theta(P_2), \qquad v_2 = \theta(P_5). \tag{24}$$

The significance of these two values for the solution to the impact problem is given in the following theorem.

Theorem 4: The set $\Pi(v)$ consists of a single point for $v \in (0, v_1] \cup [v_2, \infty)$. However for $v \in (v_1, v_2)$ the set $\Pi(v)$ consists of a curve coinciding with $\Gamma_L$ within the region $\Gamma^B_{R,ii}$. Thus for $v \in (0, v_1] \cup [v_2, \infty)$ there exists a unique admissible dynamical state solution for the impact problem, but for $v \in (v_1, v_2)$ there exists a one-parameter family of admissible dynamical state solutions for the impact problem.

■

The endpoints of $\Pi(v)$ will be denoted by $P_s(v) \equiv \Pi(v) \cap \varphi_s$ and $P^s(v) \equiv \Pi(v) \cap \varphi^s$. Theorem 1 ensures that $\Pi(v)$ is monotonically decreasing and concave down between these two points (see Figure 3).

Figure 3: The locus of contact values for the target $\Gamma_R$ showing intersections with the locus of contact values for the impactor $\Gamma_L = \Gamma_L(v)$ for various impact velocities v.


Maximally  Dissipative  Admissible  Dynamical  State  Solutions 


Since there is no dissipation associated with the dynamical states of $\Gamma_L$ and $\Gamma_{R,i}$, the rate of mechanical energy change for an admissible dynamical state solution is that associated with the travelling phase boundaries (if any) in the dynamical state from $\Gamma_{R,ii}$. In particular, for $v \in (0, v_1]$ the unique admissible dynamical state solution to the impact problem involves conservation of mechanical energy. Whenever $v \in [v_2, \infty)$, the unique admissible dynamical state solution to the impact problem involves a finite rate of mechanical energy loss (dissipation). For $v \in (v_1, v_2)$, each of the admissible dynamical state solutions in $\Pi(v)$ will have associated with it a finite value of dissipation. Let $D^*: \Gamma_{R,ii} \to [0, \infty)$ be defined as $D^* \equiv D \circ F^{-1}$. Since $\Pi(v) \subset \Gamma^B_{R,ii}$ is closed, and $D^*$ is bounded and continuous on the subdomain $\Gamma^B_{R,ii}$, the function $D^*$ will assume a finite maximum at one or more points of $\Pi(v)$. For $v \in (v_1, v_2)$ let $\Omega(v) \subset \Pi(v)$ denote those points at which $D^*$ assumes this maximum value. In addition we shall extend the definition of $\Omega(v)$ to $v \in (0, \infty)$ via $\Omega(v) = \Pi(v)$ whenever $v \in (0, v_1] \cup [v_2, \infty)$.

Each dynamical state solution corresponding to a point in $\Omega(v)$ is a maximally dissipative solution to the impact problem at impact velocity v. It is thus immediate from this definition and Theorem 4 that there is a unique maximally dissipative solution to the impact problem for all impact velocities obeying $v \in (0, v_1] \cup [v_2, \infty)$. We now seek to determine under what circumstances, if any, there will be a unique maximally dissipative dynamical solution to the impact problem for impact velocities obeying $v \in (v_1, v_2)$.

To determine $\Omega(v)$, we define loci of constant dissipation on $\Gamma^B_{R,ii}$ for each dissipation value $d \ge 0$ via

$$\Xi(d) = \{(\sigma^*, v^*) \mid (\sigma^*, v^*) \in \Gamma^B_{R,ii},\ D^*(\sigma^*, v^*) = d\}. \tag{25}$$

One then finds that

$$\Xi(0) = \varphi_s|_{[P_2,P_4]} \cup \varphi^s|_{[P_2,P_3]}. \tag{26}$$

The behavior of $D^*$ on the remaining portion of the boundary of $\Gamma^B_{R,ii}$ is given in

Theorem 5: $D^*(\sigma^*, v^*)$ is monotonically increasing from zero to $D(0, \gamma_f)$ on $\varphi_s|_{[P_4,P_5]}$ as $P = (\sigma^*, v^*)$ proceeds from $P_4$ to $P_5$. $D^*(\sigma^*, v^*)$ is monotonically increasing from zero to $D(0, \gamma_f)$ on $\varphi^s|_{[P_3,P_5]}$ as $P = (\sigma^*, v^*)$ proceeds from $P_3$ to $P_5$.


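The mathematical difficulty in determining $\Xi(d)$, and hence $\Omega(v)$ for $v \in (v_1, v_2)$, stems from the fact that $F$ cannot be inverted explicitly. Thus the function $D^* \equiv D \circ F^{-1}$ must be investigated by implicit means. Let us introduce notation for the derivatives of the functions $D$ and $F$, which are defined for $(\gamma_s, \gamma_R^*) \in T$:

$$D_s \equiv \frac{\partial D}{\partial \gamma_s}, \quad D_r \equiv \frac{\partial D}{\partial \gamma_R^*}, \quad F_{1,s} \equiv \frac{\partial F_1}{\partial \gamma_s}, \quad F_{1,r} \equiv \frac{\partial F_1}{\partial \gamma_R^*}, \quad F_{2,s} \equiv \frac{\partial F_2}{\partial \gamma_s}, \quad F_{2,r} \equiv \frac{\partial F_2}{\partial \gamma_R^*}. \tag{27}$$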


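Expressions for these quantities can be found from (7), (8), (21), and they lead to the four inequalities (28). These inequalities can be shown to be strict for $(\gamma_s, \gamma_R^*) \in \text{interior}(T)$. In addition we introduce notation for the derivatives of the function $D^*$, which we recall is defined for $(\sigma^*, v^*) \in \Gamma_{R,ii}$:

$$D_v^* \equiv \frac{\partial D^*}{\partial v^*}, \qquad D_\sigma^* \equiv \frac{\partial D^*}{\partial \sigma^*}.$$

Differentiating (7), (8), (21) yields

$$D_v^* \circ F = \frac{\left[\tau'(\gamma_s)(\gamma_R^* - \gamma_s) - (\tau(\gamma_R^*) - \tau(\gamma_s))\right]\left[A(\gamma_s, \gamma_R^*) - (\tau(\gamma_R^*) - \tau(\gamma_s))(\gamma_R^* - \gamma_s)\right]}{(\gamma_R^* - \gamma_s)\left[\left(\tau'(\gamma_s)(\gamma_R^* - \gamma_s)\right)^{1/2} - \left(\tau(\gamma_R^*) - \tau(\gamma_s)\right)^{1/2}\right]^2}$$

(29), (30)

while the definition of $D^*$ gives directly

$$D_\sigma^* \circ F = \left( D_r - (D_v^* \circ F)\, F_{2,r} \right) / F_{1,r} \tag{31}$$

for $(\gamma_s, \gamma_R^*) \in T$. Define

$$\begin{aligned}
\varphi_v^- &= \{(\sigma^*, v^*) \mid (\sigma^*, v^*) \in \Gamma^B_{R,ii},\ D_v^*(\sigma^*, v^*) < 0\},\\
\varphi_v^0 &= \{(\sigma^*, v^*) \mid (\sigma^*, v^*) \in \Gamma^B_{R,ii},\ D_v^*(\sigma^*, v^*) = 0\},\\
\varphi_v^+ &= \{(\sigma^*, v^*) \mid (\sigma^*, v^*) \in \Gamma^B_{R,ii},\ D_v^*(\sigma^*, v^*) > 0\}.
\end{aligned} \tag{32}$$

For the purpose of classifying points $P \in \partial\Gamma^B_{R,ii}$, the boundary of $\Gamma^B_{R,ii}$, the function $D_v^*$ is defined in terms of limiting values. In a similar fashion, by replacing $D_v^*(\sigma^*, v^*)$ with $D_\sigma^*(\sigma^*, v^*)$ in (32), we define $\varphi_\sigma^-$, $\varphi_\sigma^0$ and $\varphi_\sigma^+$.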
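The derivative of the dissipation with respect to the contact velocity can be obtained from the chain rule in a few lines. The following derivation is a sketch, not reproduced from the paper; it assumes the area term in (8) has the form chord area minus area under the response curve, and it abbreviates $\Delta\tau = \tau(\gamma_R^*) - \tau(\gamma_s)$, $\Delta\gamma = \gamma_R^* - \gamma_s$ (these abbreviations are introduced here only for convenience).

```latex
% Chain rule applied to D = D^* \circ F, using F_{1,s} = 0 because F_1 depends only on \gamma_R^*:
\begin{align*}
D_s &= (D_\sigma^* \circ F)\,F_{1,s} + (D_v^* \circ F)\,F_{2,s} = (D_v^* \circ F)\,F_{2,s},\\
D_r &= (D_\sigma^* \circ F)\,F_{1,r} + (D_v^* \circ F)\,F_{2,r}.
\end{align*}
% Direct differentiation of D = S A and of F_2 with respect to \gamma_s gives
\begin{align*}
D_s &= \frac{\bigl(\tau'(\gamma_s)\,\Delta\gamma - \Delta\tau\bigr)\bigl(\Delta\tau\,\Delta\gamma - A\bigr)}
            {2\,S\,\Delta\gamma^{2}},
&
F_{2,s} &= -\,\frac{\Bigl[\bigl(\tau'(\gamma_s)\,\Delta\gamma\bigr)^{1/2} - \Delta\tau^{1/2}\Bigr]^{2}}
                  {2\,(\Delta\tau\,\Delta\gamma)^{1/2}} .
\end{align*}
% Dividing, and using (\Delta\tau\,\Delta\gamma)^{1/2} / S = \Delta\gamma:
\begin{equation*}
D_v^* \circ F = \frac{D_s}{F_{2,s}}
 = \frac{\bigl(\tau'(\gamma_s)\,\Delta\gamma - \Delta\tau\bigr)\bigl(A - \Delta\tau\,\Delta\gamma\bigr)}
        {\Delta\gamma\,\Bigl[\bigl(\tau'(\gamma_s)\,\Delta\gamma\bigr)^{1/2} - \Delta\tau^{1/2}\Bigr]^{2}} .
\end{equation*}
```

Two consequences are visible at once: $F_{2,s} \le 0$, so at fixed contact stress the contact velocity decreases as $\gamma_s$ increases; and solving the second chain-rule identity for $D_\sigma^* \circ F$ gives $(D_r - (D_v^* \circ F) F_{2,r}) / F_{1,r}$.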


We now turn to examine the loci $\varphi_v^-$, $\varphi_v^0$ and $\varphi_v^+$. It can be shown that the equation

$$A(M^s(\gamma), \gamma) - \left(\tau(\gamma) - \tau(M^s(\gamma))\right)\left(\gamma - M^s(\gamma)\right) = 0 \tag{33}$$

has a unique root obeying $\gamma \in (\gamma_q, \gamma_f)$. This root shall be denoted $\gamma_w$. Let

$$P_6 = F(M^s(\gamma_w), \gamma_w), \qquad v_3 = \theta(P_6), \tag{34}$$

and note that $P_6 \in \varphi^s|_{(P_3,P_5)}$ and $v_1 < v_3 < v_2$. The main properties of the loci $\varphi_v^-$, $\varphi_v^0$ and $\varphi_v^+$ in the $(\sigma^*, v^*)$-plane are now summarized in

Theorem 6: $\varphi_v^0$ is a finite connected curve with endpoints $P_2$ and $P_6$. Between these points it is monotonically increasing and confined to interior($\Gamma^B_{R,ii}$). Both $\varphi_v^-$ and $\varphi_v^+$ are simply connected regions. The boundary $\partial\varphi_v^-$ of $\varphi_v^-$ is given by $\varphi_v^0 \cup \varphi_s \cup \varphi^s|_{[P_6,P_5]}$. The boundary $\partial\varphi_v^+$ of $\varphi_v^+$ is given by $\varphi_v^0 \cup \varphi^s|_{[P_2,P_6]}$.


Uniqueness of Maximally Dissipative Solutions within the one-parameter family of Dynamical State Solutions for $v \in [v_3, v_2)$

If $v \in (v_1, v_2)$, Theorems 1, 4 and 6 indicate that the curve $\Pi(v)$ will intersect $\varphi_v^0$ if and only if $v \in (v_1, v_3]$, and moreover that this intersection is then unique. Accordingly we define $P_o(v) \equiv \Pi(v) \cap \varphi_v^0$ for $v \in (v_1, v_3]$. Thus for $v \in (v_1, v_3)$ the curve $\Pi(v)$ is contained in $\varphi_v^-$ between $P_s(v)$ and $P_o(v)$, and is contained in $\varphi_v^+$ between $P_o(v)$ and $P^s(v)$. However for $v \in (v_3, v_2)$ the curve $\Pi(v)$ is confined to $\varphi_v^-$ everywhere between $P_s(v)$ and $P^s(v)$.

An immediate consequence of (28) and (31) is that

$$\text{interior}(\varphi_v^-) \subset \text{interior}(\varphi_\sigma^+), \tag{35}$$

and

$$\varphi_v^0 \cap \varphi_\sigma^0 \cap \text{interior}(\Gamma^B_{R,ii}) = \emptyset. \tag{36}$$

The latter result precludes the possibility of any locus $\Xi(d)$ containing an isolated point or a limit point within interior($\Gamma^B_{R,ii}$). These results, in conjunction with Theorem 5, can be shown to yield

Theorem 7: For $0 < d < D(0, \gamma_f)$, each locus $\Xi(d)$ consists of a finite curve with one endpoint on $\varphi_s|_{(P_4,P_5)}$ and the other endpoint on $\varphi^s|_{(P_3,P_5)}$. Moreover

$$\bigcup_{d \in [0, D(0,\gamma_f)]} \Xi(d) = \Gamma^B_{R,ii}. \tag{37}$$

It is convenient to define a slope function $\nu$ in the $(\sigma^*, v^*)$-plane as follows. If $\xi$ is any curve in the $(\sigma^*, v^*)$-plane and if $P \in \xi$, then the slope $dv^*/d\sigma^*$ of the curve $\xi$ at P will be denoted by $\nu(\xi, P)$. In particular if $P \in \Xi(d)$ for some $d \in (0, D(0, \gamma_f))$ then

$$\nu(\Xi(d), P) = -\,\frac{D_\sigma^*(P)}{D_v^*(P)}. \tag{38}$$

Hence (35) indicates that the curves $\Xi(d)$ are monotonically increasing in the region $\varphi_v^-$. The following theorem can be established:

Theorem 8:

$$\Omega(v) = P^s(v), \quad \text{for } v \in [v_3, v_2), \qquad \Omega(v) \subset \varphi_\sigma^+ \cap \text{interior}(\varphi_v^+), \quad \text{for } v \in (v_1, v_3). \tag{39}$$

Theorem 8 indicates that maximally dissipative solutions are unique for $v \in [v_3, v_2)$ and that nonuniqueness of maximally dissipative solutions remains at issue only for $v \in (v_1, v_3)$.

On the uniqueness of Maximally Dissipative Dynamical Solutions for $v \in (v_1, v_3)$

In view of Theorem 8 it follows that

$$\nu(\Xi(d), P) = \nu(\Pi(v), P), \qquad d = D^*(\sigma^*, v^*), \tag{40}$$

if $P = (\sigma^*, v^*) \in \Omega(v)$ and $v \in (v_1, v_3)$.

Conditions (40) are necessary, but not sufficient, to establish that a particular point $P = (\sigma^*, v^*)$ is a maximally dissipative dynamical solution when $v \in (v_1, v_3)$. For this range of impact velocity, one must in addition examine the convexity of the curves $\Xi(d)$ in the $(\sigma^*, v^*)$-plane. To this end we shall define a "curvature" function $\nu\nu$ in the obvious way. Namely, if $\xi$ is any curve in the $(\sigma^*, v^*)$-plane and if $P \in \xi$, then the "curvature" $d^2v^*/d\sigma^{*2}$ of the curve $\xi$ at P will be denoted by $\nu\nu(\xi, P)$. It can be shown that points P obeying (40) will be unique if condition (Q1) is satisfied, where

Q1) $\nu\nu(\Xi(d), P) > 0$, $d = D^*(\sigma^*, v^*)$, for all $P = (\sigma^*, v^*) \in \varphi_\sigma^+ \cap \text{interior}(\varphi_v^+)$.

Thus (Q1) is sufficient to ensure the uniqueness of maximally dissipative dynamical state solutions to the impact problem for the range $v \in (v_1, v_3)$, and hence for the full range of impact velocities v > 0.

Determining conditions that are necessary and sufficient for (Q1) to hold has remained an elusive goal. Accordingly we shall here introduce two additional constitutive hypotheses on the stress response for the target material:

HT6) $\tau'(\gamma_b) > \tau'(0)$,

HT7) $\tau''(\gamma) = 0$ for $\gamma \in (\gamma_b, \gamma_w)$.

The calculation of $\nu\nu(\Xi(d), P)$ followed by a fairly elaborate analysis can now be shown to yield

Theorem 9: The additional constitutive hypotheses (HT6) and (HT7) ensure (Q1). ■


Hence the constitutive hypotheses (HI1) for the impactor, and (HT1)-(HT7) for the target, are sufficient to ensure that the impact problem studied here has a unique maximally dissipative solution for all impact velocities v > 0.


An Inverse Riemann Problem for Impact involving Phase Transitions*


Thomas J. Pence

Dept. of Metallurgy, Mechanics and Materials Science
Michigan State University
East Lansing, MI 48824-1226


Introduction 

Phase transitions in solids can be modelled in a continuum mechanical framework through the consideration of nonlinearly elastic materials whose strain energy density is a nonconvex function of strain. Different regions in strain space can then be identified with different material phases. For such materials, displacement gradients can, in certain situations, become discontinuous across internal surfaces. These surfaces are viewed as phase boundaries whenever they separate regions involving strains corresponding to different material phases. In an earlier paper which also appears in these proceedings [1]**, the Riemann problem for impact has been investigated for a case of impact between a target composed of a two phase material and an impactor composed of a single phase material. As is well known, Riemann problems involving phase transitions lead to nonunique solutions even after imposition of admissibility conditions due to Lax, and the problem under study here is no exception. A methodology is outlined in [1] for determining the families of solutions associated with any such Riemann problem; it involves mapping candidate dynamical states for both impactor and target to a $(\sigma^*, v^*)$-plane of contact interface stress vs. contact interface velocity. Each intersection of the contact locus for the impactor with the contact locus for the target corresponds to a solution of the impact problem. For the materials under study in [1], the target locus consists of a region from which emanate two curves, while the impactor locus consists of only a curve. There are two features of this methodology that are quite useful: the technique is global, so that assumptions on the size of the data characterizing the Riemann problem are not necessary, and admissibility criteria for selecting from among the possibly nonunique solutions can be studied within this context.

The admissibility criterion adopted in [1], and to be considered here as well, is the maximum dissipation (rate) criterion; this criterion was first introduced by Dafermos [2] under the different name of the entropy rate criterion. By investigating the dissipation topography associated with the target locus, conditions on the strain energy density of the two phase target material have been found which ensure that there is exactly one maximally dissipative solution to the Riemann problem for impact with any single phase impactor material at any impact velocity v.

This gives rise to an inverse problem that can be stated roughly as follows: "Suppose that a target is composed of such a two phase material. It is desired to determine whether a given candidate dynamical state in the associated contact locus can be generated under the maximum dissipation criterion by judiciously choosing a simple single phase material for the impactor as well as the impact velocity v." This type of inverse Riemann problem, which bears directly on the extent to which dynamical states in phase transforming materials can be controlled, is the focus of this paper. It will be shown that only a certain subfamily of candidate dynamical states for the target (the maximal dynamical states and the semimaximal dynamical states) can be generated in this fashion. Finally, the extent to which whole families of dynamical states can be attained as the impact velocity v is varied will be addressed.

For the sake of expediency, this paper will be regarded as a continuation of [1]. Consequently, notation and results will be carried over from [1] without further elaboration. The reader is also urged to consult [1] for a list of associated references. In addition, equations appearing in [1] shall be referred to directly by the equation number followed by a superscript #, e.g. $(12)^\#$ is to be read as (12) in [1].

*Supported by the U.S. Army Research Office under DAAL03-89-G-0089.

For ease of reference, we shall define a simple single phase material to be one for which $f(\gamma)$ obeys $(\text{HI1})^\#$; we shall also define a simple two phase material to be one for which $\tau(\gamma)$ obeys $(\text{HT1})^\#$-$(\text{HT5})^\#$. In [1] it is shown that if a simple two phase target material gives rise to a contact locus $\Gamma_R$ such that $(\text{Q1})^\#$ holds, then the Riemann problem for impact will have a unique maximally dissipative solution (modulo solutions with stationary phase boundary) regardless of the choice of simple single phase impactor material and regardless as well of the impact velocity v. Let us thus also agree to define an elementary two phase material to be a simple two phase material for which the associated $\Gamma_R$ obeys $(\text{Q1})^\#$. A non-elementary simple two phase material is then, of course, a simple two phase material for which the associated $\Gamma_R$ does not obey $(\text{Q1})^\#$. As shown in [1], $(\text{HT6})^\#$ and $(\text{HT7})^\#$ are sufficient conditions to guarantee that a simple two phase material is in fact elementary. However, more general results regarding necessary and sufficient conditions to distinguish between elementary and non-elementary simple two phase materials have yet to be determined; in fact the existence of non-elementary simple two phase materials remains an open question.


Definition  of  the  Inverse  Riemann  Problem 

The purpose of the present work is to explore the following Inverse Riemann Problem (IRP):


Given a simple two phase target material, choose a candidate dynamical state for the target (i.e. a point $P_0 = (\sigma_0^*, v_0^*) \in \Gamma_R$); then we seek to determine whether this dynamical state in the target can be generated as a (maximally dissipative) solution to the Riemann problem for impact by judicious choice of either

(C1) simple single phase impactor material $f(\gamma)$,

and/or

(C2) impact velocity v?

(IRP)


For the remainder of this paper, unless otherwise noted, the term "generate" is to be understood in the above sense, namely to "generate using the maximum dissipation criterion in the event of multiple solutions." If the simple two phase material is elementary, then the phrase "a (maximally dissipative) solution" in (IRP) may be replaced by "the (maximally dissipative) solution".

The set of all points $P_0$ which can be so generated by judicious choice of either (C1) or (C2) will be said to constitute the well-posed set for (IRP). Two questions are now immediate:

(Q1) Can the well-posed set be determined precisely?

(Q2) Given $P_0$ in the well-posed set, what is the totality of choices for (C1) and (C2) that succeed in generating $P_0$?

To discuss these issues, imagine for the moment that one is presented with
both a simple two phase target material and a simple single phase impactor
material. Let $\hat{Q}$ be the locus of points generated in the $(\sigma^*, v^*)$-plane as the
impact velocity $\bar{v}$ varies from 0 to $\infty$, hence

$$\hat{Q} = Q(\bar{v})\big|_{\bar{v} \in (0,\bar{v}_1]} \cup Q(\bar{v})\big|_{\bar{v} \in (\bar{v}_1,\bar{v}_3)} \cup Q(\bar{v})\big|_{\bar{v} \in [\bar{v}_3,\bar{v}_2]} \cup Q(\bar{v})\big|_{\bar{v} \in (\bar{v}_2,\infty)} . \qquad (1)$$


It is to be noted that (2)$_2$ is a slightly stronger result than (39)$_2$. To
verify (2)$_2$ observe that

$$\partial(\varphi_\sigma^+ \cap \varphi_v^+) = \varphi_\sigma^0 \cup \varphi_v^0 = \mathrm{interior}(\varphi_\sigma^0) \cup P_2 \cup \mathrm{interior}(\varphi_v^0) \cup P_6 , \qquad (3)$$

since (see Figure 1)

$$\varphi_\sigma^0 \cap \varphi_v^0 = \partial\varphi_\sigma^0 = \partial\varphi_v^0 = P_2 \cup P_6 . \qquad (4)$$

Now, by virtue of (38), the constant dissipation loci $\Xi(\cdot)$ possess horizontal
tangents on interior($\varphi_\sigma^0$) and possess vertical tangents on interior($\varphi_v^0$). Since
Theorem 1 of [1] indicates that the slope of $\Gamma_1(\bar{v})$ is restricted to finite
negative values, it follows from (40) and the definition of $\Pi(\bar{v})$ that
candidate dynamical states for the target corresponding to points in either
interior($\varphi_\sigma^0$) or interior($\varphi_v^0$) cannot be associated with a maximally dissipative
solution for the Riemann problem under study.


It follows from (2) that the three subloci $Q(\bar{v})|_{\bar{v} \in (0,\bar{v}_1]}$,
$Q(\bar{v})|_{\bar{v} \in [\bar{v}_3,\bar{v}_2]}$, and $Q(\bar{v})|_{\bar{v} \in (\bar{v}_2,\infty)}$ are each a connected curve. In the
event that the simple two phase material is elementary, then the construction
utilizing (40)# which determines the sublocus $Q(\bar{v})|_{\bar{v} \in (\bar{v}_1,\bar{v}_3)}$ ensures that
this sublocus is also a connected curve initiating at $P_2$ and terminating at
$P_6$. Thus for an elementary two phase material the complete locus $\hat{Q}$ is a
connected curve (Figure 1). We shall comment later on the possibilities for
$Q(\bar{v})|_{\bar{v} \in (\bar{v}_1,\bar{v}_3)}$ in the event that the simple two phase material is not
elementary.


It is enlightening to consider how the locus $\hat{Q}$ changes as one varies
the associated simple single phase impactor material, while retaining the same
simple two phase target material. Then the values of $\bar{v}_1$, $\bar{v}_2$ and $\bar{v}_3$ will in
general change, whereas the right hand side of (2) remains the same. Thus the
dynamical states associated with the three subloci $Q(\bar{v})|_{\bar{v} \in (0,\bar{v}_1]}$,
$Q(\bar{v})|_{\bar{v} \in [\bar{v}_3,\bar{v}_2]}$, and $Q(\bar{v})|_{\bar{v} \in (\bar{v}_2,\infty)}$ will remain the same, even though the
value of impact velocity corresponding to each individual dynamical state in
the above list of sets would in general be expected to change. In contrast,
for $Q(\bar{v})|_{\bar{v} \in (\bar{v}_1,\bar{v}_3)}$ the dynamical states themselves will in general change.
Hence if the simple two phase target material is elementary, then different
simple single phase impactor materials would be expected to give rise to
different curves connecting $P_2$ to $P_6$.


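Maximal, Semi-maximal and Non-maximal Dynamical States

In view of the above discussion it is useful to partition $\Gamma_2$ into the
following three distinct sets:

$$\Gamma_2^{max} = \Gamma_{2,I}\big|_{(0,P_2]} \cup \varphi\big|_{[P_6,P_5]} \cup \Gamma_{2,II},$$

$$\Gamma_2^{sem} = \mathrm{interior}(\varphi_\sigma^+ \cap \varphi_v^+), \qquad (5)$$

$$\Gamma_2^{non} = \Gamma_2 - (\Gamma_2^{max} \cup \Gamma_2^{sem}).$$

It then follows immediately from the considerations given above that

$$\Gamma_2^{max} \subset \Gamma_2^{wp}, \qquad \Gamma_2^{non} \cap \Gamma_2^{wp} = \emptyset. \qquad (6)$$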
Moreover each $P_0 \in \Gamma_2^{max}$ is generated for every simple single phase impactor
material provided that the impact velocity for a given simple single phase
impactor material is chosen to be $\bar{v} = \Theta(P_0)$ (c.f. (24)#). For this reason,
points $P_0 \in \Gamma_2^{max}$ will be said to be maximal dynamical states in the sense that
(IRP) has a solution for all such points regardless of the choice of simple
single phase impactor material. On the other hand, points $P_0 \in \Gamma_2^{non}$ will be
said to be non-maximal dynamical states in the sense that (IRP) has no
solution for any such point.


Figure 1: The connected curve $\hat{Q}$ for an elementary two phase target
material. The determination of $\hat{Q}$ at $P_0 \in \Gamma_2^{sem}$ by the osculation
condition (40) is indicated.


The remaining locus $\Gamma_2^{sem}$ will be said to comprise the semi-maximal
dynamical states. Notice from (5) and (38)# that $\Gamma_2^{sem}$ consists of points in
$\Gamma_2$ through which the associated constant dissipation locus $\Xi(\cdot)$ has negative
slope. One may conclude from the previous discussion that at least some
points in $\Gamma_2^{sem}$ are also contained in $\Gamma_2^{wp}$, but that any such points in
$\Gamma_2^{sem} \cap \Gamma_2^{wp}$ will certainly not be generated for every choice of simple single
phase impactor material. Now according to (40)#, a necessary condition for a
point $P_0 \in \Gamma_2^{sem}$ to be generated as a solution to the Riemann problem for a given
simple single phase material and impact velocity $\bar{v}$ is that $\Pi(\bar{v})$ osculate the
associated constant dissipation locus at $P_0$ (Figure 1). Furthermore, this
condition is sufficient provided that the simple two phase material is in fact
elementary. Thus in order to determine whether or not a point $P_0 \in \Gamma_2^{sem}$ is
contained in $\Gamma_2^{wp}$, it is necessary to determine the extent to which $\Pi(\bar{v})$ and
hence $\Gamma_1(\bar{v})$ can be made, through selection of the stress response
function $\bar{\tau}(\gamma)$ and impact velocity $\bar{v}$, to assume a given form in the
$(\sigma^*, v^*)$-plane. Now Theorem 1 of [1] serves to impose certain limitations on any
such form. The following theorem, which is a converse to Theorem 1 of [1], shows
that any form obeying the restrictions set forth in Theorem 1 of [1] is, in
fact, attainable.
Theorem 1 - On the existence of a simple single phase
material with a given contact locus form:

Let $I(\cdot): [0,\infty) \to (-\infty,0]$ obey $I(0)=0$, $I'(\cdot)<0$, $I''(\cdot)<0$. For $\bar{v} > 0$, let
$\mathcal{X}(\bar{v}) = \{(\sigma^*, v^*) \mid v^* = \bar{v} + I(\sigma^*),\ \sigma^* > 0\}$. Then there exists a stress response
function $\bar{\tau}(\gamma)$ corresponding to a simple single phase material such that the
associated $\Gamma_1(\bar{v}) = \mathcal{X}(\bar{v})$.

Proof. For s > 0 define


Fix now a point $P_0 = (\sigma_0^*, v_0^*) \in \Gamma_2^{sem}$; then by taking any function
$I(\cdot): [0,\infty) \to (-\infty,0]$ such that

$$I(0)=0, \quad I'(\cdot)<0, \quad I''(\cdot)<0, \quad I'(\sigma_0^*) = \nu(\Xi(d_0),P_0), \quad d_0 = D^*(\sigma_0^*,v_0^*), \qquad (9)$$

it follows that $\bar{\tau}(\gamma)$ constructed on the basis of Theorem 1 will lead to the
satisfaction of (40) provided that the impact velocity is then taken to be
$\Theta(P_0)$. (Notice that an infinity of functions $I(\cdot)$ exist which satisfy (9).)
Consequently it may be concluded that

$$\Gamma_2^{sem} \subset \Gamma_2^{wp}, \qquad (10)$$

provided that the simple two phase target material is an elementary two phase
material. For a non-elementary two phase target material we may only conclude
that the necessary condition (40) can be met.
Prescribing the Impact Velocity

Until further notice, let us agree to confine attention to elementary
two phase target materials. For $P_0 = (\sigma_0^*, v_0^*) \in \Gamma_2^{sem}$, we shall define

$$v_m = v_0^*, \qquad v_M = v_0^* - \sigma_0^*\,\nu(\Xi(d_0),P_0), \qquad d_0 = D^*(\sigma_0^*,v_0^*). \qquad (11)$$

It then follows from the convexity of the loci $\Gamma_1$ in conjunction with (40)
that if $P_0$ is generated as the solution to (IRP), then the impact velocity $\bar{v}$
must obey

$$v_m < \bar{v} < v_M . \qquad (12)$$


Furthermore, Theorem 1 ensures that for any $\bar{v}$ obeying (12) there will exist an
infinity of simple single phase impactor materials that will generate $P_0$ as
the solution to the impact problem with impact velocity $\bar{v}$. A simple single
phase material stress response $\bar{\tau}(\gamma)$ which generates $P_0$ at an impact velocity
near $v_m$ must then have a locus $\Gamma_1$ that is essentially horizontal until just
before $\sigma_0^*$, where it must bend sharply in order to meet (40). Conversely, a
simple single phase material stress response $\bar{\tau}(\gamma)$ which generates $P_0$ at an
impact velocity near $v_M$ must have a locus $\Gamma_1$ that is essentially linear from
$(\sigma^*, v^*) = (0,\bar{v})$ to $(\sigma^*, v^*) = (\sigma_0^*, v_0^*)$ (see Figure 2). According to (12), the
absolute value of the slope $\nu(\Gamma_1,(\sigma^*, v^*))$ goes like the reciprocal of the
square root of the material stiffness at stress value $\sigma^*$. Consequently,
generating $P_0$ near the lower bound impact velocity $v_m$ requires a simple single
phase impactor material with a very high stiffness (i.e. essentially rigid
response) for all stresses $\bar{\tau}$ obeying $0 < \bar{\tau} < \sigma_0^*$. On the other hand,
generating $P_0$ near the upper bound impact velocity $v_M$ requires a material with
an essentially linear stress response for all stresses $\bar{\tau}$ obeying $0 < \bar{\tau} < \sigma_0^*$,
where the stiffness is determined by the slope of the constant dissipation
locus at $P_0$.

One should note that the absence of an osculation condition like (40)#
for dynamical states $P_0 = (\sigma_0^*, v_0^*) \in \Gamma_2^{max}$ indicates that the associated
range of impact velocities $\bar{v}$ that succeed in generating $P_0 \in \Gamma_2^{max}$ (by
suitable choice of simple single phase impactor material) is simply $\bar{v} > v_0^*$.
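The window of admissible impact velocities can be illustrated numerically. The following is a minimal sketch, assuming the reading $v_m = v_0^*$, $v_M = v_0^* - \sigma_0^*\nu$ of (11), where $\nu < 0$ denotes the slope of the constant dissipation locus at $P_0$. The function names and the quadratic family chosen for $I(\cdot)$ are illustrative assumptions and are not taken from the paper.

```python
def impact_velocity_window(sigma0, v0, nu):
    """Return (v_m, v_M) for a semi-maximal state P0 = (sigma0, v0).

    nu is the (negative) slope of the constant dissipation locus at P0.
    Assumed reading of (11): v_m = v0 and v_M = v0 - sigma0 * nu.
    """
    if sigma0 <= 0 or nu >= 0:
        raise ValueError("expected sigma0 > 0 and nu < 0")
    return v0, v0 - sigma0 * nu


def quadratic_contact_locus(sigma0, v0, nu, curvature):
    """Hypothetical quadratic I(s) = a*s + 0.5*curvature*s**2 satisfying (9).

    curvature < 0 gives I'' < 0; a = nu - curvature*sigma0 enforces
    I'(sigma0) = nu, and a < 0 then gives I' < 0 for all s >= 0.
    Returns (a, impact velocity), where the impact velocity v0 - I(sigma0)
    makes the locus v = impact velocity + I(s) pass through P0.
    """
    a = nu - curvature * sigma0
    if not (curvature < 0 and a < 0):
        raise ValueError("I violates I' < 0 or I'' < 0")
    i_at_sigma0 = a * sigma0 + 0.5 * curvature * sigma0 ** 2
    return a, v0 - i_at_sigma0
```

For any admissible curvature the returned impact velocity falls strictly inside the window, and it approaches $v_M$ as the curvature tends to zero (the essentially linear locus described above). This quadratic family only reaches the upper half of the window; impact velocities near $v_m$ need a locus that stays nearly flat and bends sharply just before $\sigma_0^*$.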
If we monitor the values of $v_m$ and $v_M$ as $P_0$ roams within $\Gamma_2^{sem}$, one may
then conclude from (11) and (38) that


if  P  -  P  €  *  , 

o  a  ' 


if  P  -Pep  , 
o  v  ’ 




Figure 2: The window of impact velocities (12) for $P_0 = (\sigma_0^*, v_0^*) \in \Gamma_2^{sem}$.
Also shown are contact loci for two different simple single phase
impactor materials that generate this $P_0$. In both cases the osculation
condition (40) must hold. Here material-i is more compliant than material-ii
at stresses less than $\sigma_0^*$ and so requires a greater impact velocity to
generate $P_0$.


Let $P_2 = (\sigma_2^*, v_2^*)$, $P_5 = (\sigma_5^*, v_5^*)$, so that $\sigma_2^* = \tau(\gamma_b)$ and $\sigma_5^* = \tau(\gamma_f)$ (c.f. (14)
and (15)#). Suppose that $\sigma^*$ obeys $\sigma_2^* < \sigma^* < \sigma_5^*$. It can then be shown that if
one travels from $\varphi_\sigma^0$ to $\varphi_v^0$ along the vertical line segment associated with the
value $\sigma^*$, then the corresponding dynamical states involve phase boundaries
with progressively greater phase boundary velocities. Thus, in this sense,
points $P_0$ near $\varphi_\sigma^0$ correspond to dynamical states with a relatively low phase
boundary velocity while points $P_0$ near $\varphi_v^0$ correspond to dynamical states with
a relatively high phase boundary velocity. It is thus interesting to note
that the window of possible impact velocities (12) is very small for dynamical
states of the former type, and that generating these dynamical states
necessitates the use of simple single phase impactor materials that are
essentially rigid.


The Riemann Problem for Target Materials that are not Elementary

We  now  turn  to  consider  in  more  detail  certain  aspects  of  the  Riemann 
problem  for  simple  two  phase  target  materials  that  are  not  elementary. 

Perhaps the most interesting aspect of such problems is that the possibility
exists that the locus $Q(\bar{v})|_{\bar{v} \in (\bar{v}_1,\bar{v}_3)}$ will not be connected! To see this,
let $d_0$ be a value for the dissipation such that the constant dissipation locus
$\Xi(d_0)$ intersects $\Gamma_2^{sem}$. Assume that a contact locus associated with impact
velocity $\bar{v}_a$ osculates $\Xi(d_0)$ at a point, say $P_a$, at which $\Xi(d_0)$ has positive
curvature. This then assures that the dynamical state associated with $P_a$
provides a local maximum to the dissipation on the (pre-criterion) solution
locus $\Pi(\bar{v}_a)$. The question then arises as to whether or not $\Pi(\bar{v}_a)$ might also
osculate $\Xi(d_0)$ at some other point, say $P_{aa}$, at which $\Xi(d_0)$ also has positive
curvature. Of course the convexity condition on $\Gamma_1$ embodied in Theorem 1 of
[1] indicates that this is not possible unless $\nu\nu(\Xi(d_0),P_0) < 0$ for some point
$P_0 \in \Xi(d_0)$ between $P_a$ and $P_{aa}$. Now, by definition, a non-elementary simple
two phase material is one for which there exists a dissipation value $d_0$ and a
point $P_0 \in \Gamma_2^{sem} \cap \Xi(d_0)$ such that the anomalous convexity condition
$\nu\nu(\Xi(d_0),P_0) < 0$ is met. In fact, given such a $d_0$ and $P_0$, Theorem 1 suggests
that a simple single phase impactor material exists such that the associated
$\Gamma_1$ osculates two points $P_a$ and $P_{aa}$ in the manner described above (Figure 3).

JL  3.  33 


Figure 3: The anomalous convexity of the dissipation loci $\Xi(\cdot)$ for a
non-elementary two phase target material may give rise to a locus $\hat{Q}$ that is
disconnected. The continuations represent dynamical states that are locally
maximally dissipative on $\Gamma_1(\bar{v})$ but not globally maximally dissipative.


Consequently, let us assume that we have identified a simple single phase
impactor material $\bar{\tau}(\gamma)$ and an impact velocity $\bar{v}_a$, such that the above scenario
obtains. Then the dynamical states associated with $P_a$ and $P_{aa}$ provide local
maxima to the dissipation on the solution locus $\Pi(\bar{v}_a)$. In fact, these maxima
each have the same dissipation value $d_0$. If, as may well be the case, these
local maxima are in fact global maxima, then the dynamical states associated
with $P_a$ and $P_{aa}$ each provide a maximally dissipative solution to the impact
problem at impact velocity $\bar{v}_a$. Now within the above scenario the generic case
would have the following features:

(i) $P_a$ and $P_{aa}$ will provide the only osculation points of $\Pi(\bar{v}_a)$ with
$\Xi(d_0)$,

(ii) as $\bar{v}$ departs from $\bar{v}_a$, the local maxima that "evolve" from $P_a$ and $P_{aa}$
will not have identical dissipation values.
Hence the generic case is that the impact problem will have a unique maximally
dissipative solution for each impact velocity near $\bar{v}_a$, with the special impact
velocity $\bar{v}_a$ giving rise to two maximally dissipative solutions because it
induces an exchange in global maxima of dissipation on the $\bar{v}$-parametrized
solution sets $\Pi(\bar{v})$. For the situation that we have just described,
the locus $Q(\bar{v})|_{\bar{v} \in (\bar{v}_1,\bar{v}_3)}$ will be discontinuous at $\bar{v} = \bar{v}_a$ (Figure 3).
Furthermore, if a norm is placed upon dynamical state solutions to Riemann
problems, then this norm is also likely to suffer a discontinuity on
$Q(\bar{v})|_{\bar{v} \in (\bar{v}_1,\bar{v}_3)}$ at the impact velocity $\bar{v}_a$.


Extended  versions  of  the  Inverse  Riemann  Problem 

In view of the preceding discussion, we shall henceforth confine
attention to elementary two phase target materials so that the locus $\hat{Q}$ is
guaranteed to be connected. Then the inverse Riemann problem (IRP) can be
regarded as determining whether or not certain choices for the simple single
phase impactor material (C1) will result in a given point $P_0 \in \Gamma_2$ being
contained in the associated connected curve $\hat{Q}$. If such is indeed the case,
then the associated impact velocity is necessarily given by $\bar{v} = \Theta(P_0)$. As
already shown, the choice of (C1) is crucial only if $P_0 \in \Gamma_2^{sem}$.

It is thus natural to consider an extended inverse problem of the
following type: given n distinct points in $\Gamma_2^{sem}$, is it possible to select a
simple single phase material such that the associated $\hat{Q}$ will contain all n
points? In fact, one can enquire into the general extent to which the curve $\hat{Q}$
can be prescribed within the region $\Gamma_2^{sem}$.

Let us agree to denote the inverse Riemann problem in which n points
in $\Gamma_2^{sem}$ are prescribed as a problem of type (IRP-n). It has already been
shown that a problem of type (IRP-1) is always well posed. For n>1 there do,
of course, exist problems of type (IRP-n) that are also well posed (e.g.
choose n points on the $\hat{Q}$ associated with some fixed $\bar{\tau}(\gamma)$). Conversely, for
any n>1, there exist problems of type (IRP-n) that are not well posed. To
verify this let us suppose that two points in the prescription of such a
problem are $P_0 = (\sigma_0^*, v_0^*)$, $P_{00} = (\sigma_{00}^*, v_{00}^*) \in \Gamma_2^{sem}$. Let us first of all assume that
$\sigma_{00}^* > \sigma_0^*$. It then follows from (40)# and Theorem 1 of [1] that a necessary
condition for the problem to be well posed is that

$$\nu(\Xi(d_{00}),P_{00}) < \nu(\Xi(d_0),P_0), \qquad d_{00} = D^*(\sigma_{00}^*,v_{00}^*), \quad d_0 = D^*(\sigma_0^*,v_0^*). \qquad (14)$$

Secondly, if we assume that $\sigma_{00}^* = \sigma_0^*$, then (14) is to be replaced by

$$\nu(\Xi(d_{00}),P_{00}) = \nu(\Xi(d_0),P_0), \qquad d_{00} = D^*(\sigma_{00}^*,v_{00}^*), \quad d_0 = D^*(\sigma_0^*,v_0^*), \qquad (15)$$

as a necessary condition for the well-posedness of (IRP-n). Since it is
certainly possible to choose points $P_0$ and $P_{00}$ violating either (14) or (15),
this establishes the existence of problems of type (IRP-n) that are not well
posed for each $n \ge 2$.

Clearly (15) is also a severe restriction on the possibility that
curves $\hat{Q}$ are not mapped bijectively to the $\sigma^*$-axis. Letting $P_{00}$ approach $P_0$
in (15) one may obtain the following necessary condition for a point $P_0 \in \Gamma_2^{sem}$
to be a location of vertical tangency for the curve $\hat{Q}$:

$$D^*_{vv} D^*_{\sigma} - D^*_{\sigma v} D^*_{v} = 0, \qquad (16)$$

where $D^*_{vv}$, $D^*_{\sigma v}$ (and $D^*_{\sigma\sigma}$) are second derivatives of $D^*(\sigma^*, v^*)$ defined
analogously to $D^*_\sigma$ and $D^*_v$ (c.f. (29)#). It shall be shown shortly that (16) is
also a sufficient condition for any smooth curve $\hat{Q}$ to suffer a vertical
tangency at a point $P_0 \in \Gamma_2^{sem}$. Consequently if $\hat{Q}$ is a smooth curve passing
through a point $P_0 \in \Gamma_2^{sem}$ obeying (16), then $P_0$ is necessarily a point of
vertical tangency for $\hat{Q}$. Whether or not such points exist, however, remains
an open question.

A problem in which a curve segment within $\Gamma_2^{sem}$ is prescribed for an
inverse Riemann problem will be said to be a problem of type (IRP-$\infty$). Let us
agree to limit attention to such problems for cases in which the prescribed
curve is mapped bijectively to the $\sigma^*$-axis so that the curve segment can be
parametrized by $v^* = \chi(\sigma^*)$ on some $\sigma^*$-subinterval of the interval
$\tau(\gamma_b) \le \sigma^* \le \tau(\gamma_w)$ (c.f. (34)#), say $\sigma_0^* < \sigma^* < \sigma_{00}^*$. Then the considerations leading to
(14) in conjunction with (38)# indicate that a necessary condition for the
problem to be well posed is that

$$\frac{d}{d\sigma^*}\left[ -\frac{D^*_\sigma(\sigma^*,\chi(\sigma^*))}{D^*_v(\sigma^*,\chi(\sigma^*))} \right] < 0, \qquad \sigma_0^* < \sigma^* < \sigma_{00}^* . \qquad (17)$$

In fact, (17) is also a sufficient condition for the well-posedness of the
problem as can be seen from the following construction for the stress response
function $\bar{\tau}(\gamma)$ of a simple single phase impactor material which solves the
problem. First let

$$\eta(\sigma^*) = -\frac{D^*_\sigma(\sigma^*,\chi(\sigma^*))}{D^*_v(\sigma^*,\chi(\sigma^*))}, \qquad \sigma_0^* < \sigma^* < \sigma_{00}^* , \qquad (18)$$

and then extend this function to the range $\sigma^* > 0$ in such a fashion that
$\eta(\sigma^*) < 0$, $\eta'(\sigma^*) < 0$. Now define the function $I(\sigma^*)$ to be

$$I(\sigma^*) = \int_0^{\sigma^*} \eta(z)\, dz . \qquad (19)$$

Finally since the function $I(\sigma^*)$ so constructed satisfies the hypotheses of
Theorem 1, the stress response function $\bar{\tau}(\gamma)$ can be constructed as the inverse
function to H(s) as given in (7).
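The integration step of this construction is easy to carry out numerically. The sketch below is illustrative only: it assumes the extended slope function $\eta$ is available as a Python callable, tabulates $I(\sigma^*) = \int_0^{\sigma^*} \eta(z)\,dz$ with the trapezoid rule, and checks the hypotheses of Theorem 1 ($I(0)=0$, $I'<0$, $I''<0$) by finite differences. The function names and the sample $\eta$ are hypothetical, and the final inversion step through H(s) of (7) is not attempted here.

```python
def build_I(eta, sigma_max, n=1000):
    """Tabulate I(s) = integral from 0 to s of eta(z) dz on [0, sigma_max].

    Uses the composite trapezoid rule on a uniform grid of n intervals.
    Returns the grid and the corresponding values of I.
    """
    h = sigma_max / n
    grid = [k * h for k in range(n + 1)]
    values = [0.0]
    for k in range(n):
        values.append(values[-1] + 0.5 * h * (eta(grid[k]) + eta(grid[k + 1])))
    return grid, values


def satisfies_theorem1_hypotheses(grid, values, tol=1e-12):
    """Check I(0) = 0, I' < 0 and I'' < 0 on the grid by finite differences."""
    first = [values[k + 1] - values[k] for k in range(len(values) - 1)]
    second = [first[k + 1] - first[k] for k in range(len(first) - 1)]
    return (abs(values[0]) <= tol
            and all(d < 0 for d in first)
            and all(d < 0 for d in second))
```

For example, the hypothetical choice `eta = lambda s: -(1.0 + s)` is negative and decreasing, so the tabulated $I$ passes the check, whereas a slope function that increases anywhere on the interval fails it, which mirrors the role of condition (17).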


Since verification of (17) is an easy task (if the function $D^*(\sigma^*, v^*)$
is known), it is a simple matter to determine the well-posedness of any
problem of type (IRP-$\infty$) for a curve $(\sigma^*,\chi(\sigma^*)) \in \Gamma_2^{sem}$ on $\sigma_0^* < \sigma^* < \sigma_{00}^*$ where
$\tau(\gamma_b) \le \sigma_0^* < \sigma_{00}^* \le \tau(\gamma_w)$. In particular (17) can be employed as a necessary
and sufficient condition to test well-posedness for the case when this curve
connects $P_2$ to $P_6$, i.e. $\sigma_0^* = \tau(\gamma_b)$, $\sigma_{00}^* = \tau(\gamma_w)$,
$P_2 = (\sigma_0^*,\chi(\sigma_0^*))$, $P_6 = (\sigma_{00}^*,\chi(\sigma_{00}^*))$.


All of the extended versions of the inverse Riemann problems discussed
thus far can be thought of as global inverse Riemann problems in that their
specification is given in terms of properties of the prescribed locus $\hat{Q}$ at
more than one point in $\Gamma_2^{sem}$. In contrast a local inverse Riemann problem
will be one in which only properties at a single point of $\hat{Q}$ are prescribed.
The most immediate such problem is one in which the curve $\hat{Q}$ is required to
pass through a given point $P_0 = (\sigma_0^*, v_0^*) \in \Gamma_2^{sem}$ with a prescribed slope, say
$\nu^0$. Such problems will be said to be of type (IRP-$\nu$). In the absence of a
slope specification, the problem, now simply of type (IRP), is well posed
(c.f. (10)) and can be solved by means of the algorithm developed in
connection with Theorem 1. The immediate questions which come to mind are
thus:

(Q3) For a given $P_0 \in \Gamma_2^{sem}$, what values for the prescribed slope $\nu^0$ give
rise to a well posed problem?

(Q4) Given a well posed problem of type (IRP-$\nu$), what is an algorithm for
determining a simple single phase impactor material which solves the
problem?

To address these questions, consider a standard Riemann problem in which the
simple single phase impactor material stress response function $\bar{\tau}(\gamma)$ is given.
Let the corresponding contact locus be given by
$\Gamma_1(\bar{v}) = \{(\sigma^*, v^*) \mid v^* = \bar{v} + I(\sigma^*),\ \sigma^* > 0\}$ (c.f. Theorem 1). A calculation based
upon (38)# and (40)# then yields
where all of the derivatives $D^*_\sigma, \ldots, D^*_{vv}$ are to be evaluated at the point
$P_0 \in \Gamma_2^{sem}$ under consideration. In view of Theorem 1, it may be concluded that
a problem of type (IRP-$\nu$) is well posed for and only for those values of $\nu^0$
which can be obtained from (20) by choosing finite negative values for $I''(\sigma_0^*)$.
It thus follows that $|\nu^0| = \infty$ gives rise to a well posed problem of type (IRP-$\nu$)
at and only at points $P_0$ at which (16) holds; moreover at all such points the
value of $I''(\sigma_0^*)$ has no bearing on the solution to the problem. Thus it only
remains to consider points $P_0$ at which (16) does not hold. Letting $I''(\sigma_0^*)$
approach in turn both zero and negative infinity in (20) one thus finds that
(IRP-$\nu$) is well posed if and only if
(21); all possible curves $\hat{Q}$ through $P_0$ are then confined to the sector (21) in
a neighborhood of $P_0$. The algorithm for determining a simple single phase
impactor material which solves a well posed problem of type (IRP-$\nu$) is now
essentially the same algorithm used in solving (IRP) given in connection with
Theorem 1; the only difference is that the value of $I''(\sigma_0^*)$ is no longer
allowed to be arbitrary, but instead must be given by the value

$$I''(\sigma_0^*) = \left\{ (D^*_{vv} D^*_{\sigma} - D^*_{\sigma v} D^*_{v})\,\nu^0 - (D^*_{\sigma\sigma} D^*_{v} - D^*_{\sigma v} D^*_{\sigma}) \right\} / (D^*_v)^2 . \qquad (22)$$

In view of (16) and (21) it would be useful to partition $\Gamma_2^{sem}$ into regions
based on the sign of $D^*_{vv} D^*_{\sigma} - D^*_{\sigma v} D^*_{v}$. Although I have not yet examined this
issue, an interesting result does follow from the observation that if
$d = D^*(\sigma^*, v^*)$, then a direct calculation gives
Consequently  (Ql)  allows  us  to  draw 
In particular (25) indicates that if $P_0$ approaches a point at which (16) holds,
then the corresponding sector in which $\hat{Q}$ must locally lie will collapse onto a
vertical line.

As a closing remark, it may be noted that we have here considered an
extended local inverse Riemann problem in which only the first derivative of
the locus $\hat{Q}$ is prescribed at a point $P_0 = (\sigma_0^*, v_0^*) \in \Gamma_2^{sem}$. One could certainly
consider an extended local inverse Riemann problem in which the first k
derivatives of the locus $\hat{Q}$ are prescribed at a point $P_0 = (\sigma_0^*, v_0^*) \in \Gamma_2^{sem}$. On
the basis of the above development, one would expect that the solution to any
such problem would involve requiring definite values for the first k+1
derivatives of the function $I(\sigma^*)$ at $\sigma^* = \sigma_0^*$.
ABSTRACT 


Large  time  dependent  deformations  of  styrene-butadiene  copolymers 
(SBR)  and  polyurethanes  show  both  plastic  and  viscoelastic  effects.  Such 
materials  are  also  often  nearly  incompressible  and  have  nonlinear  elastic 
constitutive  laws.  The  development  of  finite  element  methods  to  analyze 
these  materials  is  an  active  area  of  research.  In  this  effort  a  recently 
developed  finite  element  algorithm  for  the  analysis  of  viscoelastic 
behavior  in  rubberlike  materials  is  applied  to  the  inflation  and  deflation 
of  a  thick  walled  visco-hyperelastic  sphere  under  internal  pressure.  The 
material  is  assumed  to  have  been  previously  worked  so  that  damage  effects 
do  not  significantly  contribute  to  the  deformations.  A  one  dimensional 
finite  element  analysis  is  constructed  for  incompressible  materials. 
Pressure  vs  time  curves  are  developed  for  both  the  elastic  and  loss  solids 
during  an  inflation  and  deflation  cycle. 


INTRODUCTION 

In a recent paper Johnson et al. (Ref. 1) demonstrated that the standard linear
solid could be generalized to a nonlinear solid and used to model large
viscoelastic deformations of elastomers. The model proposed in Ref. 1
consists of a hyperelastic solid in parallel with a nonlinear internal loss
solid. The elastic component of the loss solid has a hyperelastic
constitutive law and the dash pot force is proportional to the rate of change
of the shear stresses in the elastic component of the loss solid. This paper
extends the nonlinear solid model so that deformations resulting from time
dependent loadings can be conveniently computed. First we present some
information needed to define the nonlinear relaxation model. Then the
model is modified and used to simulate the inflation and deflation of a
visco-hyperelastic thick walled sphere. A finite element model for the
thick walled sphere is derived which includes a nonlinear internal solid
and a nonlinear dash pot in parallel with the sphere's hyperelastic model.
The equilibrium equations are coupled in such a way that the creep problem
can be solved using a standard integration method. Numerical tests are
performed by simulating the inflation and deflation of a visco-hyperelastic
sphere to demonstrate the finite element algorithm.


BACKGROUND 

There  are  numerous  papers  on  the  viscoelastic  behavior  of  elastomers. 
We  highlight  Refs.  2-7  as  a  source  of  background  information.  These  papers 
indicate  that  the  general  theory  of  Green  and  Rivlin  can  be  used  with  only 
two  or  three  constitutive  parameters  to  model  viscous  effects  in  many 
elastomers  (typical  visco-hyperelastic  solids).  That  result  is  significant 
since  it  implies  that  the  less  general  but  computationally  more  attractive 
nonlinear  solid  network  model  may  also  be  useful.  Next,  we  define  a 
nonlinear  solid,  similar  to  the  one  presented  in  Ref.  1,  which  we  later 
extend  to  the  case  of  inflation  and  deflation  of  thick  walled 
visco-hyperelastic  spheres. 

Consider the network model of a simple visco-hyperelastic solid given
in Figure 1. The energy in the solid is described using total Lagrangian
kinematics. Nodal locations are given by $x_E$ for the elastic (storage)
solid and $x_L$ for the loss solid. Computation of forces due to the loss
solid is made assuming that the loss solid's current shape, $x_L(t)$, is its
relaxed (unloaded) shape and its deformed shape, $x_E(t)$, is the current
deformed shape of the hyperelastic solid. Rivlin expansions in the strain
invariants are used for the energy density functions, $W_E$ (elastic) and
$W_L$ (loss), which are given by

$$W_k = \sum_i \sum_j C^k_{ij}\,(I_1 - 3)^i (I_2 - 3)^j, \qquad k = E, L, \qquad (1)$$

where

$$I_1 = \lambda_1^2 + \lambda_2^2 + \lambda_3^2, \qquad I_2 = \lambda_1^2\lambda_2^2 + \lambda_2^2\lambda_3^2 + \lambda_3^2\lambda_1^2, \qquad I_3 = \lambda_1^2\lambda_2^2\lambda_3^2,$$

and $\lambda_1$, $\lambda_2$, $\lambda_3$ are the principal stretch ratios determined from the
deformed and undeformed kinematics. At any time, t, the forces deforming
the loss solid are given by

$$f_L(t) = \frac{\partial W_L(x_E(t), x_L(t))}{\partial x_L} . \qquad (2)$$

Our model assumes that the viscous forces which resist the deformation of
the loss solid are proportional to the time rate of change of the
deformation forces. That is, the equilibrium equation for the loss solid
(this equation also defines the dash pot) is

$$-\eta\,\frac{d}{dt}\left(\frac{\partial W_L}{\partial x_L}\right) = -\eta\,\frac{\partial^2 W_L}{\partial x_L^2}\,\dot{x}_L = \frac{\partial W_L}{\partial x_L}, \qquad (3)$$

where $\dot{x}_L = dx_L/dt$ and $\eta$ = the viscous proportionality constant. If $x_E(t)$ is
specified, then equation (3) is a nonlinear differential equation which can
be integrated by discretizing $x_E(t)$ with a time dependent step function
and then solving a series of relaxation problems by integrating equation (3)
in each of the discretized step function intervals. Both scalar and
axisymmetric finite element versions of this method were formulated and
used to compute uniaxial cyclic deformations in Ref. 1.
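The step-function integration strategy can be sketched for a scalar model. The example below is a simplified illustration, not the formulation of Ref. 1: it assumes a quadratic loss energy $W_L = \tfrac{1}{2}k(x_E - x_L)^2$, for which equation (3) reduces on each interval of constant $x_E$ to $d(x_E - x_L)/dt = -(x_E - x_L)/\eta$, so the relaxation over one interval is an exact exponential decay. The function name and arguments are hypothetical.

```python
import math


def relax_loss_solid(x_E_steps, dt, eta, x_L0=0.0):
    """Track the loss-solid node x_L through a step-function history of x_E.

    x_E_steps: values of x_E, each held constant over an interval of length dt.
    Assumes W_L = 0.5*k*(x_E - x_L)**2, so that equation (3) becomes
    d(x_E - x_L)/dt = -(x_E - x_L)/eta within each interval
    (the stiffness k cancels out of the relaxation law).
    Returns the value of x_L at the end of every interval.
    """
    x_L = x_L0
    decay = math.exp(-dt / eta)
    history = []
    for x_E in x_E_steps:
        gap = (x_E - x_L) * decay  # exact relaxation of the gap over one interval
        x_L = x_E - gap
        history.append(x_L)
    return history
```

Holding $x_E$ fixed for many intervals drives $x_L$ toward $x_E$, that is, the loss solid relaxes fully and stops carrying load. For a nonlinear Rivlin-type $W_L$ the exponential update would have to be replaced by a numerical integration of (3) within each interval.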


THICK  WALLED  VISCO-HYPERELASTIC  SPHERE;  SCALAR  MODEL 


The  classical  pressure  -  radius  equation  for  the  inflation  of  a  thick 
walled  incompressible  hyperelastic  sphere  is  presented  in  Ref.  3.  The 
geometric  definitions  for  this  static  inflation  problem  are  shown  in  Figure 
2,  and  the  pressure  -  internal  radius  relationship  can  be  summarized  as 
follows. 


$$ p = p(r_1, R_1) = 2 \int_{Q_1}^{Q_2} \left[ (Q^3 + 1)\,\Phi + \left(Q + \frac{1}{Q^2}\right)\Psi \right] dQ \qquad (4) $$

where $Q_i = r_i / R_i$; i = 1,2; $\Phi = \partial W / \partial I_1$; $\Psi = \partial W / \partial I_2$;
$W = W(I_1, I_2)$ = the hyperelastic energy function of the material
and p = internal pressure.


To  demonstrate  how  viscous  effects  with  simple  memory  can  be  modeled  on 
this  classical  hyperelastic  inflation  problem,  we  used  the  network  model 
shown  in  Figure  3.  The  equilibrium  equations  are  written  by  balancing 
pressure  at  the  internal  surface  of  the  sphere  in  the  deformed  geometry. 
With  the  first  variable  in  the  list  of  arguments  representing  the 
undeformed  geometry  and  the  second  being  the  deformed  geometry,  we  can 
write  the  following  equilibrium  equations  for  the  network  in  Figure  3. 

Figure 1. A simple visco-hyperelastic solid.
($\eta$ = nonlinear viscous element; $x_E$, $x_L$ = nodal location of coordinates.)

Figure 2. Classical inflation on an incompressible hyperelastic sphere.
($r_2 = (r_1^3 + 3V_0/4\pi)^{1/3}$; $R_2 = (R_1^3 + 3V_0/4\pi)^{1/3}$; p = internal pressure;
$V_0$ = initial volume of material = constant (incompressible).)


1  1 

dt”  “  '  “  PL(R1L*  R1E) 

(5) 


dR 

dT 


lE  -  -  ^  [p<c>  -  WR1E>  +  P1*B1L’R1E*' 


where $P_i(r_i, R_i)$ = the pressure at the internal surface of the sphere,
i = E, L,

p(t) = the applied internal pressure at time t,

$\eta$ = the damping proportionality constant (see eq. 3),

$\eta^*$ = a small damping constant selected so that the
force generated in the dash pot parallel with the
elastic sphere is small when compared to the other
forces in the network.


Equation  (5)  was  integrated  for  the  case  of  a  Neohookean  material. 
The  form  of  the  applied  pressure  is  shown  in  Figure  4  along  with  the 
resulting  forces  in  the  elastic  and  loss  solids. 

THICK  WALLED  HYPERELASTIC  SPHERE: 

FINITE  ELEMENT  MODEL 


Consider the symmetric deformation of a thick walled sphere and let
$\lambda_1$ = the radial stretch and $\lambda_2 = \lambda_3$ = the hoop stretches. To demonstrate
the use of internal hyperelastic solids for rubber viscoelasticity we
construct a potential energy model for the inflation of a thick walled
sphere in which the incompressibility constraint is enforced with a
penalty. The incompressibility constraint requires $\lambda_2 = \lambda_3 = \lambda_1^{-1/2}$.
With $\lambda = \lambda_1$ as the only unknown variable, the strain invariants become

$$ I_1 = \lambda^2 + \frac{2}{\lambda}, \qquad I_2 = 2\lambda + \frac{1}{\lambda^2} \qquad (6) $$

Equation (6) is used in equation (1) to express the internal strain energy
in terms of radial stretch.
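
As a minimal sketch of this substitution, the Rivlin expansion can be evaluated directly from the radial stretch. The sketch assumes a radial stretch lam with hoop stretches lam**-0.5, so that I_1 = lam^2 + 2/lam and I_2 = 2 lam + 1/lam^2; the dictionary layout for the coefficients C_ij and the function names are illustrative choices, not the authors' code.

```python
def invariants(lam):
    """Strain invariants for radial stretch lam and hoop stretches lam**-0.5."""
    i1 = lam ** 2 + 2.0 / lam
    i2 = 2.0 * lam + 1.0 / lam ** 2
    return i1, i2


def rivlin_energy(lam, coeffs):
    """Rivlin expansion: sum of C_ij * (I1 - 3)**i * (I2 - 3)**j.

    coeffs maps index pairs (i, j) to the material constants C_ij.
    """
    i1, i2 = invariants(lam)
    return sum(c * (i1 - 3.0) ** i * (i2 - 3.0) ** j
               for (i, j), c in coeffs.items())


# Neohookean material: only C_10 is nonzero (value chosen for illustration).
neohookean = {(1, 0): 0.5}
# Mooney-Rivlin material: C_10 and C_01 (values chosen for illustration).
mooney_rivlin = {(1, 0): 0.5, (0, 1): 0.1}
```

The energy vanishes in the undeformed state (lam = 1), where both invariants equal 3.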

Next, we discretize the undeformed, r, and deformed, R, radial
coordinates and map them both to $\xi \in (0,1)$ as follows

$$ r(\xi) = (r_i \;\; r_{i+1}) \begin{pmatrix} 1 - \xi \\ \xi \end{pmatrix} = \mathbf{r}^T \boldsymbol{\varphi} $$

$$ R(\xi) = (R_i \;\; R_{i+1}) \begin{pmatrix} 1 - \xi \\ \xi \end{pmatrix} = \mathbf{R}^T \boldsymbol{\varphi} \qquad (7) $$

Figure 3. Network model for a scalar pressurized visco-hyperelastic sphere
with scalar dash pots.

Figure 4. Pressure vs time curves for a viscous Neohookean sphere being ...


Then the internal energy is given by

$$ U = \sum_{\text{Elements}} 4\pi \int_0^1 W(r,R)\; r^2 \frac{dr}{d\xi}\, d\xi \qquad (8) $$

We have, from the geometry and volume constraints,

$$ \lambda_2 = \lambda_3 = \frac{R}{r} = \frac{1}{\sqrt{\lambda}} \qquad (9) $$

Then,

$$ W(r,R) = W(\lambda) \qquad (10) $$

and the element gradient, with respect to the unknown deformed coordinates
$R_e$ (where e = element number) for the element becomes,


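$$ g_e = \frac{\partial U_e}{\partial R_e} = 4\pi \int_0^1 \frac{\partial W}{\partial \lambda}\, \frac{\partial \lambda}{\partial R_e}\; r^2 \frac{dr}{d\xi}\, d\xi \qquad (11) $$

Then,

$$ g_e = -8\pi \int_0^1 \frac{\partial W}{\partial \lambda}\, \frac{r^4}{R^3}\, (r_{i+1} - r_i)\, \boldsymbol{\varphi}\, d\xi \qquad (12) $$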
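Similarly, the element tangent matrix becomes

$$ k_e = 4\pi \int_0^1 \left[ \frac{\partial^2 W}{\partial \lambda^2} \left( \frac{\partial \lambda}{\partial R} \right)^2 + \frac{\partial W}{\partial \lambda}\, \frac{\partial^2 \lambda}{\partial R^2} \right] r^2\, (r_{i+1} - r_i)\, \boldsymbol{\varphi} \boldsymbol{\varphi}^T\, d\xi $$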

The element gradients and tangents were evaluated using three point
numerical integration. We added the square of the volume error times a
penalty number (1000 times $C_{10}$) to enforce incompressibility as follows.

$$ U_e = \frac{\chi}{2} \left( (R_{i+1}^3 - R_i^3) - (r_{i+1}^3 - r_i^3) \right)^2 $$

where $\chi$ = penalty parameter (1000 * $C_{10}$).

The terms to be added to the gradient and tangent become

$$ g_e = \chi \left( (R_{i+1}^3 - R_i^3) - (r_{i+1}^3 - r_i^3) \right) \begin{pmatrix} -3R_i^2 \\ 3R_{i+1}^2 \end{pmatrix} $$

and

$$ k_e = \chi \begin{pmatrix} 9R_i^4 - 6R_i G & -9R_i^2 R_{i+1}^2 \\ -9R_{i+1}^2 R_i^2 & 9R_{i+1}^4 + 6R_{i+1} G \end{pmatrix} $$

where $G = (R_{i+1}^3 - R_i^3) - (r_{i+1}^3 - r_i^3)$.
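
The penalty terms for one element can be sketched and checked against finite differences as follows. The sketch assumes the penalty energy is one half of the penalty parameter times the square of the volume error G = (R_{i+1}^3 - R_i^3) - (r_{i+1}^3 - r_i^3); the factor of one half is inferred from the form of the tangent matrix, and the function names and the symbol `chi` for the penalty parameter are illustrative.

```python
def volume_error(R, r):
    """G: deformed minus undeformed shell volume, without the 4*pi/3 factor."""
    return (R[1] ** 3 - R[0] ** 3) - (r[1] ** 3 - r[0] ** 3)


def penalty_energy(R, r, chi):
    """Assumed penalty energy: 0.5 * chi * G**2 for one element."""
    return 0.5 * chi * volume_error(R, r) ** 2


def penalty_gradient(R, r, chi):
    """Derivative of the penalty energy with respect to (R_i, R_{i+1})."""
    g = volume_error(R, r)
    return [-3.0 * chi * g * R[0] ** 2, 3.0 * chi * g * R[1] ** 2]


def penalty_tangent(R, r, chi):
    """Second derivative (2 x 2 tangent) of the penalty energy."""
    g = volume_error(R, r)
    off = -9.0 * chi * R[0] ** 2 * R[1] ** 2
    return [[chi * (9.0 * R[0] ** 4 - 6.0 * R[0] * g), off],
            [off, chi * (9.0 * R[1] ** 4 + 6.0 * R[1] * g)]]
```

Here `R` and `r` are two-entry lists holding the deformed and undeformed nodal radii of the element. Comparing the gradient and tangent with central differences of the energy is a cheap way to catch sign or index errors before assembling the global system.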

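Finally, the work done by the internal pressure is

$$ W_P = \frac{4}{3}\, \pi\, p\, (R_1^3 - r_1^3) $$

with contributions to the global gradient and tangent of

$$ g_P = p \begin{pmatrix} 4\pi R_1^2 \\ 0 \\ \vdots \end{pmatrix}, \qquad k_P = p \begin{pmatrix} 8\pi R_1 & 0 & 0 & \cdots \\ 0 & 0 & 0 & \cdots \\ \vdots & & & \end{pmatrix} $$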

The  above  model  was  used  to  numerically  simulate  the  inflation  of  a  thick 
walled  sphere  and  is  compared  graphically  to  the  exact  solution  (using 
finite  elements)  in  Figure  5. 

Figure 5. Finite element solution vs analytical solution for inflation
of a thick walled Neohookean sphere. (Axis label: deformed internal radius.)


INFLATION  AND  DEFLATION  OF  A  VISCO-HYPERELASTIC  SPHERE 

Using  the  network  model  in  Figure  3  with  configuration  vectors  at  the 
nodes  and  dashpots  proportional  to  the  tangent  strain  energy  matrices,  we 
can  write  equilibrium  equations  for  the  finite  element  model  as  follows. 


(18) 
In general, the selection of $\eta$ and $\eta^*$ should be made by matching
hysteretic or relaxation data after the quasi-static elastic energy
function, $W_E$, has been determined. As in the scalar model, we are using $\eta^*$
here so that equation (18) maintains a form which can be easily integrated
(e.g., using the Runge-Kutta method). Obviously, additional internal solids
can  be  added  to  increase  the  accuracy  of  this  method  (a  generalized 
nonlinear  Maxwell  model).  To  demonstrate  the  model,  we  computed  one 
inflation  and  deflation  of  a  visco-NeoHookean  sphere.  The  pressure  vs  time 
curves  for  the  loss  and  elastic  solids  are  shown  in  Figure  6. 

CONCLUSION 


A  nonlinear  visco-hyperelastic  solid  model  was  proposed  and  used  to 
simulate  the  inflation  and  deflation  of  a  visco-Neohookean  sphere.  The 
method  was  easy  to  program  and  no  difficulties  were  encountered  integrating 
the differential equations. As pointed out in Ref. 1, the Runge-Kutta
updates  for  the  elastic  solid  consist  of  Newton-Raphson  steps  generated 
from  a  potential  energy  function.  These  steps  must  be  small  to  be  useful. 
Further  research  is  being  done  to  investigate  the  sensitivity  of  the 
numerical  computations  to  the  values  of  the  parameters  in  the  model. 
Figure 6. Pressure vs time curves for a viscous Neohookean sphere being
inflated and deflated - finite element solution with nonlinear
dash pots.
ABSTRACT

The dynamic overturning response of Army vehicles with shelters and trailers, as well as of
overturn-susceptible structures, has been of considerable interest to the defense community, since critical
structures and internal equipment can be damaged, resulting in system malfunction and a reduction of
vehicle performance that jeopardizes the primary mission objective. To overcome the vulnerability of vehicles
and associated equipment to overturning, the Army is actively engaged in a hardening program which
will result in overturn mitigation and increased survivability. Flexible multibody dynamics programs
can  be  used  to  predict  overturning  response  of  structures  due  to  transient  overpressure  loading  and 
facilitate  evaluation  of  mitigation  devices  such  as  outriggers,  cables  and  guy  wires.  However,  accuracy 
of  overturning  prediction  is  dominated  by  the  loading  model  during  the  drag  phase  when  the  structure 
becomes  unstable.  To  assess  validity  of  the  loading  model,  accuracy  of  drag  coefficients  as  a  function 
of  the  roll  angle  and  flow  velocity  should  be  evaluated.  The  current  investigation  is  devoted  to  a 
comparative  evaluation  of  drag  coefficients  used  in  loading  models  in  overturning  response  codes  as 
well  as  any  experimental  data  that  may  be  available. 


INTRODUCTION 

Research  in  vehicle  dynamics  started  with  Olley  [1]  and  Segal  [2]  in  the  area  of  linear  vehicle 
dynamics  which  was  followed  by  several  others  [3-5].  The  first  vehicle  models  allowed  between  two 
and  four  degrees  of  freedom.  Closed  form  analytical  solutions  for  critical  speed  were  obtained  for  these 
models  for  certain  maneuvers.  However,  the  solution  techniques  could  not  be  extended  to  real  vehicles 
due  to  complexity  of  the  vehicle  models  and  the  transient  response  characteristics. 

With increased computational capabilities, the vehicle models became increasingly complex.
Nonlinear tire models replaced linear models to describe the tire forces more accurately [6]. Eventually,
very complex nonlinear vehicle models could be generated to simulate the dynamic response of vehicles
with additional degrees of freedom [7]. An overview of these developments is given by Ellis [8].
However, these studies were devoted to a rather limited class of vehicles. The equations of motion for these
vehicles were derived manually and then transformed to computer code to be numerically integrated.
Although a simple vehicle could be simulated in this manner, creating larger representative models
and computing their response is rather difficult and laborious.

Recently, flexible multibody dynamics software has been developed which automatically
formulates the equations of motion for many types of mechanical systems. These programs require the user to
define the mass and inertial properties of all rigid as well as flexible bodies in the system and the type
of joints that connect the bodies to one another. In addition, force elements can be prescribed
to  simulate  springs,  dampers  and  actuators  between  rigid  or  flexible  bodies.  This  involves  defining  the 
spring  mass  and  suspension  system  elements  connected  with  ideal  joints  or  bushings.  The  springs  and 
shocks  are  modeled  as  force  elements. 

Multibody dynamics programs can be divided into two separate groups. The first type numerically
formulates the equations of motion at each time step during the numerical integration process. ADAMS
[9] and DADS [10-12] are multibody dynamics programs which numerically formulate and solve the
equations of motion. The second group of programs symbolically formulates the equations of motion,
which can then be numerically integrated. The NEWEUL [13-15] and MESA VERDE [16] codes belong
to  this  group.  Other  differences  exist  between  all  of  these  programs  including  coordinate  selection, 
integration  method,  and  output  format.  Schiehlen  et  al  [14]  obtained  a  good  comparison  of  the 
performances  of  these  programs  as  related  to  the  vehicle  dynamics  field. 

Although  many  of  these  programs  have  a  high  potential  for  modeling  wheeled  vehicle  dynamics, 
their  capabilities  are  currently  limited  due  to  a  lack  of  appropriate  forcing  functions  to  which  the 
vehicle may be subjected in a battlefield. The majority of commercially available flexible multibody
dynamics codes address the needs of the vehicle industries, which are interested primarily in the
dynamic response due to a sudden change in road friction caused by rain, snow or accidental spillage, or
in the influence of rough terrain with bumps or discontinuities upon vehicle stability. Researchers at
the University of Iowa have modeled the ride characteristics of several vehicle combinations [11,12].
The majority of these models did not contain any lateral response dynamics.

Lee,  Hobbs  and  Atkinson  [17]  have  developed  a  computer  program  TRUCK  3.1  to  find  the  dynamic 
response  of  a  vehicle  subjected  to  different  types  of  loads  including  such  intensive  loads  as  blast  waves 
from  conventional  or  nuclear  explosives.  The  code  yields  gross  motions  of  the  vehicle  body,  and  of  tires, 
axles,  shelters,  and  racks  relative  to  the  vehicle  body.  Large  motions,  including  sliding  and  overturning 
of  the  vehicle,  are  permitted.  However,  each  individual  element  like  an  axle,  shelter  or  rack  is  treated 
as a rigid body. Recently, the governing equations of motion used in the TRUCK code that model the
rigid body motions of the vehicle assembly and its components, and the modelling of the frictional force
between the tires and the ground, have been validated by Batra [18] under Ballistic Research Laboratory
(BRL) sponsorship.

This  paper  is  devoted  to  an  evaluation  of  drag  coefficients  in  the  drag  phase  loading  models 
currently  installed  in  the  MINITRUCK  code  as  well  as  the  Overturning  Code  [19]  developed  at  BRL 
and  validation  with  respect  to  some  experimental  data  for  two  specific  Army  vehicles  obtained  by 
S-CUBED  [20]  Inc.  under  BRL  sponsorship.  Commercially  available  flexible  multibody  dynamics 
programs  do  not  include  blast  overpressure  loading  at  present.  However,  capability  exists  in  these  codes 
to  interface  with  user-supplied  loading  routines.  Accurate  prediction  of  vehicle  overturning  response 
from  these  codes  will  depend  on  the  development  of  reliable  blast  loading  models  for  incorporation 
into  these  codes  and  response  validation  using  simple  generic  problems  with  known  solutions. 
STRUCTURAL MODELING


The complexity of the mathematical model selected to represent a structure is dictated by
the accuracy of the information to be derived from the model. Accurate analyses of wheeled
vehicles require adequate representation of the tire and suspension systems, which dominate vehicle
response. Additionally, accurate loading functions need to be developed, with the capability to impose these
loads upon the vehicle model in a realistic manner. Since most multibody dynamics programs do
not currently address these problems in sufficient detail, it is incumbent upon the user to produce
loading subroutines and interface them with an existing program. Complex kinematic models suited
for multibody modeling require a large amount of geometric and physical data. Any gains due to
increased model complexity may be lost due to a lack of accurate input and loading data. In spite of
these drawbacks, flexible multibody dynamics programs have several advantages for vehicle modeling
applications where a large number of kinematically constrained members and complex kinematic
relationships for suspension systems and joints are involved.


CODE  DESCRIPTION 

The  MINITRUCK  [21]  code  is  a  simplified  two-dimensional  version  of  the  TRUCK  code  developed 
by  Kaman  AviDyne  to  predict  the  response  of  a  variety  of  Army  vehicles  to  blast  waves.  The  program 
models  the  five  major  degrees-of-freedom  which  include  roll,  sideslip,  heave,  and  two  suspension  related 
degrees-of-freedom.  The  vehicle  is  assumed  to  be  initially  at  rest  on  level  ground.  Only  a  side-on  blast 
encounter  from  the  roadside  is  allowed  for  the  2-D  version.  Capabilities  include  modeling  regular  or 
independent suspension, flexible or rigid outriggers and guy wires. Equivalent properties are used for
characterizing many vehicle components. Thus the stiffnesses of all axle springs on one side of a vehicle are
represented by one equivalent spring. In the same manner, masses are represented by equivalent lumped
mass models. This is due to the 2-D MINITRUCK structural model restriction. This representation
can best be visualized by compressing or collapsing a vehicle in the fore-and-aft direction until it is
entirely compressed in a vertical plane, normal to the shock front. During the response, all motions
are confined within this plane. The three-dimensional aerodynamic representation used in the TRUCK
code was retained.

The current MINITRUCK version allows parking the vehicle only on simple horizontal ground, which
excludes special positioning of a loaded field vehicle on inclined ground at the time of a blast
encounter. However, the effect of an inclined slope on vehicle overturning due to a blast load is rarely
encountered and requires complex ground modeling. The assumption of horizontal ground considerably
simplifies the theoretical formulation. The primary coordinate system is a body axis system
attached to the vehicle body with its origin at the center of gravity of the entire system.

The  Overturning  Code  [19]  developed  at  BRL  included  an  airblast  loading  model  for  closed 
cubical  box  type  structures  which  were  used  to  model  various  targets.  The  targets  were  subdivided 
into  a  convenient  number  of  closed  boxes  which  were  attached  to  each  other  to  represent  the  shapes 
of  the  targets.  To  allow  estimation  of  overturning  moments,  values  of  the  rolling  moment  coefficients 
measured  in  the  wind  tunnel  for  specific  truck-shelter  combinations  were  fitted  as  a  function  of  the 
angle  of  rotation  and  incorporated  in  the  code. 
AERODYNAMIC LOADING


The  loading  on  the  vehicle  results  from  the  blast  wave  from  a  nuclear  or  conventional  explosion. 
Associated with the blast wave are increased pressure, density and material velocity. Blast wave
characteristics  for  two  heights  of  burst  are  considered  in  these  codes.  The  first  one  corresponds  to 
a  height  of  burst  of  zero  representing  the  sea  level  and  the  second  corresponds  to  a  burst  height  of 
$60\,W^{1/3}$ m based upon BRL data. The two blast wave characteristics are specified in tabular form for
a  1  KT  weapon  yield  at  sea  level  and  the  results  are  applied  for  other  weapon  yields  and  conditions 
other  than  sea  level  using  the  built-in  modified  Sachs  scaling  laws.  The  nominal  range  of  the  structure 
from  the  burst  point  or  the  peak  overpressure  behind  the  shock  needs  to  be  specified. 

The  blast  wave  front  is  assumed  to  be  normal  to  the  ground  surface,  and  the  vehicle  is  assumed 
initially  normal  to  the  ground;  shock  encounter  can  be  from  any  orientation.  As  the  blast  wave  envelops 
the  vehicle,  the  wave  reflects  from  the  vehicle  surface  and  rarefaction  waves  emanate  from  free  edges 
to  relieve  the  reflected  pressure.  At  later  times,  the  loading  is  essentially  a  drag  type  loading,  resulting 
from the material velocity associated with the blast wave. The development of the aerodynamic loading
is separated into two phases, namely diffraction loading and drag loading.


Diffraction  Loading 

The  diffraction  loading  phase  begins  with  the  shock  impinging  upon  the  vehicle.  As  the  vehicle 
is  enveloped,  the  shock  wave  reflects  from  the  vehicle  surfaces  and  rarefaction  waves  emanate  from 
free edges to relieve the reflected pressure. The diffraction loading is based upon shock tube
experiments in which front and back face pressures on the structural component were measured for a shock
front  normally  incident  on  the  front  face  [22].  For  MINITRUCK  only  front  face  normal  incidence  is 
considered  and  diffraction  loadings  for  arbitrary  incidence  angles  are  excluded. 

It is assumed that the vehicle moves very little during the diffraction period, so that the shock
wave and the vehicle are expected to remain normal to the ground during this period of time. This
assumption  limits  the  number  of  configurations  that  can  be  addressed. 

The  basic  model  for  the  diffraction  loading  describes  waves  emanating  from  the  free  edges  of 
the  aerodynamic  boxes  representing  the  exposed  vehicle  surfaces.  When  the  undisturbed  shock  front 
reaches an edge point, a relief wave emanates which leads to an exponential decay in the time-varying
pressure  loading  applied  to  the  faces. 

Drag  Loading 

When  the  blast  wave  first  encounters  and  engulfs  the  target,  the  overturning  moment  due  to 
shock  reflection  and  diffraction  process  is  large,  and  drag  loading  is  not  considered.  After  the  shock 
front  traverses  the  target,  the  drag  loading  moment  relative  to  the  diffraction  loading  moment  becomes 
significant.  Thus  following  the  diffraction  phase,  the  pressure  loading  becomes  a  drag  type  of  loading. 
Drag  loading  depends  primarily  on  the  dynamic  pressure  associated  with  the  material  velocity  behind 
the  shock  front.  When  the  drag  loading  moment  becomes  equal  to  or  larger  than  the  diffraction 
loading  moment,  the  diffraction  loading  phase  is  terminated,  and  only  the  drag  loading  moment  is 
used  in  calculating  further  response  of  the  target. 
The pressure loading during the drag phase for a non-decaying shock of strength $P_a$ is given by

$$ p(t) = P_a + C_p\, q \qquad (1) $$

where $P_a$ and q are the overpressure and the dynamic pressure associated with the blast wave.
The task of estimating the drag-phase loading thus becomes a matter of determining the pressure
coefficient, $C_p$. The pressure coefficient must be determined for arbitrary vehicle orientations for
various rectangular areas which make up the exterior surfaces of the box-like configuration while the
shock is assumed to be travelling parallel to the ground.


a.  MINITRUCK  Drag  Model 

There are many factors which can influence the value of $C_p$. In addition to $\alpha$, the angle
between the flow velocity and the surface normal, these variables may include Reynolds
number, Mach number, aspect ratio (i.e., ratio of the two dimensions of the rectangular surface), etc.
In MINITRUCK, expressions for the drag coefficient in a generalized form are given as

$$ C_p = d \cos\left(\tfrac{9}{8}\alpha\right) \qquad (2) $$

where $0 \le \alpha \le \alpha_1$. However, if $\alpha_1 < \alpha \le \pi$, then $C_p = -0.4$, and

$$ d = (1.00 - 0.15R) + (0.20 + 0.15R)\left(\frac{\psi}{3}\right)^2 \qquad (3) $$

The above equations account only for the $\alpha$-variation, and R in equation (3) refers to the aspect
ratio. The Mach number M is related to the shock strength ratio $\psi$ according to the Rankine-Hugoniot
relation

$$ M = \frac{5\psi}{\sqrt{(7 + 7\psi)(7 + \psi)}} \qquad (4) $$

At low Mach numbers and an aspect ratio of unity the equations are in general agreement with the
experimental data of Hankins [23] and Hoerner [24], which result in a drag coefficient of 1.25 for
a cuboid. In the MINITRUCK code the generalized drag coefficient model is necessitated by a lack of
extensive pressure distribution data for various surfaces at higher Mach numbers obtained from
wind-tunnel tests.
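
The MINITRUCK relations can be sketched as three small functions: the amplitude d as a function of aspect ratio R and shock strength ratio psi, the pressure coefficient C_p = d cos(9 alpha / 8) with the constant value -0.4 beyond the switch angle, and the Mach number M = 5 psi / sqrt((7 + 7 psi)(7 + psi)). The switch angle alpha_1 is not stated in this passage; the default of 7 pi / 12 used here is an assumption borrowed from the range quoted for the modified drag equation, and the function names are hypothetical.

```python
import math


def flow_mach(psi):
    """Flow Mach number behind a shock of strength ratio psi (gamma = 1.4 form)."""
    return 5.0 * psi / math.sqrt((7.0 + 7.0 * psi) * (7.0 + psi))


def amplitude(aspect_ratio, psi):
    """Amplitude d of the pressure coefficient for a given aspect ratio and psi."""
    return ((1.00 - 0.15 * aspect_ratio)
            + (0.20 + 0.15 * aspect_ratio) * (psi / 3.0) ** 2)


def pressure_coefficient(alpha, aspect_ratio, psi, alpha_1=7.0 * math.pi / 12.0):
    """C_p for an angle alpha (radians) between flow velocity and surface normal.

    alpha_1 is the switch angle; its default value is an assumption.
    """
    if not 0.0 <= alpha <= math.pi:
        raise ValueError("alpha must lie between 0 and pi")
    if alpha <= alpha_1:
        return amplitude(aspect_ratio, psi) * math.cos(9.0 * alpha / 8.0)
    return -0.4
```

For an aspect ratio of unity and a vanishing shock strength ratio, the amplitude evaluates to 0.85 at normal incidence.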


b. Overturning Code Drag Load

The drag loading model used in the Overturning Code [19] assumes that the target is a collection
of  cuboids.  The  drag  loading  on  each  cuboid  is  calculated  independently  of  the  others,  and  the  total 
drag  moment  is  calculated  by  summing  the  moments  calculated  for  each  cuboid.  The  drag  loading  on 
a  cuboid  is  assumed  to  be  zero  until  the  shock  front  arrives  at  the  center  line  of  the  cuboid.  Loading 
is  then  computed  using  appropriate  areas  corresponding  to  the  angle  of  rotation  of  the  target.  The 
characteristics  of  the  blast  wave  are  assumed  to  remain  unchanged  over  the  depth  of  the  target,  so 
that  the  same  free-field  blast  waveform  can  be  used  for  all  cuboids. 
The drag loading is divided into a horizontal and a vertical component, each of which depends
upon the dynamic pressure, horizontal or vertical area, constants corresponding to the drag coefficient of
the horizontal or vertical area at zero flow velocity, and the roll angle of rotation. The code multiplies
each component by the corresponding moment arm, and the product is summed over all cuboids to
calculate the overturning moment due to drag. The increase in horizontal and vertical drag with Mach
number is based on the approach of Hoerner [24], who describes the drag in subsonic flow of bluff
forms with flow entirely separated from the rear in terms of two components. These are a positive front
surface pressure proportional to the incident pressure, and a negative base pressure assumed to be
independent of Mach number. The relation, which was obtained by fitting wind tunnel data on block
configurations, can be expressed as:

$$ C_{Drag} = C_0\,(0.375 + 0.625\,(1 + 0.25 M^2)) \qquad (5) $$

where $C_0 = 1.26$ for a cube on a ground plane and $C_0 = 1.15$ for a rectangular parallelepiped on
a ground plane, with width to height ratios of 2 or 4.


c.  Modified  Drag  Load 

It  is  rather  difficult  to  obtain  a  generalized  drag  equation  which  will  accurately  predict  drag 
coefficients  as  a  function  of  the  roll  angle  and  the  Mach  number  for  all  types  of  structural  configurations 
since  these  coefficients  are  very  sensitive  to  geometrical  configurations,  surface  smoothness  and  lateral 
dimensions  of  the  target  body.  Although  diffraction  loading  influences  initial  response  of  structures, 
the  onset  of  overturning  in  most  cases  occurs  at  a  later  stage  which  is  largely  dominated  by  loading 
during  the  drag  phase.  As  a  result  reliable  prediction  of  overturning  is  directly  dependent  on  accurate 
prediction  of  drag  coefficients. 

Computational  fluid  dynamics  codes  have  been  developed  to  reliably  predict  transient  structural 
loading  due  to  a  propagating  wave  during  the  diffraction  phase.  However,  serious  numerical  difficulties 
occur during the drag phase at later times, which preclude the use of these codes for drag loading
prediction.  A  preferred  alternative  is  to  generate  a  scaled  mechanical  model  for  each  structure  or 
vehicle  and  experimentally  obtain  drag  force  and  moment  coefficients  from  wind  tunnel  tests  at  various 
orientations  along  roll,  pitch  and  yaw  directions  relative  to  the  wave  front.  This  method  is  expensive 
since  the  experiment  has  to  be  repeated  several  times  at  various  combinations  of  Mach  numbers  and 
roll,  pitch  and  yaw  angles  until  overturning  is  ensured.  However,  it  is  the  only  reliable  and  fully 
acceptable  technique  of  drag  loading  assessment  at  present. 

In the absence of experimental data for most recently developed Army vehicles and equipment, a
modified drag loading equation based on a combination of useful features of the previous two approaches
can be deployed as shown below:

$$ C_{Drag} = C_1 \cos\left(\tfrac{9}{8}\alpha\right) \left[ 0.375 + 0.625\,(1 + 0.25 M^2) \right] \qquad (6) $$

where $C_1 = 1.2$ for a rectangular parallelepiped on a ground plane with width to height ratios of
2-4.

The equation above has a constraint condition on the roll angle $\alpha$ such that $0 \le \alpha \le 7\pi/12$.
Beyond this range the coefficient is taken to be a constant, $C_{Drag} = -0.4$, provided
we have $7\pi/12 < \alpha \le \pi$.
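
A minimal sketch of the two Mach-dependent drag relations follows: the Overturning Code form C_0 (0.375 + 0.625 (1 + 0.25 M^2)) and the modified form, which multiplies the same Mach factor by C_1 cos(9 alpha / 8) for roll angles up to 7 pi / 12 and returns -0.4 beyond that. The function names are illustrative, and angles are assumed to be in radians.

```python
import math

ALPHA_LIMIT = 7.0 * math.pi / 12.0  # upper roll angle for the cosine branch


def mach_factor(mach):
    """Mach number correction shared by both drag relations."""
    return 0.375 + 0.625 * (1.0 + 0.25 * mach ** 2)


def overturning_code_drag(mach, c0=1.26):
    """Overturning Code drag coefficient; c0 = 1.26 corresponds to a cube."""
    return c0 * mach_factor(mach)


def modified_drag(alpha, mach, c1=1.2):
    """Modified drag coefficient as a function of roll angle and Mach number."""
    if not 0.0 <= alpha <= math.pi:
        raise ValueError("alpha must lie between 0 and pi")
    if alpha <= ALPHA_LIMIT:
        return c1 * math.cos(9.0 * alpha / 8.0) * mach_factor(mach)
    return -0.4
```

At zero roll angle the modified coefficient evaluates to about 1.20 at M = 0.1 and 1.32 at M = 0.8, and the cosine branch changes sign at a roll angle of 80 degrees (alpha = 4 pi / 9).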


d.  Experimental  Drag  Load 

The  aerodynamic  force  and  moment  coefficients  are  essential  input  to  the  solution  of  any  airblast 
loading  and  response  problem.  Recently  S-CUBED  Inc.  [20]  derived  the  drag,  lift  and  moment 
coefficients  from  experimental  data  from  a  series  of  wind  tunnel  tests  on  scale  models  of  two  Army 
vehicles. 

The  roll  (or  overturn)  angle  dependency  and  to  a  lesser  extent,  the  Mach  Number  effect  can  be 
seen  when  the  moment  coefficients  due  to  side-on  exposures  to  a  number  of  steady  state  flow  field 
environments  are  obtained  as  a  function  of  the  normalized  angle  of  attack  for  a  particular  vehicle. 

The  aerodynamic  drag,  lift  and  moment  coefficients  were  shown  to  be  quite  sensitive  to  changes  in 
both  the  overturn  angle  and  the  aspect  ratio.  This  sensitivity  is,  to  a  large  extent,  introduced  artificially 
as  a  consequence  of  using  a  body-fixed  coordinate  system  and  could  be  reduced  by  performing  two 
operations  on  the  data.  First,  the  dependence  on  overturn  angle  is  suppressed  by  transforming  the 
coefficients  to  a  non-rotating  coordinate  system.  The  geometry  effects  are  reduced  by  normalizing  the 
overturn  angle  to  an  angle  defined  by  the  roll  position  of  the  system  when  the  projected  side-on  area 
is  a  maximum. 

As shown in Figure 1, the aerodynamic drag coefficient from wind tunnel tests on a 1-1/4 ton
truck scale model subjected to a side-on blast overpressure typically exhibits an initial increase at small
roll angles followed by a rapid decrease at higher angles of attack beyond 20 degrees. The drag appears
to decrease with increased Mach number until the roll angle equals or exceeds 60 degrees, at which point
the Mach number dependency of the drag coefficient is diminished considerably. For accurate estimation of the
drag coefficient, the experiment has to be repeated for each vehicle model at various flow velocities at
both low and high Mach numbers and roll angles until occurrence of overturning is ensured.

COMPARISON  OF  DRAG  LOADING  MODELS 

The variation of drag coefficient with roll or overturning angle currently used in the MINITRUCK code has been compared with the modified drag loading model from the Overturning Code, as shown in Figure 2. Additionally, the dependence of drag on flow velocity and roll angle has been compared with the MINITRUCK drag model, as shown in Figure 1. This second comparison uses desensitized aerodynamic coefficient data from S-CUBED Inc. for two specific Army vehicles, obtained by transforming the coefficients to a non-rotating coordinate system.

The MINITRUCK drag model in both Figures 1 and 2 has no dependence on flow velocity, since the simplified semi-empirical drag equations described earlier ignore the influence of Mach number on drag coefficients. In Figure 2 both models describe a gradual decrease of the drag coefficient from the initial configuration to a roll angle of approximately 80 degrees. Beyond this angle the drag coefficients appear to have small negative values for both models. However, when the roll angle exceeds 100 degrees, the drag is assumed to be constant, since at these large angles overturning is fully ensured in most cases.

Comparison of numerical values of the drag coefficient at small roll angles shows a considerable difference between the two models. At the initial configuration, with zero roll angle, the coefficient varies from 1.33 to 1.2 as the Mach number varies from 0.1 to 0.8 for the modified Overturning Code drag model, while the corresponding MINITRUCK drag value is approximately 0.85. This difference is rather large and can exert significant influence upon the overturning response prediction. As a result, the large difference is of concern even though it diminishes rapidly with increasing roll angle until approximately 80 degrees, where the data appear to converge at zero drag.

Figure 1: Comparison of S-CUBED experimental drag data with MINITRUCK drag model.

Figure 2: Comparison of MINITRUCK drag with modified Overturning Code drag model.

Comparison of drag in Figure 1 between the semi-empirical model in the MINITRUCK code and the experimental data from S-CUBED shows a large difference in the magnitude of the drag coefficients between the two sets of curves, similar to the observation in Figure 2. However, comparison of drag between the experimental data and the modified Overturning Code loading model shows very satisfactory agreement, with one exception: the drag data initially tend to increase with roll angle until the drag reaches a peak value at an angle of 10 degrees, beyond which it decays gradually as expected, while the modified model does not exhibit such a trend. The MINITRUCK drag loading model is in substantial disagreement with available data and other predictive models, indicating a probable lack of validity and a need for improvement of the currently installed MINITRUCK loading capability.

The agreement between the experimental data and the modified model is rather unexpected, in view of the fact that the data pertain to two specific Army vehicles while the modified model is a generalized predictive model. The validity of such a model cannot be established without conducting an extensive number of experiments on various vehicles and structures and comparing the data with the modified model. Such a study will facilitate evaluation of an experimentally based correlation factor which can improve the capability of the modified loading model. However, the interaction effect of various components of the structure upon the drag loading cannot be predicted accurately by a generalized model, and experimental drag loading data are deemed to be vital for overturning response computation of complicated structures and vehicles which can be dominated by such effects.

DISCUSSION  AND  RECOMMENDATION 

Comparison of various drag loading models with experimental data is necessary to determine the accuracy of prediction of vehicle overturning under airblast loading. In most cases the drag loading on each rectangular parallelepiped or cuboid aerodynamic configuration is calculated independently of the others, and the total drag moment is calculated by summing the moments calculated as a function of the dynamic pressure, facing area, the moment arm and the angle of rotation for each block. As a result, the interaction effects of adjacent blocks upon one another are ignored during the computation. In structural assemblies where some blocks are located behind others along the direction of wave propagation, significant error in the loading calculation will occur because the overlapped regions are not taken into account. The linear summation method of loading assessment can result in rather excessive load estimation, contributing to incorrect overturning prediction. Any proposed improvement in prediction of overturning response must include accurate estimation of drag loading that accounts for mutual interaction effects of adjacent structural components upon the drag coefficient.
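
The linear summation described above can be sketched in a few lines. This is only an illustration and not the MINITRUCK or Overturning Code implementation: the `Block` fields, the single drag coefficient `cd`, and the cosine projection of the facing area with roll angle are assumptions introduced here, since the text does not give the per-block moment formula.

```python
import math
from dataclasses import dataclass


@dataclass
class Block:
    facing_area: float  # side-on facing area at zero roll (assumed input), m^2
    moment_arm: float   # height of the block centroid above the pivot (assumed), m


def block_moment(block, dynamic_pressure, cd, roll_deg):
    """Hypothetical per-block drag moment: M = q * Cd * A * cos(theta) * h."""
    projected = block.facing_area * math.cos(math.radians(roll_deg))
    return dynamic_pressure * cd * projected * block.moment_arm


def total_drag_moment(blocks, dynamic_pressure, cd, roll_deg):
    """Linear summation: every block is treated as if it stood alone in the
    flow, so shielding of downstream blocks by upstream ones is ignored."""
    return sum(block_moment(b, dynamic_pressure, cd, roll_deg) for b in blocks)
```

With two identical blocks placed one behind the other, `total_drag_moment` returns exactly twice the single-block moment, which illustrates how the method can overestimate the load when one block shields another.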

In summary, a major shortcoming in computational modeling and accurate prediction of overturning response is the lack of a reliable drag function to use in the flexible multibody dynamic response programs currently available. Because current computational fluid dynamics codes cannot predict accurate drag functions of structures subjected to side-on overpressures at late response times, there is a need to conduct an experimental effort to obtain drag functions of various structures susceptible to overturning. The effort is expected to include model studies of vulnerable structures, from generators to tanks, in wind tunnels at various roll and pitch angles and at small as well as large flow velocities in the subsonic range. The data obtained from such experiments on a variety of structures must be reduced to a form suitable for direct input to one or more of the currently available flexible body dynamics codes, and may constitute part of a library of standardized drag loading functions for a variety of frequently encountered structures and vehicle configurations commonly used by the Army.

Additionally, whole body motion of targets in terms of sliding and overturning could be studied in open-ended shock tubes. This should also include the case where the loading function consists of a rounded, relatively nonreflecting shock front, which has a very low side-on overpressure component combined with a very high stagnation pressure component and is of great interest to the Army. Initially, wooden cuboids like those used by Ethridge may be modeled and tested for sliding and overturning; the resulting data could be used to validate predictions from currently available flexible multibody dynamics codes. The side-on and stagnation overpressure versus time at the open end of the shock tube, at various longitudinal and radial locations, can be mapped for use as input data in appropriate prediction codes.
ABSTRACT. An elastic-plastic analysis of stresses and strains in an internally pressurized, composite-jacketed cylinder is presented here. Each layer is orthotropic but with different material properties. Analytical expressions are derived for a steel pressure vessel wrapped with multilayered composites. Numerical results are obtained for three types of composite jackets. The interface pressure, hoop strains, and stresses in the liner and jacket are presented.

INTRODUCTION. In recent years there has been increasing emphasis on the use of composite materials in armament structures. A current problem in Army cannon design is to replace a portion of the steel wall thickness with a lighter material. The inner portion, the steel liner, maintains the tube-projectile interface and shields the composite from the extremely hot gases. The outer portion, the composite jacket, is made of single or multilayered graphite-bismaleimide wound and wrapped on the steel liner. Two subscale models have been fabricated and tested [1,2]. An analytical elastic-plastic solution for the model with a single-layered composite jacket has been presented in a recent paper [3]. This paper covers an elastic-plastic analysis for the model with a multilayered composite jacket. Analytical solutions are presented separately for the composite jacket and steel liner and then for the compound cylinder problem. Numerical results are obtained for loading within and beyond the elastic region up to failure.


COMPOSITE JACKET. The composite jacket is made of n layers bounded by radii $(r_1, r_2, \ldots, r_n, r_{n+1})$. Each layer is elastically orthotropic but with different material properties. The strain-stress relations for the k-th layer in cylindrical coordinates are given by

$$\varepsilon_i^{(k)} = S_{ij}^{(k)} \sigma_j^{(k)} \qquad (i,j = r,\theta,z) \tag{2}$$

where $S_{ij}^{(k)}$ are components of the compliance matrix. The superscript k refers to the k-th layer. In plane-strain conditions, the above strain-stress relations modify to
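$$
\begin{pmatrix} \varepsilon_r^{(k)} \\ \varepsilon_\theta^{(k)} \end{pmatrix}
=
\begin{pmatrix} \beta_{rr}^{(k)} & \beta_{r\theta}^{(k)} \\ \beta_{r\theta}^{(k)} & \beta_{\theta\theta}^{(k)} \end{pmatrix}
\begin{pmatrix} \sigma_r^{(k)} \\ \sigma_\theta^{(k)} \end{pmatrix}
\tag{3}
$$

where

$$\beta_{rr}^{(k)} = \left(1 - \nu_{rz}^{(k)} \nu_{zr}^{(k)}\right) / E_r^{(k)}$$

$$\beta_{r\theta}^{(k)} = -\left(\nu_{\theta r}^{(k)} + \nu_{\theta z}^{(k)} \nu_{zr}^{(k)}\right) / E_\theta^{(k)}$$

$$\beta_{\theta\theta}^{(k)} = \left(1 - \nu_{\theta z}^{(k)} \nu_{z\theta}^{(k)}\right) / E_\theta^{(k)} \tag{4}$$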

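The normal traction acting on the interface between the (k-1)th and k-th layers is denoted by $q_k$. Then the general elastic solution for the k-th layer bounded by radii $(r_k, r_{k+1})$ and subjected to interface pressures $(q_k, q_{k+1})$ is given by [4]

$$\sigma_r^{(k)} = (-a_k q_k + c_k q_{k+1})(r_{k+1}/r)^{g_k+1} + (a_k q_k - b_k q_{k+1})(r/r_{k+1})^{g_k-1}$$

$$\sigma_\theta^{(k)} = g_k (a_k q_k - c_k q_{k+1})(r_{k+1}/r)^{g_k+1} + g_k (a_k q_k - b_k q_{k+1})(r/r_{k+1})^{g_k-1}$$

$$u^{(k)} = r\left(\beta_{r\theta}^{(k)} \sigma_r^{(k)} + \beta_{\theta\theta}^{(k)} \sigma_\theta^{(k)}\right) \tag{5}$$

where

$$d_k = r_{k+1}/r_k , \qquad g_k = \left(\beta_{rr}^{(k)} / \beta_{\theta\theta}^{(k)}\right)^{1/2}$$

$$c_k = \left(d_k^{2g_k} - 1\right)^{-1} , \qquad b_k = c_k d_k^{2g_k} , \qquad a_k = c_k d_k^{g_k-1} \tag{6}$$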

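At the two ends of the k-th layer the expressions for the displacements and hoop stresses are

$$u_{k+1} = (A_k q_k - B_k q_{k+1})\, r_{k+1}$$

$$u_k = (C_k q_k - D_k q_{k+1})\, r_k$$

$$\sigma_\theta^{(k)} = 2 a_k g_k q_k - (b_k + c_k) g_k q_{k+1} \quad \text{at } r_{k+1}$$

$$\sigma_\theta^{(k)} = (b_k + c_k) g_k q_k - 2 a_k d_k^{2} g_k q_{k+1} \quad \text{at } r_k \tag{7}$$

where

$$A_k = 2 a_k g_k \beta_{\theta\theta}^{(k)} , \qquad B_k = \beta_{r\theta}^{(k)} + (b_k + c_k) g_k \beta_{\theta\theta}^{(k)}$$

$$C_k = -\beta_{r\theta}^{(k)} + (b_k + c_k) g_k \beta_{\theta\theta}^{(k)} , \qquad D_k = 2 a_k d_k^{2} g_k \beta_{\theta\theta}^{(k)} \tag{8}$$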

At the interfaces $(r_k,\ k = 2, \ldots, n)$ the displacements should be continuous, and these conditions require

$$A_{k-1} q_{k-1} - B_{k-1} q_k = C_k q_k - D_k q_{k+1} \tag{9}$$

Let $Q_k = q_k/q_n$ for all k; then $Q_{n+1} = 0$, $Q_n = 1$, and we can calculate $Q_{k-1}$ backward for k = n to 2 by

$$Q_{k-1} = A_{k-1}^{-1} \left[ (B_{k-1} + C_k) Q_k - D_k Q_{k+1} \right]$$

Normalizing by $Q_1$ leads to

$$q_k = Q_k / Q_1 \quad \text{for } k = 1, 2, \ldots, n \tag{10}$$

i.e., the relative values for the interface pressures when $q_1 = 1$. We can also obtain the corresponding displacements $u_1, \ldots, u_n, u_{n+1}$ at $r_1, \ldots, r_n, r_{n+1}$.
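
The backward recurrence for the relative interface pressures can be sketched as follows. This is an illustrative sketch, not the authors' program: the function names and the input layout (a list of radii and a list of per-layer triples of plane-strain compliances) are assumptions made here, and the coefficients follow Eqs. (6) and (8) as reconstructed above, with a traction-free outer surface.

```python
import math


def layer_coeffs(r_in, r_out, b_rr, b_rt, b_tt):
    """Coefficients of Eqs. (6) and (8) for one orthotropic layer.

    b_rr, b_rt, b_tt are the plane-strain compliances beta_rr, beta_r-theta,
    beta_theta-theta of Eq. (4).
    """
    d = r_out / r_in
    g = math.sqrt(b_rr / b_tt)
    c = 1.0 / (d ** (2 * g) - 1.0)
    b = c * d ** (2 * g)
    a = c * d ** (g - 1)
    return {
        "a": a, "b": b, "c": c, "d": d, "g": g,
        "A": 2 * a * g * b_tt,
        "B": b_rt + (b + c) * g * b_tt,
        "C": -b_rt + (b + c) * g * b_tt,
        "D": 2 * a * d ** 2 * g * b_tt,
    }


def interface_pressures(radii, betas):
    """Relative pressures [q_1, ..., q_{n+1}] for q_1 = 1 and q_{n+1} = 0.

    radii: [r_1, ..., r_{n+1}]; betas: one (b_rr, b_rt, b_tt) triple per layer.
    """
    n = len(betas)
    co = [layer_coeffs(radii[k], radii[k + 1], *betas[k]) for k in range(n)]
    Q = [0.0] * (n + 2)          # 1-based storage: Q[1] .. Q[n+1]
    Q[n], Q[n + 1] = 1.0, 0.0
    for k in range(n, 1, -1):    # k = n, n-1, ..., 2
        prev, cur = co[k - 2], co[k - 1]   # layers k-1 and k
        Q[k - 1] = ((prev["B"] + cur["C"]) * Q[k] - cur["D"] * Q[k + 1]) / prev["A"]
    return [Q[k] / Q[1] for k in range(1, n + 2)]   # normalization of Eq. (10)
```

One way to check such a sketch is to split a single homogeneous layer into several sublayers: the computed interface pressures should then coincide with the radial stress of the one-layer solution of Eq. (5) at the same radii.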

STEEL  LINER.  The  steel  liner  of  inside  radius  a  and  outer  radius  b  is 
elastic-plastically  isotropic  and  assumed  to  obey  Tresca’s  yield  criterion,  the 
associated  flow  rule,  and  linear  strain-hardening.  The  elastic  solution  for  the 
steel  liner  subjected  to  internal  pressure  p  and  external  pressure  q  is 


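$$
\left.\begin{matrix} \sigma_r \\ \sigma_\theta \end{matrix}\right\}
= \left[ \mp (p - q)(b/r)^2 + p - q\, b^2/a^2 \right] / \left( b^2/a^2 - 1 \right)
$$

$$u/r = E^{-1}(1+\nu)\left[ (p - q)(b/r)^2 + (1 - 2\nu)(p - q\, b^2/a^2) \right] / \left( b^2/a^2 - 1 \right) \tag{11}$$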
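
As a quick numerical check of the elastic liner solution, the sketch below evaluates Eq. (11) and confirms the boundary conditions $\sigma_r(a) = -p$ and $\sigma_r(b) = -q$. The function name and the sample values in the usage note (including the elastic modulus and Poisson's ratio) are assumptions chosen for illustration; they are not taken from the paper's tables.

```python
def elastic_liner(r, a, b, p, q, E, nu):
    """Radial stress, hoop stress and radial displacement from Eq. (11).

    a, b: inner and outer radii; p, q: internal and external pressures;
    E, nu: Young's modulus and Poisson's ratio (plane strain).
    """
    k = b * b / (a * a) - 1.0
    varying = (p - q) * (b / r) ** 2      # term that changes sign between stresses
    uniform = p - q * b * b / (a * a)     # term common to both stresses
    sigma_r = (-varying + uniform) / k
    sigma_t = (varying + uniform) / k
    u = r * (1.0 + nu) * (varying + (1.0 - 2.0 * nu) * uniform) / (E * k)
    return sigma_r, sigma_t, u
```

For example, with a = 1.0, b = 1.17, p = 10, q = 3 (Ksi) and assumed values E = 30000 Ksi, nu = 0.3, the radial stress returned at r = a is -10 and at r = b is -3, up to rounding.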


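When the internal pressure p is large enough, part of the steel liner $(a \le r \le \rho)$ will become plastic, where $\rho$ is the radius of the elastic-plastic interface. The elastic-plastic solution can be written in the elastic portion $(\rho \le r \le b)$ as

$$\frac{E}{\sigma_0}\frac{u}{r} = \frac{1+\nu}{2}\frac{\rho^2}{r^2} + (1-\nu-2\nu^2)\left(\frac{\rho^2}{2b^2} - \frac{q}{\sigma_0}\right)$$

$$
\left.\begin{matrix} \sigma_r/\sigma_0 \\ \sigma_\theta/\sigma_0 \end{matrix}\right\}
= \frac{1}{2}\left( \mp\frac{\rho^2}{r^2} + \frac{\rho^2}{b^2} \right) - \frac{q}{\sigma_0}
$$

$$\sigma_z/\sigma_0 = \nu \rho^2/b^2 - 2\nu q/\sigma_0 \tag{12}$$

and in the plastic portion $(a \le r \le \rho)$

$$\frac{E}{\sigma_0}\frac{u}{r} = (1-\nu-2\nu^2)\frac{\sigma_r}{\sigma_0} + (1-\nu^2)\frac{\rho^2}{r^2}$$

$$
\left.\begin{matrix} \sigma_r/\sigma_0 \\ \sigma_\theta/\sigma_0 \end{matrix}\right\}
= \mp\frac{1}{2}\left( 1 - \eta\beta + \eta\beta\frac{\rho^2}{r^2} \right) + \frac{1}{2}\frac{\rho^2}{b^2} - (1-\eta\beta)\ln\frac{\rho}{r} - \frac{q}{\sigma_0}
$$

$$\sigma_z/\sigma_0 = \nu \rho^2/b^2 - 2\nu(1-\eta\beta)\ln\frac{\rho}{r} - 2\nu q/\sigma_0$$

$$\varepsilon^p = \beta\left(\rho^2/r^2 - 1\right)$$

$$\eta\beta = \frac{m}{m + \dfrac{3}{4}\dfrac{1-m}{1-\nu^2}}$$

$$\eta = \frac{2}{\sqrt{3}}\frac{E}{\sigma_0}\frac{m}{1-m}$$

$$\sigma = \sigma_0\left(1 + \eta\varepsilon^p\right) \tag{13}$$

where $\sigma_0$ is the initial tensile yield stress, $E_t$ is the tangent modulus in the plastic range of the stress-strain curve, and $m = E_t/E$.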

When  the  internal  pressure  is  further  increased,  the  steel  liner  will 
become  fully-plastic.  Using  Tresca's  yield  criterion,  the  associated  flow  rule, 
and  assuming  linear  strain-hardening,  the  fully-plastic  solution  derived  in  [3] 
is  given  below. 

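Subject to $\sigma_\theta \ge \sigma_z \ge \sigma_r$, the analytical expressions for the stresses and displacement are

$$\sigma_r = -p + \sigma_0 (1-\eta\beta)\ln\frac{r}{a} + \frac{1}{2}\frac{\eta\beta}{1-\nu^2}\left(\frac{b^2}{a^2} - \frac{b^2}{r^2}\right) E\phi$$

$$\sigma_\theta = \sigma_r + \sigma_0\left(1 + \eta\varepsilon^p\right)$$

$$u/r = E^{-1}(1-2\nu)(1+\nu)\sigma_r + \phi\, b^2/r^2 \tag{14}$$

where

$$\phi = u_b/b + (1-2\nu)(1+\nu)E^{-1} q$$

$$\varepsilon^p = \frac{2}{\sqrt{3}}\left[\phi\, b^2/r^2 - (1-\nu^2)\sigma_0/E\right] \Big/ \left[1 + \frac{2}{\sqrt{3}}(1-\nu^2)\eta\sigma_0/E\right]$$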

COMPOUND CYLINDER. The compound cylinder consists of an inner steel liner and an outer composite jacket. The steel liner of inside radius a and outer radius b is wrapped by a multilayered composite jacket. The displacement and normal traction at the interface between the liner and jacket should be continuous, i.e., $q = q_1$ and $u_b = u_1$. From these conditions we can determine the relations between p and q.

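When the internal pressure p is small, an explicit functional relation exists:

$$\frac{q}{p} = \frac{2(1-\nu^2)}{\left(b^2/a^2 - 1\right)\left[E(C_1 - D_1 q_2) + (1-\nu-2\nu^2)\right] + 2(1-\nu^2)} \tag{15}$$

where every term on the right-hand side is known. The displacement at the bore can also be expressed as an explicit function of p,

$$\frac{E}{p}\frac{u_a}{a}\left(\frac{b^2}{a^2} - 1\right) = (1+\nu)\left[\frac{b^2}{a^2} + (1-2\nu)\right] - 2(1-\nu^2)\frac{q}{p}\frac{b^2}{a^2} \tag{16}$$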


When the internal pressure is large enough, part of the steel liner will become plastic. The elastic-plastic solution is given in terms of the parameter $\rho$. The conditions of continuity require

$$\frac{q}{\sigma_0} = \frac{(1-\nu^2)\rho^2/b^2}{(1-\nu-2\nu^2) + E(C_1 - D_1 q_2)} \tag{17}$$

This, together with

$$\frac{p}{\sigma_0} = \frac{q}{\sigma_0} + \frac{1}{2}\left(1 - \frac{\rho^2}{b^2}\right) + (1-\eta\beta)\ln\frac{\rho}{a} + \frac{1}{2}\eta\beta\left(\frac{\rho^2}{a^2} - 1\right) \tag{18}$$

serves to give an implicit relation between p and q. By letting $\rho = a$ and $\rho = b$, we can determine the lower limits $p^*$, $q^*$, $u_a^*$, $u_b^*$ and the upper limits $p^{**}$, $q^{**}$, $u_a^{**}$, $u_b^{**}$, respectively.

When the internal pressure p is further increased, i.e., $p > p^{**}$, $u_a > u_a^{**}$, $u_b > u_b^{**}$, the conditions of continuity lead to

$$\phi = q\left[(C_1 - D_1 q_2) + (1-\nu-2\nu^2)/E\right] \tag{19}$$

and

$$\frac{p}{\sigma_0} - (1-\eta\beta)\ln\frac{b}{a} = \frac{q}{\sigma_0}\left\{1 + \frac{\eta\beta\left(b^2/a^2 - 1\right)}{2(1-\nu^2)}\left[E(C_1 - D_1 q_2) + (1-\nu-2\nu^2)\right]\right\} \tag{20}$$

It should be pointed out that the pressure q and the displacement $u_b$ at the interface are linear functions of internal pressure p. The bore displacement $u_a$ can be written as

$$\frac{u_a}{a} = -(1-2\nu)(1+\nu)\frac{p}{E} + \frac{b^2}{a^2}\phi \tag{21}$$

which is also a linear function of internal pressure p.

NUMERICAL RESULTS. Given any value of internal pressure, we can obtain numerical results for the stresses and strains in the radial and tangential directions and also for the displacement at any radial position in a steel pressure vessel wrapped with multilayered composites. The steel liner for the subscale test specimens [1] had an inner diameter of 2.0 inches and an outer diameter of 2.34 inches. The steel was 4130 seamless mechanical tubing heat treated to a hardness of 34 to 36 Rockwell "C." A standard ASTM tensile test was conducted to determine the 0.1 percent offset yield strength (120 Ksi) and the ultimate tensile strength (140 Ksi). The composite jacket is a graphite-bismaleimide produced by Fiberite Corporation. Its cure temperature is 450°F and it is wound and wrapped on the steel liner in the same manner as the full-scale gun tube specimen denoted as CTL III. The layup is again approximately half-scale and made up of two longitudinal layers alternating with two circumferential layers. Sixteen layers are applied in this way. Lamina properties for this material are given in Table 1. For the purpose of comparison, numerical results are obtained for four types of composite jackets as shown in Table 2. Cases 3 and 4 represent four hoop-axial and axial-hoop alternating layers, respectively, while cases 1 and 2 represent eight axial and hoop layers, respectively. The total thickness of each composite jacket is 0.12 inch and the steel liner is assumed to be linear strain-hardening with a = 1 inch, b = 1.17 inches, $\sigma_0$ = 120 Ksi, m = 0.04. In addition to the lower and upper limits ($p^*$ and $p^{**}$) of internal pressure in the elastic-plastic range, we also show in Table 2 two other limits ($p_{0.8}$ and $p_{1.3}$) which correspond to the internal pressure when $u_b/b$ = 0.8 and 1.3 percent, respectively. It should be noted that $u_b/b$ is the maximum hoop strain in the composite. Brittle failure of the composite material is assumed to occur at a maximum strain of 0.8 or 1.3 percent. The limits ($p_{0.8}$ or $p_{1.3}$) will be the maximum values of internal pressure these compound tubes can contain without failure.
The pressure at the interface between the liner and jacket has been obtained as a function of internal pressure, and the results for the first three cases are shown in Figure 1. The results of the hoop strains at the bore, at the interface between the liner and jacket, and at the outside surface for three cases are shown in Figures 2, 3, and 4, respectively, as functions of internal pressure. The complete (including elastic, elastic-plastic, and fully-plastic) ranges of loadings up to $p_{0.8}$ have been considered. These numerical results for the strains are presented here for future comparisons with experimental results.

The results of hoop stresses in the liner at the bore are shown in Figure 5 as functions of internal pressure. It should be noted that the relation changes drastically when yielding occurs. The results of hoop stresses in the liner at the interface are shown in Figure 6 as functions of internal pressure. The relation changes from linear to nonlinear when yielding sets in, and a more significant change occurs when the fully-plastic state is reached. The distribution of hoop stresses in the liner and jacket can be obtained at any given value of internal pressure. In Figures 7, 8, and 9 we present the numerical results for three cases of composite jackets at three values of internal pressure, i.e., $p = p^*$, $p^{**}$ and when half of the liner is plastic. The values of internal pressure when half of the liner is plastic are p = 18.61, 23.86, 21.41 Ksi for cases 1, 2, 3, respectively. The values of the two limits, $p^*$ and $p^{**}$, are given in Table 2 for all four cases. When the composite jacket is made of axial lamina only, the hoop stresses in the jacket are very small, as shown in Figure 7. When the liner is wrapped by hoop lamina only, the hoop stresses in the jacket become larger as the internal pressure increases, as shown in Figure 8. When the jacket consists of alternating hoop-axial lamina, the hoop stresses become discontinuous not only at the interface between the liner and jacket but also at all other interfaces between axial and hoop lamina.
TABLE 1. ELASTIC CONSTANTS OF STEEL AND COMPOSITE MATERIALS

Columns: Material; E_theta (x10^6 psi); E_r (x10^6 psi); E_z (x10^6 psi); nu_rz; nu_r-theta; nu_z-theta
Rows: Hoop lamina IM6; Axial lamina G50; Steel 4130
(The numerical entries of this table are illegible in the source.)

TABLE 2. LIMITS OF INTERNAL PRESSURE FOR FOUR CASES

Case   Layup         p* (Ksi)   p** (Ksi)   p_0.8 (Ksi)   p_1.3 (Ksi)
1      (90°)8        16.49      19.44       21.26
2      (0°)8         20.95      25.55       35.20
3      (0°,90°)4     18.87      22.70       28.59
4      (90°,0°)4     18.80      22.60       28.3b         34.90
Figure 1. Interface pressure as a function of internal pressure.

Figure 2. Hoop strain at the bore as a function of internal pressure.

Figure 3. Hoop strain at the interface as a function of internal pressure.

Figure 4. Hoop strain at the outside as a function of internal pressure.
Abstract

A common problem that occurs when performing exact computations is expression swell, in which the size of the expressions involved in a calculation grows dramatically. An important special case of this phenomenon is intermediate expression swell where, during the middle stages of a calculation, intermediate expressions can expand substantially, but the final results of the calculation are comparatively simple. Computing the characteristic polynomial of a matrix is a good calculation for examining the effects of expression swell, which are very striking, even for small matrices. A number of case studies involving matrices consisting of integer and rational entries have been performed. In addition, some worst case theoretical analyses have been done and the results compared with those of the case studies.

*This work is partially sponsored by the Army Research Office and is being done under the direction of Professor Stanly Steinberg as part of the requirements for a Ph.D.

**The author of this paper presented it at the Sixth Army Conference on Applied Mathematics and Computing.


1 Introduction

A common phenomenon that occurs when performing exact computations is expression swell, in which the size of numbers and expressions involved in a calculation grows dramatically as the calculation progresses. A typical example of this phenomenon is the calculation of the roots of a third or fourth degree univariate polynomial which does not factor over the rational numbers, so that the cubic or quartic formula must be used. As a particular example [Gro87], the characteristic polynomial of a 9 x 9 real symmetric Hankel matrix is

$$\lambda^9 + \lambda^8 - 40\lambda^7 - 24\lambda^6 + 240\lambda^5 + 144\lambda^4 ,$$

which can be factored over the rational numbers into

$$\lambda^4 (\lambda + 6)(\lambda^4 - 5\lambda^3 - 10\lambda^2 + 36\lambda + 24) .$$
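
The factorization above is easy to confirm with exact integer arithmetic. The short sketch below multiplies the three factors back together; the helper name `poly_mul` and the coefficient-list convention (lowest degree first) are choices made here for illustration.

```python
def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists, lowest degree first."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


lam4 = [0, 0, 0, 0, 1]            # lambda^4
linear = [6, 1]                   # lambda + 6
quartic = [24, 36, -10, -5, 1]    # lambda^4 - 5 lambda^3 - 10 lambda^2 + 36 lambda + 24

product = poly_mul(poly_mul(lam4, linear), quartic)
# Coefficients of lambda^0 .. lambda^9 of the characteristic polynomial quoted in the text.
expected = [0, 0, 0, 0, 144, 240, -24, -40, 1, 1]
```

Comparing `product` with `expected` shows that the two lists agree term by term.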

To  complete  the  solution  of  the  eigenvalue  problem,  the  roots  of  the  last  factor  must  be 
extracted  using  the  quartic  formula.  One  of  the  roots  is  shown  in  Figure  1.  An  estimate 
of  the  size  of  this  expression  can  be  obtained  by  counting  the  number  of  operators  and 
atomic  operands  involved  in  its  construction.  In  this  case,  using  the  expression’s  internal 
representation  in  MACSYMA,  a  size  of  300  was  computed.  The  other  three  roots  are  of  the 
same  complexity  and  also  have  sizes  of  300.  Of  course,  if  these  roots  were  to  be  evaluated 
numerically,  each  root  would  be  reduced  to  a  single  complex  floating  point  number  (which 
would  have  a  size  of  5). 

An  important  special  case  of  the  expression  swell  phenomenon  is  intermediate  expression 
swell.  This  refers  to  a  condition  where,  during  the  middle  stages  of  a  calculation,  intermediate 
expressions  can  expand  substantially,  but  the  final  results  of  the  calculation  are  comparatively 
simple.  A  typical  example  here  is  the  verification  of  a  trigonometric  or  tensor  identity.  As  an 
example  of  the  latter,  Figure  2  presents  the  results  of  MACSYMA  computing  the  left-hand 
side  of  the  Bianchi  identity  for  a  symmetric  connection 

$$K^{i}{}_{j\,hk|p} + K^{i}{}_{j\,kp|h} + K^{i}{}_{j\,ph|k}$$

in  terms  of  Christoffel  symbols  of  the  second  kind.  Here,  K  is  the  Riemann  curvature  tensor. 
This  sum  contains  72  terms,  each  of  which  is  a  product  of  2  or  3  Christoffel  symbols,  for  a 
total  of  180  Christoffel  symbols.  However,  upon  simplifying  this  expression  by  consistently 
renaming  the  dummy  indices,  the  simple  result  of  zero  is  obtained  which  verifies  the  identity. 

Expression swell is a major problem in symbolic mathematical computations. As an expression grows in size, it takes up more and more memory and/or disk space while also taking more and more time to be manipulated. A shortage of either of these resources can cause a computation to fail, even if the final result is known to be relatively simple. Sometimes a computation can be reorganized so that it uses fewer resources (or more of one and less of another) and thus succeeds where previously it failed. Sometimes nothing will help but the acquisition of more space (memory, disk, etc.) and/or an increase in the processing speed (bringing the time of a computation down to a reasonable level).

In  the  past,  discussion  of  expression  swell  in  symbolic  mathematical  computations  has 
often  been  anecdotal.  In  this  paper,  I  will  present  some  quantitative  results  of  the  effects  of 
expression  swell  when  computing  the  characteristic  polynomial  (and  the  determinant)  of  a 
matrix.  These  results  will  be  a  combination  of  case  studies  involving  matrices  consisting  of 
integer  and  rational  entries  and  some  theoretical  worst  case  (and  other)  analyses.  Many  of 
the  findings  are  quite  striking,  even  for  small  matrices. 
2  Theoretical  Expression  Swell  Analysis

Since expression swell is such an important problem in symbolic mathematical computations, a very useful ability is to be able to predict the extent of this phenomenon during the course of a particular calculation. One way to do this is to set up an algorithm and run an extensive series of calculations under a variety of initial conditions. An analysis of the results will provide an idea (perhaps a good one) of the progress of expression growth during the calculation. In a subsequent section, a statistical survey of determinant and characteristic polynomial calculations under a limited set of initial conditions is presented. The problem with this approach to quantitatively assessing the expression swell inherent in a given calculation is that typically a large number of sample calculations need to be made, which can be an expensive and time consuming proposition.
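
A minimal sketch of such an empirical measurement is given below. It is not the MACSYMA code used for the case studies: it uses ordinary Gaussian elimination over Python's exact `Fraction` type as a stand-in algorithm, and it measures the size of every intermediate entry as the digit count of the numerator plus that of the denominator (so an integer entry counts its implicit denominator 1). The function names are hypothetical.

```python
from fractions import Fraction


def digits(n):
    """Number of decimal digits of an integer, ignoring its sign."""
    return len(str(abs(n)))


def size(x):
    """Size of a rational number: digits of numerator plus digits of denominator."""
    x = Fraction(x)
    return digits(x.numerator) + digits(x.denominator)


def det_with_swell(rows):
    """Return (determinant, largest entry size seen) using exact elimination."""
    m = [[Fraction(v) for v in row] for row in rows]
    n = len(m)
    det = Fraction(1)
    peak = max(size(v) for row in m for v in row)
    for col in range(n):
        # Find a nonzero pivot in this column, swapping rows if necessary.
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0), peak
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
                peak = max(peak, size(m[r][c]))
    return det, peak
```

Running `det_with_swell` over many sample matrices and comparing the returned peak size with `size(det)` gives a rough empirical picture of intermediate expression swell, at the cost of the many sample calculations noted above.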

Generally,  a  better  way  of  charting  the  course  of  expression  growth  is  to  make  some  kind 
of  theoretical  prediction.  A  common  type  of  theoretical  estimate  is  a  bounding  calculation. 
For  an  expression  swell  analysis  of  an  algorithm,  there  are  two  important  kinds  of  bounding 
calculations  that  can  be  performed.  One  involves  worst  case  behavior  in  which  the  expression 
size  is  maximized  for  the  outcome  of  each  mathematical  operation  (the  results  of  an  addition, 
of  applying  a  function  to  its  arguments,  of  applying  an  operator  to  a  function,  etc.),  while  the 
other  involves  best  case  behavior  in  which  the  expression  size  is  minimized  for  the  outcome 
of  each  mathematical  operation.  In  general,  determining  worst  or  best  case  behavior  for 
expression  swell  of  a  general  operation  on  general  operands  is  extremely  difficult.  Even  just 
the  notion  of  expression  size  is  not  clearly  defined.  How  is  the  size  of  an  expression  to  be 
judged?  Does  it  mean  the  number  of  terms  in  the  expression,  the  number  of  operators  and 
atomic  operands,  the  number  of  characters  in  some  representation  of  the  expression  or  some 
other  measure?  Or  perhaps,  different  measures  are  appropriate  at  different  times. 

Matters  are  considerably  simplified  if  only  the  expression  swell  analysis  of  algorithms 
involving  infinite  precision  integer  and  rational  number  expressions  is  considered.  In  these 
cases,  expression  size  can  be  well-defined.  For  an  integer,  the  number  of  digits  in  its  decimal 
representation  is  a  good  indicator  of  its  size.  (One  could  also  take  the  absolute  value  of  the 
number  itself  to  represent  its  size  but  this  definition  does  not  provide  enough  generality  to 
be  useful — it  is  no  easier  nor  much  more  informative  to  do  an  analysis  using  this  definition 
than  it  is  to  just  do  the  calculation  with  the  original  integers.)  This  definition  of  the  size  of 
an integer is essentially a logarithmic measure. Indeed, the number of digits comprising an
integer n, $\mathcal{N}(n)$, can alternatively be defined by

$$\mathcal{N}(n) = \lfloor \log_{10} |n| \rfloor + 1$$

where $\lfloor\ \rfloor$ denotes the floor function. (In practice, $\mathcal{N}(n)$ will be computed directly by
actually counting the number of digits in n.) Similarly, the size of a rational number (which
need not be in lowest terms) can be defined as the sum of the sizes of its numerator and
denominator (or possibly as the maximum of these two values). For example, the size of
$\frac{-327}{1955}$ would be $\mathcal{N}(-327) + \mathcal{N}(1955) = 3 + 4 = 7$.
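
As a small illustration of this size measure, the following Python sketch counts digits directly instead of going through the logarithm. The names `ndigits` and `rational_size` are chosen here for illustration only, and treating zero as having no digits is an assumption that follows the class $\bar{0} = \{0\}$ introduced in the next subsection.

```python
def ndigits(n: int) -> int:
    """Number of decimal digits in n, ignoring sign (assumed 0 for n == 0)."""
    return 0 if n == 0 else len(str(abs(n)))


def rational_size(num: int, den: int, combine=sum) -> int:
    """Size of num/den: sum of the digit counts by default, or pass combine=max."""
    return combine((ndigits(num), ndigits(den)))


if __name__ == "__main__":
    print(rational_size(-327, 1955))        # sum of the two sizes
    print(rational_size(-327, 1955, max))   # alternative "maximum" measure
```

Counting characters of the decimal string avoids the floating point rounding that a direct evaluation of the logarithm could suffer for very large integers.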

In  order  to  deal  only  with  integer  and  rational  number  expressions  in  an  algorithm, 
the  operations  permitted  on  these  quantities  must  be  restricted  as  well.  In  particular,  the 
expression  swell  analysis  of  an  algorithm  with  solely  integer  inputs  can  proceed  in  a  simple 
manner  only  if  all  the  operations  in  the  algorithm  are  integer  or  at  least  rational  number 
preserving.  Therefore,  confining  the  scope  of  algorithms  studied  to  only  those  that  just  use 
rational  arithmetic  operations  (except  perhaps  at  the  last  stage,  such  as  when  computing 
a  polynomial  from  its  coefficients)  will  make  expression  swell  analysis  manageable  for  those 
problems  with  such  algorithms.  Most  of  the  calculations  from  linear  algebra  fall  into  this 
category. 

In  the  following  subsections,  an  expression  swell  arithmetic  is  developed.  This  arithmetic 
will  operate  on  numbers  that  represent  the  number  of  digits  in  classes  of  integers  and  rational 
numbers.  Integer  preserving  operations  will  be  treated  separately  from  rational  number 
preserving  ones.  This  division  is  made  since  exact  rational  number  arithmetic  exhibits  a 
more  complex  behavior  than  exact  integer  arithmetic.  After  developing  this  expression  swell 
arithmetic  and  discussing  an  implementation  of  it  in  MACSYMA,  examples  of  its  use  and 
comparisons  of  its  predictions  with  the  results  of  actual  calculations  will  be  presented  in 
later  sections. 

2.1  Integer  Calculations 

Here, a worst case expression swell arithmetic will be developed for certain integer
preserving operations. These include the rational arithmetic operations of addition, negation,
multiplication, exact division and exponentiation to a nonnegative integer power. Also
included are absolute values and greatest common divisors (GCDs). Precisely, worst case
behavior under an integer preserving operation means that the result will contain the
greatest number of digits that are possible (GCDs are a special case and will be discussed below).

To begin this development, note that $\mathcal{N}$ partitions the integers into an infinite set of
equivalence classes. Each equivalence class can be designated by a nonnegative integer:
$\bar{0}, \bar{1}, \bar{2}, \ldots$. Thus, $\bar{n}$ will represent the set of integers that have exactly n digits. For example,
$\bar{1} = \{-9, \ldots, -1, 1, \ldots, 9\}$ (as a special case, $\bar{0} = \{0\}$). The intent behind classifying the
integers in this way is to permit the analysis of algorithms where the inputs have specified
numbers of digits (are members of specified equivalence classes). The analyses will produce
upper bounds on the number of digits in the results for any representatives chosen from the
respective equivalence classes that are used as inputs to the algorithms.

Operations on the equivalence classes defined by $\mathcal{N}$ can yield one of two types of results.
An operation can produce an integer value. An obvious example is $\mathcal{N}(\bar{n}) = n$. That is, the
number of digits in any representative of $\bar{n}$ is no greater than n (actually, is exactly n in this
simple usage). This defines the operation of $\mathcal{N}$ on an equivalence class. The other possibility
is that an operation on one or more equivalence classes will itself yield an equivalence class.
For example, suppose

$$f(\bar{n}_1, \bar{n}_2, \ldots) = \overline{g(n_1, n_2, \ldots)} .$$

Then, for worst case expression swell behavior, this will mean that $g(n_1, n_2, \ldots)$ will bound
the number of digits in $f(n'_1, n'_2, \ldots)$ over all possible choices of $n'_1, n'_2, \ldots$ such that $\mathcal{N}(n'_1) =
n_1, \mathcal{N}(n'_2) = n_2, \ldots$. Another way of writing this is

$$\mathcal{N}(f(\bar{n}_1, \bar{n}_2, \ldots)) \le \mathcal{N}(\overline{g(n_1, n_2, \ldots)}) .$$


For example, multiplying the equivalence class $\bar{2}$ with itself (i.e. multiplying together all
possible pairs of 2-digit integers) will produce a set of 3 and 4-digit integers (ranging from
$\pm 10 \cdot \pm 10 = \pm 100$ to $\pm 99 \cdot \pm 99 = \pm 9801$). Therefore, the product of $\bar{2}$ with itself will be
defined to be $\bar{4}$ since no element of the product set will contain more than 4 digits. Note
that the result could have been defined as $\bar{5}$ or $\bar{6}$ or ... but these choices would not have
been as good since they do not yield as strict a bound on the maximum number of digits as
does $\bar{4}$.

The best choice for g in the above formulation will be a function that actually takes on
the value of $\mathcal{N}(f(\bar{n}_1, \bar{n}_2, \ldots))$ (i.e. that maximizes the number of digits in $f(n'_1, n'_2, \ldots)$ where
$\mathcal{N}(n'_i) = n_i$ for $i = 1, 2, \ldots$). For unary and binary operations, this is not difficult. Consider
the unary operations first.

Definition 1. The unary operations of integer absolute value and negation when applied
to the equivalence class $\bar{n}$ are defined by

(i) $|\bar{n}| = \bar{n}$,

(ii) $\ominus \bar{n} = \bar{n}$.

This says that taking the absolute value or the negation of an integer does not change
the number of digits it contains. The $\ominus$ is a “maximal expression swell” operator. This
notation has been introduced to emphasize the distinction between worst case expression
swell arithmetic and traditional integer arithmetic.

Definition 2. The binary operations of integer addition, subtraction, multiplication and
exact division when applied to the equivalence classes $\bar{n}_1$ and $\bar{n}_2$, where $n_1 > 0$ and $n_2 > 0$,
are defined by

(i) $\bar{n}_1 \oplus \bar{n}_2 = \overline{\max(n_1, n_2) + 1}$,

(ii) $\bar{n}_1 \ominus \bar{n}_2 = \overline{\max(n_1, n_2) + 1}$,

(iii) $\bar{n}_1 \otimes \bar{n}_2 = \overline{n_1 + n_2}$,

(iv) $\bar{n}_1 \overset{\mathrm{exact}}{\oslash} \bar{n}_2 = \overline{n_1 - n_2 + 1}$ $(n_1 \ge n_2)$.

If $n_1 = 0$ then the right-hand sides of (i) and (ii) become $\bar{n}_2$ and the right-hand sides of (iii)
and (iv) become $\bar{0}$. If $n_2 = 0$ then the right-hand sides of (i)-(iv) are respectively, $\bar{n}_1$, $\bar{n}_1$, $\bar{0}$
and undefined.
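
The four binary rules of Definition 2 can be sketched in Python by representing the class $\bar{n}$ simply by the integer n. The function names below are illustrative choices, not the names used in the MACSYMA implementation, and raising exceptions for the undefined cases is an assumption about how one might want to signal them.

```python
def swell_add(n1: int, n2: int) -> int:
    """Worst case digit count of a sum of an n1-digit and an n2-digit integer."""
    if n1 == 0:
        return n2
    if n2 == 0:
        return n1
    return max(n1, n2) + 1


# Worst case subtraction coincides with worst case addition.
swell_sub = swell_add


def swell_mul(n1: int, n2: int) -> int:
    """Worst case digit count of a product; zero annihilates."""
    return 0 if n1 == 0 or n2 == 0 else n1 + n2


def swell_ediv(n1: int, n2: int) -> int:
    """Worst case digit count of an exact quotient (requires n1 >= n2 > 0)."""
    if n2 == 0:
        raise ZeroDivisionError("exact division by the class of zero is undefined")
    if n1 == 0:
        return 0
    if n1 < n2:
        raise ValueError("exact division assumes n1 >= n2")
    return n1 - n2 + 1


if __name__ == "__main__":
    print(swell_add(4, 3), swell_mul(4, 3), swell_ediv(4, 3))
```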

Worst  case  expression  swell  addition  and  subtraction  are  equivalent  since  in  the  worst 
case  for  regular  subtraction,  the  two  operands  will  be  of  opposite  signs  and  so  it  will  really 
be  an  addition  of  two  terms  of  the  same  sign,  which  is  the  worst  case  for  regular  addition. 
Symbolically, this can be stated

$$\bar{n}_1 \ominus \bar{n}_2 = \bar{n}_1 \oplus (\ominus \bar{n}_2) = \bar{n}_1 \oplus \bar{n}_2 .$$

Now, to understand the rest of Definition 2, it is best to consider some examples. For
instance, the worst case of adding an $n_1$-digit number to an $n_2$-digit number occurs when all
the digits in the larger number are 9's. Then adding a smaller or equal sized number of the
same sign will at worst (in this particular case, will always) produce a result with one more
digit than the original integer. For example, the validity of the assertion $\bar{4} \oplus \bar{3} = \bar{5}$ can be
seen by looking at a worst case calculation like 9999 + 999 = 10998. For multiplication, the
worst case of expression size growth can be exhibited when both numbers consist of all 9's.
In this case, the multiplication becomes $(10^{n_1} - 1)(10^{n_2} - 1) = 10^{n_1 + n_2} - (10^{n_1} + 10^{n_2}) + 1$. The
two middle terms will always bring the final number of digits in the result down to $n_1 + n_2$.
Thus, the worst case $9999 \cdot 999 = 9989001$ establishes the validity of writing $\bar{4} \otimes \bar{3} = \bar{7}$.
Finally, the worst case for exact division occurs when the $n_1$-digit number is as large as
possible and the $n_2$-digit number is as small as possible. For instance, to demonstrate that
$\bar{4} \overset{\mathrm{exact}}{\oslash} \bar{3} = \bar{2}$, note that the worst situations are $9900 \div 100 = 99$ and $9999 \div 101 = 99$.
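
The worst cases quoted above can also be confirmed by exhaustive search for small digit counts. The following Python sketch is only a brute-force check (the helper names `members` and `worst_case` are invented here); it enumerates every pair of representatives, so it is practical only for one to three digits.

```python
from itertools import product


def ndigits(n: int) -> int:
    """Decimal digit count, with zero assumed to have no digits."""
    return 0 if n == 0 else len(str(abs(n)))


def members(n: int) -> list:
    """All integers, of either sign, with exactly n >= 1 decimal digits."""
    return [s * v for v in range(10 ** (n - 1), 10 ** n) for s in (1, -1)]


def worst_case(n1: int, n2: int) -> tuple:
    """Largest digit counts seen for sum, product and exact quotient."""
    add = mul = div = 0
    for a, b in product(members(n1), members(n2)):
        add = max(add, ndigits(a + b))
        mul = max(mul, ndigits(a * b))
        if a % b == 0:                      # only exact divisions count
            div = max(div, ndigits(a // b))
    return add, mul, div


if __name__ == "__main__":
    for n1, n2 in [(1, 1), (2, 1), (2, 2)]:
        # Compare with max(n1, n2) + 1, n1 + n2 and n1 - n2 + 1.
        print((n1, n2), worst_case(n1, n2))
```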

Like their integer arithmetic counterparts, worst case expression swell addition and
multiplication are commutative. However, the binary operation $\oplus$ at least is not associative,
unlike ordinary addition. A simple example shows this. $(\bar{1} \oplus \bar{2}) \oplus \bar{3} = \bar{3} \oplus \bar{3} = \bar{4}$ while
$\bar{1} \oplus (\bar{2} \oplus \bar{3}) = \bar{1} \oplus \bar{4} = \bar{5}$. Remember that the motivation behind developing an expression
swell arithmetic is to establish bounds on the rate of expression growth. Therefore, if one
way of ordering operations produces a tighter bound on expression swell than other methods,
then this way is to be preferred. The above example suggests that the optimal arrangement
for adding equivalence classes is to have them ordered in terms of increasing size (i.e. add
$\bar{n}_1, \bar{n}_2, \bar{n}_3$ where $n_1 \le n_2 \le n_3$ by $(\bar{n}_1 \oplus \bar{n}_2) \oplus \bar{n}_3$). $\otimes$ is associative as can be easily seen
and so needs no further elaboration.

$\otimes$ and $\oplus$ can also be thought of as n-ary operators (operators that can be applied to an
indefinite number of operands). $\otimes$ can be easily extended to handle an arbitrary number of
factors due to its associativity when considered as a binary operator. Extending $\oplus$ is more
difficult; however, some properties can be stated.

Definition 3. The n-ary operations of integer multiplication and addition when applied to
the equivalence classes $\bar{n}_1, \ldots, \bar{n}_k$ and $\bar{n}$, respectively, where $n_i > 0\ \forall\, i$ and $n > 0$, are defined
by

(i) $\bar{n}_1 \otimes \bar{n}_2 \otimes \cdots \otimes \bar{n}_k = \overline{n_1 + n_2 + \cdots + n_k}$ $(k > 0)$,

(ii) $k \odot \bar{n} = \underbrace{\bar{n} \oplus \bar{n} \oplus \cdots \oplus \bar{n}}_{k\ \text{terms}} = \overline{n + \mathcal{N}(k - 1)}$ $(k > 0)$.

If any of the $n_i = 0$ then the right-hand side of (i) becomes $\bar{0}$. If $n = 0$ then the right-hand
side of (ii) is $\bar{0}$.

The left-hand side of (ii) in the above definition defines a shorthand notation for the n-ary
sum on the right. To see the motivation behind this definition of the n-ary sum, consider
what happens in the worst case when the n-digit numbers are composed only of 9's. For
instance, suppose n = 2 then Table 1 will show the results of actual worst case sums for
various values of k as well as, in the last column, the theoretical quantities $\mathcal{N}(k \odot \bar{2})$.

      k      $k \cdot 99$    $\mathcal{N}(k \cdot 99)$    $\mathcal{N}(k \odot \bar{2})$
      1           99               2                        2
      2          198               3                        3
     10          990               3                        3
     11         1089               4                        4
    100         9900               4                        4
    101         9999               4                        5
    102        10098               5                        5

Table 1. Actual versus theoretical worst case n-ary addition.

Notice that eventually $\mathcal{N}(k \odot \bar{2})$ will occasionally exceed the actual worst case, but this
occurs only for relatively large values of k (starting at $k = 10^n + 1$ in the general case). In a
similar vein, note that the results of n-ary expression swell multiplication will also eventually
exceed the actual worst cases but again, only for relatively large values of k. For example,
if $n_i = 2$, $i = 1, \ldots, k$ then at least k = 230 factors are needed before the number of digits
predicted exceeds the actual worst case number of digits (by one).
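
The entries of Table 1 and the two thresholds just mentioned are easy to reproduce with exact integer arithmetic. The sketch below is illustrative only; `like_sum` is a name invented here for the quantity $n + \mathcal{N}(k - 1)$ of Definition 3(ii).

```python
def ndigits(n: int) -> int:
    """Decimal digit count, with zero assumed to have no digits."""
    return 0 if n == 0 else len(str(abs(n)))


def like_sum(k: int, n: int) -> int:
    """Predicted digits for a sum of k like terms of n digits: n + N(k - 1)."""
    if k <= 0:
        raise ValueError("k must be positive")
    return 0 if n == 0 else n + ndigits(k - 1)


def table_rows(ks=(1, 2, 10, 11, 100, 101, 102)) -> list:
    """Rows (k, k*99, actual digits, predicted digits) for two-digit terms."""
    return [(k, k * 99, ndigits(k * 99), like_sum(k, 2)) for k in ks]


def first_sum_overestimate(n: int = 2) -> int:
    """Smallest k for which the predicted sum size exceeds the all-nines case."""
    nines = 10 ** n - 1
    return next(k for k in range(1, 10 ** (n + 2)) if like_sum(k, n) > ndigits(k * nines))


def first_product_overestimate(n: int = 2) -> int:
    """Smallest k for which k*n digits exceeds the size of (10**n - 1)**k."""
    nines = 10 ** n - 1
    return next(k for k in range(1, 10 ** (n + 2)) if k * n > ndigits(nines ** k))


if __name__ == "__main__":
    for row in table_rows():
        print(row)
    print(first_sum_overestimate(), first_product_overestimate())
```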

The n-ary $\oplus$ operator introduced above is distinct from the binary version, although both
versions do produce the same result of $\overline{n + 1}$ for the common case of $\bar{n} \oplus \bar{n}$. For example,
using n-ary $\oplus$, $\bar{n} \oplus \bar{n} \oplus \bar{n} = \overline{n + 1}$ while under binary $\oplus$, this becomes $(\bar{n} \oplus \bar{n}) \oplus \bar{n} =
\overline{n + 1} \oplus \bar{n} = \overline{n + 2}$. Again, the tighter bound is to be preferred when doing expression swell
arithmetic, so when adding like terms, the n-ary definition above will be used. The infix
representation of n-ary $\oplus$ is somewhat misleading, hence the alternative notation of $k \odot \bar{n}$
for a sum of k like terms will be adopted whenever possible. This notation has the property
that $(k_1 \odot \bar{n}) \oplus (k_2 \odot \bar{n}) = (k_1 + k_2) \odot \bar{n}$ since the parenthesized terms on the left are really
sums and for estimating bounds, it is best to make the whole expression into a single n-ary
sum.

Two more worst case expression swell operations need to be defined to complete the
theoretical framework which will allow algorithms involving rational arithmetic to be analyzed.

Definition 4. The binary operation of integer exponentiation to a nonnegative integer
power and the n-ary operation of integer greatest common divisor when applied to the
equivalence classes $\bar{n}$ and $\bar{n}_1, \ldots, \bar{n}_k$, respectively, are defined by

(i) $\bar{n} \mathbin{\textcircled{\^{}}} 0 = \bar{1}$ $(n > 0)$,

(ii) $\bar{n} \mathbin{\textcircled{\^{}}} k = \underbrace{\bar{n} \otimes \bar{n} \otimes \cdots \otimes \bar{n}}_{k\ \text{factors}} = \overline{kn}$ $(k > 0)$,

(iii) $\gcd(\bar{n}_1, \bar{n}_2, \ldots, \bar{n}_k) = \bar{1}$ $(k \ge 1)$.

If $n = 0$ then the right-hand side of (i) becomes undefined.

The definition for exponentiation follows directly from the definition for n-ary multiplication
and is simply a special case of that operation. The definition of the GCD seems at first
inconsistent with the other definitions, but it is really quite appropriate for worst case expression
swell behavior. Typically in symbolic mathematical calculations, GCDs are used to find the
common factors of sets of expressions in order to simplify subsequent computations in some
way (e.g. removing the common factors from the numerator and denominator of a rational
number, reducing it to lowest terms). Therefore, expression swell will be maximized if the
quantities involved in a calculation are all relatively prime with respect to one another. This
implies that the GCD is always one. As an example, consider the least common multiple
(LCM) of an $n_1$-digit integer and an $n_2$-digit integer. In the worst case, the LCM will become

$$\operatorname{lcm}(\bar{n}_1, \bar{n}_2) = (\bar{n}_1 \otimes \bar{n}_2) \overset{\mathrm{exact}}{\oslash} \gcd(\bar{n}_1, \bar{n}_2) = \overline{n_1 + n_2} \overset{\mathrm{exact}}{\oslash} \bar{1} = \overline{n_1 + n_2} .$$
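
A minimal Python sketch of Definition 4 and of the LCM example follows, together with a brute-force search confirming that $n_1 + n_2$ digits are actually reached for small cases. The function names are illustrative, and only positive representatives are searched since signs do not affect digit counts.

```python
from math import gcd


def ndigits(n: int) -> int:
    """Decimal digit count, with zero assumed to have no digits."""
    return 0 if n == 0 else len(str(abs(n)))


def swell_power(n: int, k: int) -> int:
    """Worst case digits of an n-digit integer raised to the power k >= 0."""
    if k < 0:
        raise ValueError("only nonnegative integer powers are covered here")
    if k == 0:
        if n == 0:
            raise ValueError("the class of zero to the power zero is undefined")
        return 1
    return 0 if n == 0 else k * n


def swell_gcd(*sizes: int) -> int:
    """Worst case GCD: the operands are assumed relatively prime."""
    return 1


def swell_lcm(n1: int, n2: int) -> int:
    """(n1 (x) n2) exactly divided by the GCD class, for n1, n2 > 0."""
    return (n1 + n2) - swell_gcd(n1, n2) + 1


def worst_lcm_digits(n1: int, n2: int) -> int:
    """Largest digit count of lcm(a, b) over all n1-digit a and n2-digit b."""
    best = 0
    for a in range(10 ** (n1 - 1), 10 ** n1):
        for b in range(10 ** (n2 - 1), 10 ** n2):
            best = max(best, ndigits(a * b // gcd(a, b)))
    return best


if __name__ == "__main__":
    print(swell_lcm(2, 2), worst_lcm_digits(2, 2))
```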

In  a  manner  similar  to  the  above,  a  best  case  expression  swell  arithmetic  for  integer 
preserving  operations  can  be  developed.  Best  case  behavior  under  an  integer  preserving 
operation  means  that  the  result  will  contain  the  least  number  of  digits  that  are  possible. 
Hence,  this  arithmetic  will  provide  a  lower  bound  on  expression  growth  during  a  computation. 
However,  the  best  case  for  addition  and  subtraction  of  integers  with  the  same  number  of 
digits is zero (catastrophic cancellation) so the results of an analysis may be quite a bit less
interesting  than  for  worst  case  behavior.  In  this  paper,  no  analysis  of  best  case  expression 
swell  behavior  will  be  attempted. 

2.2  Rational  Number  Calculations 

Worst case expression swell arithmetic can also be developed for rational number
operations. Essentially, rational expression swell arithmetic can be considered as an extension of
integer expression swell arithmetic to ordered pairs of integer equivalence classes. These
ordered pairs will be denoted like $\frac{\bar{m}}{\bar{n}}$, which represents the set of rational numbers with m-digit
numerators and n-digit denominators. Since this is worst case arithmetic, the components
of the ordered pairs will be assumed to be relatively prime, implying that the corresponding
set of rational numbers are in lowest terms and cannot be reduced in size. This is consistent
with defining integer GCDs to be one as was done in the previous subsection. Note that
a  given  rational  number  (e.g.  |  €  i)  need  not  actually  be  in  lowest  terms  but  for  worst 
case arithmetic operations, this assumption will always be made nevertheless. With these
definitions, as with the integers, $\mathcal{N}$ can be seen to partition the rational numbers into a
countably infinite set of equivalence classes. $\frac{\bar{m}}{\bar{n}}$ will then be the general notation for one of
these equivalence classes where m and n can take on any integer value satisfying $m \ge 0$ and
$n > 0$ (if $m = 0$ then n is only allowed to be 1).

Now,  worst  case  rational  expression  swell  arithmetic  can  be  defined  by  analogy  with 
ordinary  rational  arithmetic  in  terms  of  the  operations  of  the  previously  defined  worst  case 
integer  expression  swell  arithmetic.  This  is  done  as  follows: 


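Definition 5. The unary operations of rational absolute value and negation when applied
to the equivalence class $\frac{\bar{m}}{\bar{n}}$ are defined by

(i) $\left| \frac{\bar{m}}{\bar{n}} \right| = \frac{\bar{m}}{\bar{n}}$,

(ii) $\ominus \frac{\bar{m}}{\bar{n}} = \frac{\bar{m}}{\bar{n}}$.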

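Definition 6. The binary operations of rational addition, subtraction, multiplication,
division, exact division and exponentiation to an integer power when applied to the equivalence
classes $\frac{\bar{m}_1}{\bar{n}_1}$ and $\frac{\bar{m}_2}{\bar{n}_2}$, where $m_1 > 0$ and $m_2 > 0$, are defined by

(i) $\frac{\bar{m}_1}{\bar{n}_1} \oplus \frac{\bar{m}_2}{\bar{n}_2} = \frac{(\bar{m}_1 \otimes \bar{n}_2) \oplus (\bar{n}_1 \otimes \bar{m}_2)}{\bar{n}_1 \otimes \bar{n}_2} = \frac{\overline{m_1 + n_2} \oplus \overline{n_1 + m_2}}{\overline{n_1 + n_2}}$,

(ii) $\frac{\bar{m}_1}{\bar{n}_1} \ominus \frac{\bar{m}_2}{\bar{n}_2} = \frac{(\bar{m}_1 \otimes \bar{n}_2) \ominus (\bar{n}_1 \otimes \bar{m}_2)}{\bar{n}_1 \otimes \bar{n}_2} = \frac{\overline{m_1 + n_2} \ominus \overline{n_1 + m_2}}{\overline{n_1 + n_2}}$,

(iii) $\frac{\bar{m}_1}{\bar{n}_1} \otimes \frac{\bar{m}_2}{\bar{n}_2} = \frac{\bar{m}_1 \otimes \bar{m}_2}{\bar{n}_1 \otimes \bar{n}_2} = \frac{\overline{m_1 + m_2}}{\overline{n_1 + n_2}}$,

(iv) $\frac{\bar{m}_1}{\bar{n}_1} \oslash \frac{\bar{m}_2}{\bar{n}_2} = \frac{\bar{m}_1}{\bar{n}_1} \otimes \frac{\bar{n}_2}{\bar{m}_2} = \frac{\overline{m_1 + n_2}}{\overline{n_1 + m_2}}$,

(v) $\frac{\bar{m}_1}{\bar{n}_1} \overset{\mathrm{exact}}{\oslash} \frac{\bar{m}_2}{\bar{n}_2} = \frac{\bar{m}_1 \overset{\mathrm{exact}}{\oslash} \bar{m}_2}{\bar{n}_1 \overset{\mathrm{exact}}{\oslash} \bar{n}_2} = \frac{\overline{m_1 - m_2 + 1}}{\overline{n_1 - n_2 + 1}}$,

(vi) $\left(\frac{\bar{m}}{\bar{n}}\right) \mathbin{\textcircled{\^{}}} 0 = \frac{\bar{m} \mathbin{\textcircled{\^{}}} 0}{\bar{n} \mathbin{\textcircled{\^{}}} 0} = \frac{\bar{1}}{\bar{1}}$ $(m > 0)$,

(vii) $\left(\frac{\bar{m}}{\bar{n}}\right) \mathbin{\textcircled{\^{}}} k = \frac{\bar{m} \mathbin{\textcircled{\^{}}} k}{\bar{n} \mathbin{\textcircled{\^{}}} k} = \frac{\overline{km}}{\overline{kn}}$ $(k > 0)$,

(viii) $\left(\frac{\bar{m}}{\bar{n}}\right) \mathbin{\textcircled{\^{}}} (-k) = \left(\frac{\bar{n}}{\bar{m}}\right) \mathbin{\textcircled{\^{}}} k = \frac{\overline{kn}}{\overline{km}}$ $(k > 0, m > 0)$.

If $m_1 = 0$ then the right-hand sides of (i) and (ii) become $\frac{\bar{m}_2}{\bar{n}_2}$ and the right-hand sides of
(iii)-(v) become $\frac{\bar{0}}{\bar{1}}$. If $m_2 = 0$ then the right-hand sides of (i)-(v) are respectively, $\frac{\bar{m}_1}{\bar{n}_1}$, $\frac{\bar{m}_1}{\bar{n}_1}$, $\frac{\bar{0}}{\bar{1}}$,
undefined and undefined.

A new operation, non-exact division, has been introduced but like the rest of the rational
operations, it is adapted directly from ordinary rational arithmetic and so should come as no
surprise. Finally, n-ary addition and multiplication for rational expression swell arithmetic
are simple extensions of the binary versions.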


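Definition 7. The n-ary operations of rational multiplication and addition when applied
to the equivalence classes $\frac{\bar{m}_1}{\bar{n}_1}, \ldots, \frac{\bar{m}_k}{\bar{n}_k}$ $(k > 0)$ where $m_i > 0\ \forall\, i$ are defined by

(i) $\frac{\bar{m}_1}{\bar{n}_1} \otimes \frac{\bar{m}_2}{\bar{n}_2} \otimes \cdots \otimes \frac{\bar{m}_k}{\bar{n}_k} = \frac{\bar{m}_1 \otimes \bar{m}_2 \otimes \cdots \otimes \bar{m}_k}{\bar{n}_1 \otimes \bar{n}_2 \otimes \cdots \otimes \bar{n}_k} = \frac{\overline{m_1 + m_2 + \cdots + m_k}}{\overline{n_1 + n_2 + \cdots + n_k}}$,

(ii) $\frac{\bar{m}_1}{\bar{n}_1} \oplus \frac{\bar{m}_2}{\bar{n}_2} \oplus \cdots \oplus \frac{\bar{m}_k}{\bar{n}_k} = \frac{(\bar{m}_1 \otimes \bar{n}_2 \otimes \cdots \otimes \bar{n}_k) \oplus (\bar{n}_1 \otimes \bar{m}_2 \otimes \cdots \otimes \bar{n}_k) \oplus \cdots \oplus (\bar{n}_1 \otimes \bar{n}_2 \otimes \cdots \otimes \bar{m}_k)}{\bar{n}_1 \otimes \bar{n}_2 \otimes \cdots \otimes \bar{n}_k} = \frac{\overline{m_1 + n - n_1} \oplus \cdots \oplus \overline{m_k + n - n_k}}{\bar{n}}$

where $n = \sum_{i=1}^{k} n_i$.

If any of the $m_i = 0$ then the right-hand side of (i) becomes $\frac{\bar{0}}{\bar{1}}$ and those terms with $m_i = 0$
are excluded from the sum in (ii).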
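
To make the rational rules concrete, the following Python sketch models a rational class as a pair (m, n) of digit counts and implements the binary addition, multiplication and non-exact division rules. The tuple representation and all function names are assumptions made for this illustration; `size_class` uses Python's `Fraction`, which always reduces to lowest terms, so observed sizes can only be at or below the predicted bound.

```python
from fractions import Fraction


def ndigits(n: int) -> int:
    """Decimal digit count, with zero assumed to have no digits."""
    return 0 if n == 0 else len(str(abs(n)))


def int_add(n1: int, n2: int) -> int:
    """Integer worst case addition of digit counts."""
    if n1 == 0:
        return n2
    if n2 == 0:
        return n1
    return max(n1, n2) + 1


def rat_add(a: tuple, b: tuple) -> tuple:
    """Worst case sum of classes a = (m1, n1) and b = (m2, n2)."""
    (m1, n1), (m2, n2) = a, b
    if m1 == 0:
        return b
    if m2 == 0:
        return a
    return (int_add(m1 + n2, n1 + m2), n1 + n2)


def rat_mul(a: tuple, b: tuple) -> tuple:
    """Worst case product; a zero numerator gives the class of 0/1."""
    (m1, n1), (m2, n2) = a, b
    if m1 == 0 or m2 == 0:
        return (0, 1)
    return (m1 + m2, n1 + n2)


def rat_div(a: tuple, b: tuple) -> tuple:
    """Worst case non-exact division: multiply by the reciprocal class."""
    (m1, n1), (m2, n2) = a, b
    if m2 == 0:
        raise ZeroDivisionError("division by the class of zero is undefined")
    if m1 == 0:
        return (0, 1)
    return (m1 + n2, n1 + m2)


def size_class(q: Fraction) -> tuple:
    """Digit counts of the (reduced) numerator and denominator of q."""
    return (ndigits(q.numerator), ndigits(q.denominator))


if __name__ == "__main__":
    print(rat_add((1, 1), (1, 1)), size_class(Fraction(9, 7) + Fraction(9, 8)))
```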


2.3  MACSYMA  Implementation 

To obtain some practical experience with the above concepts, both integer and rational
number worst case expression swell arithmetic have been implemented in MACSYMA. The
top-level interface to expression swell arithmetic operations (there also exists a LISP-level
interface which is accessed slightly differently) is provided by the functions abs_(x), neg_(x),
add_(term1, term2, ...), sub_(x,y), mul_(factor1, factor2, ...), div_(x,y), ediv_(x,y) [Exact
Division], power_(x,y) and gcd_(x,y) ($\mathcal{N}$ is performed by ndigits(e)). If the global
variable exprswell is false (its default value) then these functions will perform ordinary
arithmetic. However, if exprswell is set to true then these functions will treat their
arguments as equivalence classes of $\mathcal{N}$ (when appropriate) and perform worst case expression
swell arithmetic on them (no mixing of integer and rational number equivalence classes will
be permitted). Thus, an algorithm can be implemented using this set of functions in place
of the normal MACSYMA arithmetic operators except where expression swell operations
are never appropriate, such as index calculations or incrementing loop indices. Then, with
exprswell set to its default value, the algorithm can be run in a normal manner. However,
by simply changing the value of exprswell, an expression swell analysis of the algorithm for
inputs from a given set of equivalence classes can be performed.

The MACSYMA implementation of general integer worst case expression swell n-ary
addition is an extension of the n-ary addition of like terms defined previously. The terms
are first sorted into ascending order, then after eliminating any zeroes, the leading set of like
terms (which may be a single term) is combined using the n-ary addition of Definition 3. The
result is tacked onto the beginning of the list containing the remainder of the terms, which is
then resorted, if necessary, to maintain the terms in ascending order. Next, the leading set
of like terms are combined with perhaps an initial non-like term, once again using the n-ary
addition of Definition 3 (the non-like term, if there is one, will be treated as just another
like term in the n-ary addition). These last two steps will then repeat until the list contains
a single term. For example, $\bar{1} \oplus \bar{1} \oplus \bar{2} \oplus \bar{2} \oplus \bar{4} \oplus \bar{4} = \bar{2} \oplus \bar{2} \oplus \bar{2} \oplus \bar{4} \oplus \bar{4} = \bar{3} \oplus \bar{4} \oplus \bar{4} = \bar{5}$.
This algorithm was chosen so as to try to minimize the results of general n-ary addition and
thus provide as tight a bound as possible.
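
The procedure described above might be sketched in Python as follows. This is one reading of the prose description, not the MACSYMA source: a lone leading term is absorbed directly into the next run of like terms, which should give the same result as leaving it for the following pass. The names `like_sum` and `swell_sum` are invented for the illustration.

```python
def ndigits(n: int) -> int:
    """Decimal digit count, with zero assumed to have no digits."""
    return 0 if n == 0 else len(str(abs(n)))


def like_sum(k: int, n: int) -> int:
    """Sum of k like terms of n digits, Definition 3(ii): n + N(k - 1)."""
    return n + ndigits(k - 1)


def swell_sum(terms) -> int:
    """General worst case n-ary addition of digit counts."""
    terms = sorted(t for t in terms if t > 0)      # sort, drop zero classes
    if not terms:
        return 0
    while len(terms) > 1:
        head = terms[0]
        run = sum(1 for t in terms if t == head)   # leading run of like terms
        if run == 1:
            # A single smaller term is treated as one more like term
            # of the next (larger) run.
            head = terms[1]
            run = 1 + sum(1 for t in terms if t == head)
        combined = like_sum(run, head)
        terms = sorted([combined] + terms[run:])   # reinsert and resort
    return terms[0]


if __name__ == "__main__":
    print(swell_sum([1, 1, 2, 2, 4, 4]))
```

For comparison, folding the same six terms left to right with the binary rule of Definition 2(i) gives a bound one digit larger, which is the motivation for grouping like terms first.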


3  Matrix  Determinant  Computation 

There  are  two  major  algorithms  for  computing  the  determinant  of  a  square  matrix  that 
have  been  generally  implemented  in  symbolic  math  systems.  The  first  method  is  expansion 
by  cofactors  (or  expansion  by  minors).  For  example,  if 

$$A = \begin{pmatrix} 4 & 3 & 2 \\ 7 & 5 & 1 \\ 6 & 2 & 9 \end{pmatrix}$$

then expanding the cofactors along the third column will yield

$$\det A = 2(7 \cdot 2 - 5 \cdot 6) - 1(4 \cdot 2 - 3 \cdot 6) + 9(4 \cdot 5 - 3 \cdot 7) = -31 .$$


Worst  case  expression  swell  analysis  can  be  performed  on  the  above  method  of  computing 
the  determinant.  To  simplify  this  analysis,  it  is  helpful  to  expand  out  all  the  products  in 
the  cofactor  expansion.  For  an  n  x  n  matrix  (B),  this  will  produce  a  sum  of  n!  products  of 
n  factors  each,  half  of  which  are  added  and  the  rest  subtracted.  Therefore,  if  the  entries  of 
the  matrix  are  all  m-digit  integers,  then 


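$$
\begin{aligned}
\mathcal{N}(\det B) &\le \mathcal{N}\!\left( \left[ \tfrac{n!}{2} \odot (\bar{m} \mathbin{\textcircled{\^{}}} n) \right] \ominus \left[ \tfrac{n!}{2} \odot (\bar{m} \mathbin{\textcircled{\^{}}} n) \right] \right) \\
&\le \mathcal{N}\!\left( \left[ \tfrac{n!}{2} \odot (\bar{m} \mathbin{\textcircled{\^{}}} n) \right] \oplus \left[ \tfrac{n!}{2} \odot (\bar{m} \mathbin{\textcircled{\^{}}} n) \right] \right) \\
&\le \mathcal{N}(n! \odot (\bar{m} \mathbin{\textcircled{\^{}}} n)) \le \mathcal{N}(n! \odot \overline{nm}) \le nm + \mathcal{N}(n! - 1) . \qquad (1)
\end{aligned}
$$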

The other major algorithm generally implemented in symbolic math systems for
computing determinants begins with some variant of a fraction free reduction of a matrix to
triangular form (for example, see [Bar66]). This Gaussian elimination algorithm may be
either one-step or multi-step where the number of steps indicates the number of iterations
performed in a single pass through the matrix. The determinant will then be proportional
to the bottom rightmost element of the reduced matrix. For example, a one-step fraction
free reduction to triangular form of [adapted from Fox65]

$$C = \begin{pmatrix} 7 & 9 & -1 & 2 \\ 4 & -5 & 2 & -7 \\ 1 & 6 & -3 & -4 \\ 3 & -2 & -1 & -5 \end{pmatrix}$$

proceeds  as 


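$$
\begin{pmatrix} 7 & 9 & -1 & 2 \\ 0 & -71 & 18 & -57 \\ 0 & 33 & -20 & -30 \\ 0 & -41 & -4 & -41 \end{pmatrix}
\rightarrow
\begin{pmatrix} 7 & 9 & -1 & 2 \\ 0 & -71 & 18 & -57 \\ 0 & 0 & 118 & 573 \\ 0 & 0 & 146 & 82 \end{pmatrix}
\rightarrow
\begin{pmatrix} 7 & 9 & -1 & 2 \\ 0 & -71 & 18 & -57 \\ 0 & 0 & 118 & 573 \\ 0 & 0 & 0 & 1042 \end{pmatrix}
$$

where, during the kth stage, the elements $c_{ij}^{(k)}$ in the $(n - k) \times (n - k + 1)$ lower right block
are computed by

$$c_{ij}^{(k)} = \frac{c_{kk}^{(k-1)} c_{ij}^{(k-1)} - c_{ik}^{(k-1)} c_{kj}^{(k-1)}}{c_{k-1,k-1}^{(k-2)}}$$

(with $c_{00}^{(-1)}$ defined to be one). Thus, det C is 1042. In general, some kind of strategy will be
used to choose a “good” pivot at each stage and also to take care of the case of a forced zero
pivot which indicates a singular matrix.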
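
The one-step fraction free reduction can be sketched in a few lines of Python. This is a bare illustration of the update formula, with no pivoting strategy (a zero pivot simply raises an error), and the function name `fraction_free_reduce` is chosen here rather than taken from any system.

```python
def fraction_free_reduce(rows):
    """One-step fraction free reduction to upper triangular form.

    Returns (determinant, reduced matrix) for a square integer matrix,
    assuming no zero pivot is met (no pivoting strategy is attempted).
    """
    a = [list(r) for r in rows]
    n = len(a)
    prev = 1                               # the pivot c_00, defined to be one
    for k in range(n - 1):
        if a[k][k] == 0:
            raise ZeroDivisionError("zero pivot: a pivoting strategy is needed")
        for i in range(k + 1, n):
            for j in range(k + 1, n):
                # The division by the previous pivot is always exact.
                a[i][j] = (a[k][k] * a[i][j] - a[i][k] * a[k][j]) // prev
            a[i][k] = 0
        prev = a[k][k]
    return a[n - 1][n - 1], a


if __name__ == "__main__":
    C = [[7, 9, -1, 2], [4, -5, 2, -7], [1, 6, -3, -4], [3, -2, -1, -5]]
    det, reduced = fraction_free_reduce(C)
    print(det)
    for row in reduced:
        print(row)
```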

For an n x n matrix (call it D) consisting of m-digit integer elements, worst case expression
swell analysis of the above algorithm reveals that

$$\mathcal{N}(\det D) \le nm + (n - 1)^2 . \qquad (2)$$


This  can  be  seen  by  performing  the  elimination  on  a  representative  matrix  using  expression 
swell  arithmetic.  In  particular,  for  a  4  x  4  matrix  of  m-digit  integers,  the  reduction  proceeds 
as  follows: 


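$$
\begin{pmatrix} \bar{m} & \bar{m} & \bar{m} & \bar{m} \\ \bar{m} & \bar{m} & \bar{m} & \bar{m} \\ \bar{m} & \bar{m} & \bar{m} & \bar{m} \\ \bar{m} & \bar{m} & \bar{m} & \bar{m} \end{pmatrix}
\rightarrow
\begin{pmatrix} \bar{m} & \bar{m} & \bar{m} & \bar{m} \\ \bar{0} & \overline{2m+1} & \overline{2m+1} & \overline{2m+1} \\ \bar{0} & \overline{2m+1} & \overline{2m+1} & \overline{2m+1} \\ \bar{0} & \overline{2m+1} & \overline{2m+1} & \overline{2m+1} \end{pmatrix}
$$

$$
\rightarrow
\begin{pmatrix} \bar{m} & \bar{m} & \bar{m} & \bar{m} \\ \bar{0} & \overline{2m+1} & \overline{2m+1} & \overline{2m+1} \\ \bar{0} & \bar{0} & \overline{3m+4} & \overline{3m+4} \\ \bar{0} & \bar{0} & \overline{3m+4} & \overline{3m+4} \end{pmatrix}
\rightarrow
\begin{pmatrix} \bar{m} & \bar{m} & \bar{m} & \bar{m} \\ \bar{0} & \overline{2m+1} & \overline{2m+1} & \overline{2m+1} \\ \bar{0} & \bar{0} & \overline{3m+4} & \overline{3m+4} \\ \bar{0} & \bar{0} & \bar{0} & \overline{4m+9} \end{pmatrix}
$$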

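For more detail, consider the second stage calculation of the (3,4) element:

$$
\begin{aligned}
c_{34}^{(2)} &= \left[ (c_{22}^{(1)} \otimes c_{34}^{(1)}) \ominus (c_{32}^{(1)} \otimes c_{24}^{(1)}) \right] \overset{\mathrm{exact}}{\oslash} c_{11}^{(0)} \\
&= \left[ (\overline{2m+1} \otimes \overline{2m+1}) \ominus (\overline{2m+1} \otimes \overline{2m+1}) \right] \overset{\mathrm{exact}}{\oslash} \bar{m} \\
&= \left[ \overline{4m+2} \ominus \overline{4m+2} \right] \overset{\mathrm{exact}}{\oslash} \bar{m} = \overline{4m+3} \overset{\mathrm{exact}}{\oslash} \bar{m} = \overline{3m+4} .
\end{aligned}
$$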
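
The same reduction can be carried out mechanically on digit counts, which gives a way to check the closed form $nm + (n - 1)^2$ for other matrix sizes. The sketch below re-implements the integer rules of Definition 2 locally; the function names are illustrative assumptions.

```python
def swell_add(n1: int, n2: int) -> int:
    """Worst case addition (and subtraction) of digit counts."""
    if n1 == 0:
        return n2
    if n2 == 0:
        return n1
    return max(n1, n2) + 1


def swell_mul(n1: int, n2: int) -> int:
    """Worst case multiplication of digit counts."""
    return 0 if n1 == 0 or n2 == 0 else n1 + n2


def swell_ediv(n1: int, n2: int) -> int:
    """Worst case exact division of digit counts (n1 >= n2 > 0 or n1 == 0)."""
    return 0 if n1 == 0 else n1 - n2 + 1


def swell_reduce(n: int, m: int):
    """Fraction free reduction of an n x n matrix of m-digit classes."""
    a = [[m] * n for _ in range(n)]
    prev = 1                               # class of c_00 = 1 (one digit)
    for k in range(n - 1):
        for i in range(k + 1, n):
            for j in range(k + 1, n):
                numerator = swell_add(swell_mul(a[k][k], a[i][j]),
                                      swell_mul(a[i][k], a[k][j]))
                a[i][j] = swell_ediv(numerator, prev)
            a[i][k] = 0
        prev = a[k][k]
    return a


if __name__ == "__main__":
    for row in swell_reduce(4, 3):         # m = 3, so 4m + 9 = 21 is expected
        print(row)
```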

A  comparison  of  the  above  two  theoretical  worst  case  determinations  of  the  size  of  the 
determinant  of  an  n  x  n  matrix  with  m-digit  integer  entries  shows  that  the  former  yields  a 
tighter  and  thus  a  better  bound.  This  can  be  seen  by  examining  the  difference 

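$$
\begin{aligned}
\mathcal{N}(\det D) - \mathcal{N}(\det B) &= (n - 1)^2 - \mathcal{N}(n! - 1) \\
&= (n - 1)^2 - \lfloor \log_{10}(n! - 1) \rfloor - 1 \qquad (n > 1) \\
&\approx n^2 - 2n - \left\lfloor \log_{10}\!\left( \sqrt{2\pi n} \left( \tfrac{n}{e} \right)^n \right) \right\rfloor \qquad (n \rightarrow \infty) \\
&\approx n^2 - \left( n + \tfrac{1}{2} \right) \log_{10} n \qquad (n \rightarrow \infty) .
\end{aligned}
$$

The second equality comes from the definition of $\mathcal{N}$ and the third line is a result of
approximating n! (and hence n! - 1) for n large by Stirling's formula. Indeed, $\mathcal{N}(\det D)$ will
strictly dominate $\mathcal{N}(\det B)$ for all n > 2. The primary reason for this behavior is that worst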
case  expression  swell  analysis  is  performed  one  stage  at  a  time  while  following  the  Gaussian 
elimination  algorithm  but  all  at  once  for  the  expansion  by  cofactors.  The  latter  analysis 
takes  full  advantage  of  the  stricter  bounds  yielded  by  combining  as  many  n-ary  arithmetical 
operations  as  possible  at  one  time  and  so  produces  the  better  estimate.  The  former  analysis 
does  produce  bounds  on  the  size  of  the  matrix  elements  at  each  stage  of  the  elimination  but 
any  overestimates  from  previous  stages  tend  to  accumulate. 
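
The gap between the two bounds is easy to tabulate exactly. The short Python sketch below evaluates the right-hand sides of (1) and (2) and their difference; the function names are invented for this illustration.

```python
from math import factorial


def ndigits(n: int) -> int:
    """Decimal digit count, with zero assumed to have no digits."""
    return 0 if n == 0 else len(str(abs(n)))


def cofactor_bound(n: int, m: int) -> int:
    """Right-hand side of (1): nm + N(n! - 1)."""
    return n * m + ndigits(factorial(n) - 1)


def elimination_bound(n: int, m: int) -> int:
    """Right-hand side of (2): nm + (n - 1)^2."""
    return n * m + (n - 1) ** 2


def bound_gap(n: int) -> int:
    """Difference of the two bounds, which does not depend on m."""
    return elimination_bound(n, 1) - cofactor_bound(n, 1)


if __name__ == "__main__":
    for n in range(2, 11):
        print(n, cofactor_bound(n, 2), elimination_bound(n, 2), bound_gap(n))
```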

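The determinant of a matrix can also be bounded using Hadamard's inequality. This
relationship states that given an n x n matrix E,

$$|\det E|^2 \le \prod_{i=1}^{n} \left( \sum_{j=1}^{n} e_{ij}^2 \right) .$$

If all the elements of E are m-digit integers, then worst case expression swell analysis gives

$$
\begin{aligned}
\mathcal{N}(|\det E|^2) &\le \mathcal{N}([n \odot (\bar{m} \mathbin{\textcircled{\^{}}} 2)] \mathbin{\textcircled{\^{}}} n) \le \mathcal{N}([n \odot \overline{2m}] \mathbin{\textcircled{\^{}}} n) \\
&\le \mathcal{N}(\overline{2m + \mathcal{N}(n - 1)} \mathbin{\textcircled{\^{}}} n) \le 2nm + n\mathcal{N}(n - 1) .
\end{aligned}
$$

Hence,

$$\mathcal{N}(\det E) \le nm + \tfrac{1}{2} n \mathcal{N}(n - 1) \qquad (3)$$

carefully applying Definition 4(ii) in reverse. This estimate is of about the same asymptotic
order as the one for the cofactor expansion.

Rational expression swell arithmetic can also be applied fruitfully to Hadamard's
inequality. Suppose F is an n x n matrix with rational number entries consisting of m-digit
numerators and denominators, then

$$\mathcal{N}(|\det F|^2) \le \mathcal{N}\!\left( \left[ n \odot \left( \tfrac{\bar{m}}{\bar{m}} \mathbin{\textcircled{\^{}}} 2 \right) \right] \mathbin{\textcircled{\^{}}} n \right) \le \mathcal{N}\!\left( \left[ n \odot \tfrac{\overline{2m}}{\overline{2m}} \right] \mathbin{\textcircled{\^{}}} n \right) .$$

Now,

$$
k \odot \frac{\bar{m}}{\bar{m}} = \underbrace{\frac{\bar{m}}{\bar{m}} \oplus \cdots \oplus \frac{\bar{m}}{\bar{m}}}_{k\ \text{terms}}
= \frac{\overbrace{\underbrace{(\bar{m} \otimes \cdots \otimes \bar{m})}_{k\ \text{factors}} \oplus \cdots \oplus \underbrace{(\bar{m} \otimes \cdots \otimes \bar{m})}_{k\ \text{factors}}}^{k\ \text{terms}}}{\underbrace{\bar{m} \otimes \cdots \otimes \bar{m}}_{k\ \text{factors}}}
= \frac{k \odot \overline{km}}{\overline{km}} = \frac{\overline{km + \mathcal{N}(k - 1)}}{\overline{km}}
$$

therefore,

$$\mathcal{N}(|\det F|^2) \le \mathcal{N}\!\left( \frac{\overline{2nm + \mathcal{N}(n - 1)}}{\overline{2nm}} \mathbin{\textcircled{\^{}}} n \right) .$$

Carefully taking an exact square root once again,

$$\mathcal{N}(\det F) \le \mathcal{N}\!\left( \frac{\overline{\lceil n^2 m + n\mathcal{N}(n - 1)/2 \rceil}}{\overline{n^2 m}} \right) \le 2n^2 m + \tfrac{1}{2} n \mathcal{N}(n - 1) .$$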


4  Matrix  Characteristic  Polynomial  Computation 

Generally in symbolic math systems, the characteristic polynomial of a matrix A is
calculated by forming the matrix $A - \lambda I$, where $\lambda$ is a scalar variable, and then computing
its determinant. The determinant will always involve polynomial arithmetic, even for a
purely  numerical  matrix,  so  it  is  difficult  to  perform  expression  swell  analysis  on  this  method. 
However,  in  the  previous  section,  expression  swell  analysis  was  performed  on  the  determinant 
calculation  of  certain  integer  matrices.  Since  the  determinant  is  plus  or  minus  the  constant 
term  of  the  characteristic  polynomial,  which  for  nonsingular  matrices  is  typically  the  largest 
coefficient  of  the  polynomial  in  gross  size,  these  analyses  should  give  a  good  bound  on  the 
size  of  the  largest  coefficient  of  the  characteristic  polynomial  for  integer  matrices  with  entries 
of  the  same  size. 

A second method, which computes the coefficients of the characteristic polynomial of a
purely numerical matrix using no polynomial arithmetic, involves taking the trace of powers
of the matrix. The procedure takes advantage of the fact that for an n x n matrix A with
eigenvalues $\{\lambda_i\}_{i=1,\ldots,n}$,

$$\operatorname{trace} A^k = \sum_{i=1}^{n} \lambda_i^k .$$

The coefficients of the characteristic polynomial are then computed from the symmetric
functions of the traces of $A, A^2, \ldots, A^n$ [Sto72].
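
One standard way of turning the power traces into coefficients is Newton's identities; whether this matches the exact formulation in [Sto72] is an assumption. The Python sketch below returns the coefficients of $\det(\lambda I - A)$, leading coefficient first, so its constant term equals $(-1)^n \det A$. The function names are invented for the illustration, and `Fraction` is used because the intermediate divisions are rational in general.

```python
from fractions import Fraction


def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][t] * b[t][j] for t in range(n)) for j in range(n)]
            for i in range(n)]


def power_traces(a):
    """Traces of A, A^2, ..., A^n."""
    n = len(a)
    traces, p = [], [list(r) for r in a]
    for _ in range(n):
        traces.append(sum(p[i][i] for i in range(n)))
        p = mat_mul(p, a)
    return traces


def charpoly_from_traces(a):
    """Coefficients of det(lambda*I - A), highest power first."""
    n = len(a)
    p = power_traces(a)
    coeffs = [Fraction(1)]
    for k in range(1, n + 1):
        # Newton's identity: k*a_k = -(p_k + a_1*p_(k-1) + ... + a_(k-1)*p_1)
        s = sum(coeffs[j] * p[k - j - 1] for j in range(k))
        coeffs.append(-s / k)
    return coeffs


if __name__ == "__main__":
    C = [[7, 9, -1, 2], [4, -5, 2, -7], [1, 6, -3, -4], [3, -2, -1, -5]]
    print(charpoly_from_traces(C))
```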

The above method requires $O(n^4)$ operations and so is not practical for large matrices.
Nevertheless, it is instructive to perform worst case expression swell analysis on this
algorithm. Initially, consider the first portion of the algorithm, which computes the various
traces. If the entries of A are m-digit integers then the number of digits in the entries of
$AA = A^2$ will be bounded by $2m + \mathcal{N}(n - 1)$. For $AA^2 = A^3$, the number of digits in the
entries will be bounded by $3m + 2\mathcal{N}(n - 1)$ and in general, the number of digits in the entries
of $AA^{k-1} = A^k$ will be bounded by $km + (k - 1)\mathcal{N}(n - 1)$. Hence,

$$
\begin{aligned}
\mathcal{N}(\operatorname{trace} A^k) &= \mathcal{N}\!\left( \sum_{i=1}^{n} a_{ii}^{(k)} \right) \\
&\le \mathcal{N}(n \odot \overline{km + (k - 1)\mathcal{N}(n - 1)}) \le k[m + \mathcal{N}(n - 1)] .
\end{aligned}
$$

Now, the coefficient $c_k$ of $\lambda^k$ ($0 \le k \le n-1$) in the characteristic polynomial $P(\lambda)$ of A
will be composed of $p(n-k)$ terms, each of which will be proportional to a product of traces
such that the total sum of powers involved is $n-k$. Here, $p(n)$ is the number of partitions of
the integer n (i.e. the number of ways n can be written as a sum of positive integers where
order does not matter) and is approximated for large n by Hardy and Ramanujan's formula

$$p(n) \approx \frac{1}{4\sqrt{3}\,n}\, e^{\pi\sqrt{2n/3}} .$$

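For example, the coefficient of $\lambda$ for A, 4 x 4, is

$$c_1 = -\frac{\operatorname{trace} A^3}{3} + \frac{(\operatorname{trace} A^2)(\operatorname{trace} A)}{2} - \frac{(\operatorname{trace} A)^3}{6}$$

which consists of $p(4-1) = p(3) = 3$ terms. It is easily seen that $\mathcal{N}([\operatorname{trace} A]^k) = \mathcal{N}(\operatorname{trace} A^k)$, therefore,

$$\mathcal{N}(c_k) \le \mathcal{N}\bigl(p(n-k) \odot (n-k)[m + \mathcal{N}(n-1)]\bigr) \le (n-k)[m + \mathcal{N}(n-1)] + \mathcal{N}(p(n-k) - 1) .$$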

In  particular, 

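$$\mathcal{N}(\det A) = \mathcal{N}(c_0) \le nm + n\,\mathcal{N}(n-1) + \mathcal{N}(p(n) - 1) \qquad (4)$$

hence

$$\mathcal{N}(\det A) \lesssim nm + n\bigl(\lfloor \log_{10}(n-1) \rfloor + 1\bigr) + \Bigl\lfloor \log_{10}\Bigl( \frac{1}{4\sqrt{3}\,n}\, e^{\pi\sqrt{2n/3}} \Bigr) \Bigr\rfloor + 1 \qquad (n > 1)$$

$$\sim nm + n \log_{10} n \qquad (n \to \infty) .$$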


This  estimate  compares  favorably  to  the  one  derived  from  the  cofactor  expansion  of  the 
determinant  (although  it  will  be  seen  in  the  next  section  that  it  is  somewhat  higher). 
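Relation (4) is easy to evaluate directly. The sketch below is an illustration only: the helper names are hypothetical, it uses exact partition numbers instead of the Hardy and Ramanujan approximation, and it takes the digit count of an integer to be the length of its decimal representation. For m = 4 and the values of n tried here, it reproduces the "Sums of powers of traces" column of Table 2.

```python
def partitions(n):
    """Exact number of partitions p(n), by the standard coin-change recurrence."""
    p = [1] + [0] * n
    for part in range(1, n + 1):
        for total in range(part, n + 1):
            p[total] += p[total - part]
    return p[n]


def digits(x):
    """Number of decimal digits of a positive integer (assumed reading of N)."""
    return len(str(x))


def trace_digit_bound(n, m):
    """Right-hand side of relation (4): nm + n*N(n-1) + N(p(n)-1), for n > 1."""
    if n < 2:
        raise ValueError("relation (4) is used here only for n > 1")
    return n * m + n * digits(n - 1) + digits(partitions(n) - 1)


for n in (3, 10, 20, 100):
    print(n, trace_digit_bound(n, 4))
```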

A third way to compute the characteristic polynomial of a matrix is to transform the
matrix into upper Hessenberg form via a similarity reduction (which will preserve the
characteristic polynomial) and then compute the characteristic polynomial of the transformed
matrix using a computationally cheap algorithm. A matrix is upper Hessenberg if all the
entries below the subdiagonal are zero (i.e. H is upper Hessenberg if $h_{ij} = 0$ whenever
$i > j + 1$). The reduction of a general matrix into upper Hessenberg form proceeds via a
series  of  elementary  similarity  transformations.  For  example,  performing  the  first  stage  of 
the  reduction  on  the  matrix  C  of  the  previous  section  yields 

Once an upper Hessenberg matrix, H, similar to the original matrix, has been
constructed, the characteristic polynomial of the upper Hessenberg matrix (and thus of the
original matrix) is calculated next. The basic algorithm is to create a triangular system of
equations using H, which can then be easily solved for the coefficients of the
characteristic polynomial. In particular, let H be an n x n standard upper Hessenberg matrix (one
with no zero subdiagonal elements). Now, define the vectors $w_1, \dots, w_n$ by $w_{i+1} = H w_i$
for $i = 0, \dots, n-1$ where $w_0 = (1, 0, \dots, 0)^T$ is the n x 1 unit vector $e_1$. Then the upper
triangular system

$$(w_0 | \cdots | w_{n-1})\, a = -w_n$$

will have a unique solution for $a = (a_0, \dots, a_{n-1})^T$ and the characteristic polynomial of
H will be given by $\lambda^n + a_{n-1}\lambda^{n-1} + \cdots + a_1 \lambda + a_0$. If H has subdiagonal elements that
are zero then its characteristic polynomial will simply be the product of the characteristic
polynomials of each standard upper Hessenberg block found along H's diagonal.
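A minimal sketch of this triangular solve follows, assuming exact rational arithmetic and a standard upper Hessenberg input. The function name is hypothetical, and the sign convention is stated explicitly: by the Cayley-Hamilton theorem, $H^n e_1 + a_{n-1}H^{n-1}e_1 + \cdots + a_0 e_1 = 0$, so the system solved is $(w_0|\cdots|w_{n-1})a = -w_n$ and the returned polynomial is monic.

```python
from fractions import Fraction


def hessenberg_charpoly(h):
    """Coefficients [a_0, ..., a_{n-1}, 1] of the characteristic polynomial
    of a standard upper Hessenberg matrix h (no zero subdiagonal entries)."""
    n = len(h)
    # Krylov vectors: w_0 = e_1, w_{i+1} = H w_i.
    w = [[Fraction(1)] + [Fraction(0)] * (n - 1)]
    for _ in range(n):
        prev = w[-1]
        w.append([sum(Fraction(h[r][c]) * prev[c] for c in range(n))
                  for r in range(n)])

    # Column c of the system matrix is w_c, whose entries below row c vanish,
    # so the system is upper triangular and back substitution suffices.
    rhs = [-x for x in w[n]]
    a = [Fraction(0)] * n
    for r in range(n - 1, -1, -1):
        acc = rhs[r] - sum(w[c][r] * a[c] for c in range(r + 1, n))
        a[r] = acc / w[r][r]   # w[r][r] is a product of subdiagonal entries
    return a + [Fraction(1)]


print(hessenberg_charpoly([[1, 2, 3], [4, 5, 6], [0, 7, 8]]))
```

A zero subdiagonal entry makes some diagonal element w[r][r] vanish, and the sketch then fails with a ZeroDivisionError; in that case the matrix would first have to be split into its standard Hessenberg blocks.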

Worst  case  expression  swell  analysis  of  the  above  algorithm  is  difficult  to  generalize  and  so 
will  not  be  attempted.  In  the  following  section,  results  from  the  analyses  of  specific  examples 
will  be  presented.  These  analyses  were  performed  using  the  MACSYMA  implementation  of 
worst  case  expression  swell  arithmetic. 

5  Results  of  Case  Studies


It  is  all  very  well  to  derive  theoretical  bounds  on  the  possible  expression  swell  in  a 
calculation, but when compared to the results of actual computations, how good an
estimate of real behavior do these bounds really provide? In order to answer this question,
a  number  of  case  studies  were  performed  involving  matrices  of  various  sizes  with  initial 
integer  or  rational  number  entries.  Some  of  these  results  provide  quite  striking  examples  of 
intermediate  expression  swell  in  action. 

To  begin  with,  Table  2  presents  the  four  worst  case  bounds  (relations  (3),  (1),  (4)  and  (2), 
respectively,  of  Sections  3  and  4),  derived  for  the  determinant  of  an  n  x  n  matrix  containing 
m-digit  integer  entries,  for  a  variety  of  values  of  n  with  m  =  4.  Hadamard’s  inequality  yields 
the  lowest  upper  estimate  on  the  number  of  digits  in  the  determinant,  although  the  bounds 
derived  from  the  algorithms  involving  expansion  by  cofactors  and  taking  sums  of  powers 
of  traces  fall  within  the  same  approximate  asymptotic  order.  As  noted  before,  worst  case 
bounds  derived  from  the  fraction  free  Gaussian  elimination  algorithm  grow  much  faster  than 
for  the  other  methods. 

Table  3  presents  the  results  of  actual  calculations  performed  on  n  x  n  matrices  for  values 
of  n  ranging  from  3  through  10,  where  the  initial  matrix  elements  were  4-digit  integers.  The 
table  is  divided  into  three  sections.  The  first  section  (a)  exhibits  the  greatest  number  of 
digits  encountered  in  the  elements  of  the  upper  Hessenberg  matrix  produced  by  a  similarity 
transformation  of  the  original  matrix  using  a  division  free  version  of  the  algorithm  presented 
previously.  The  column  labeled  “exprswell”  shows  the  results  produced  by  setting  the  global 
variable  exprswell  to  true  in  MACSYMA.  The  next  column  displays  the  outcome  of  using 
an  initial  matrix  composed  of  the  n 2  largest  4-digit  primes.  The  last  three  columns  in  this 
table  present  the  results  of  a  statistical  sampling  in  which  matrices  consisting  initially  of 
random 4-digit integers were used. The first pair of numbers is the mean and standard
deviation obtained from a series of calculations whose number is given by $N_{\text{samples}}$.¹ The last
column gives the minimum and maximum values for the largest number of digits attained.

Section  (b)  of  Table  3  shows  the  number  of  digits  contained  in  the  largest  coefficient 
of  the  characteristic  polynomial  that  was  computed  directly  from  the  corresponding  upper 
Hessenberg  matrix  of  the  previous  section.  Due  to  the  division  free  nature  of  the  similarity 
reduction  to  upper  Hessenberg  form  that  was  used,  a  straightforward  computation  of  the 
characteristic polynomial will produce, in general, a non-monic result. However, the leading
coefficient  of  this  polynomial  will  exactly  divide  all  the  other  coefficients,  producing  a  monic 
polynomial  whose  constant  term  will  be  plus  or  minus  the  determinant  of  the  original  matrix. 

1  In  some  cases,  when  a  sequence  of  calculations  was  performed  as  here,  later  calculations  were  done 
fewer  times  due  to  time  constraints  and/or  random  problems  occurring  when  running  large  MACSYMA 
jobs  continuously  for  days  at  a  time  (such  as  occasional  memory  corruption  resulting  from  the  growth  of  a 
MACSYMA  job). 

Essentially, this particular procedure for computing the monic characteristic polynomial of
an integer matrix reserves all divisions until the final step of the calculation. The
maximum number of digits found in a coefficient (nearly always the constant term) of the final,
simplified characteristic polynomial is presented in the final section (c) of Table 3.

The  trend  displayed  in  Table  3  is  quite  typical  of  calculations  involving  intermediate 
expression  swell.  Even  though  the  final  numbers  have  relatively  few  digits,  intermediate 
computations in this particular algorithm are already creating integers whose size is
approaching 5 digits when n is just 10. It is interesting to note that the matrices with prime
entries  pretty  much  provide  the  smallest  actual  results  and  this  seems  to  be  true  as  well  for 
the  other  calculations  surveyed  here  that  have  started  with  integer  matrices. 

The  theoretical  limits  on  the  size  of  the  determinant,  tabulated  in  Table  2,  bound  the 
results  in  Table  3(c)  nicely.  The  “exprswell”  calculations  tend  to  greatly  overestimate  the 
maximum  number  of  digits  actually  produced  in  the  first  two  phases  of  the  computation 
except  for  the  lowest  values  of  n.  However,  after  the  final  exact  divisions,  these  bounds  drop 
down  to  much  more  reasonable  estimates.  One  has  to  be  careful,  though,  in  interpreting  the 
results of the exprswell computations for this last calculation. The coefficients of the
non-monic characteristic polynomials in Table 3(b) were determined by a series of calculations
that maximized the number of digits at each step. In order to truly maximize the number
of  digits  in  the  coefficients  of  the  final  monic  polynomials,  though,  the  leading  coefficients 
of  the  non-monic  polynomials  (which  act  as  divisors)  should  really  have  been  minimized  in 
the  course  of  their  calculation  (using  a  best  case  expression  swell  arithmetic).  This  action, 
however,  could  have  impacted  the  results  of  the  worst  case  expression  swell  arithmetic  in 
earlier  phases  of  the  calculations.  Thus,  the  occurrence  of  exact  divisions  in  an  algorithm 
such  as  this  one  can  lead  to  theoretical  uncertainties  in  verifying  whether  the  analysis  does 
indeed  produce  a  bounding  calculation,  although  in  practice,  no  exceptions  have  been  found 
so  far. 

Table  4  presents  the  maximum  number  of  digits  found  in  coefficients  of  the  characteristic 
polynomials  of  upper  Hessenberg  matrices  whose  initial  nonzero  entries  were  4-digit  integers. 
The  exprswell  calculations  and  the  results  from  the  statistical  survey  for  n  =  3  through  10 
are  remarkably  similar  to  those  exhibited  in  Table  3(c).  These  similarities  appear  to  imply 
that  the  results  of  computing  characteristic  polynomials  of  integer  Hessenberg  matrices  will 
give a very good indication of the trends to be expected when computing monic
characteristic polynomials of integer general matrices. This is useful since there are no theoretical
uncertainties here as there were above because this algorithm involves no divisions. Also, the
computation of the characteristic polynomial of an integer Hessenberg matrix is reasonably
fast as well as conservative of memory, so it was possible to perform calculations up through
n = 100.²

The  trends  of  Table  3(c)  become  more  pronounced  for  n  >  10  in  Table  4.  The  worst 
case  maximums  derived  from  Hadamard’s  inequality  continue  to  provide  good  bounds  on  the 
actual  results,  although  they  become  less  good  for  increasing  n.  For  n  >  10,  the  exprswell 
bounds,  which  are  growing  arithmetically,  begin  to  leave  the  real  recorded  maximums,  which 
are  growing  slightly  slower  than  linearly,  further  and  further  behind.  Generalizing  these  data 
tendencies  (as  well  as  those  found  for  m  =  5  and  6,  the  data  for  which  are  not  presented 
here), a good empirical bound on the maximum number of digits in the determinant of an
n x n integer matrix A which has entries consisting of no more than m digits appears to
be given simply by

$$\mathcal{N}(\det A) \le nm .$$
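The empirical bound can be explored on small examples with an exact determinant routine. The sketch below is an assumption-laden illustration rather than the procedure used for the tables: it uses the fraction free (Bareiss) form of Gaussian elimination, the function names are hypothetical, and the sample matrix is arbitrary.

```python
def bareiss_det(matrix):
    """Exact determinant of an integer matrix by fraction free elimination."""
    a = [row[:] for row in matrix]
    n = len(a)
    sign = 1
    prev = 1                      # previous pivot, the exact divisor
    for k in range(n - 1):
        if a[k][k] == 0:
            # Look for a nonzero pivot further down in column k.
            swap = next((i for i in range(k + 1, n) if a[i][k] != 0), None)
            if swap is None:
                return 0
            a[k], a[swap] = a[swap], a[k]
            sign = -sign
        for i in range(k + 1, n):
            for j in range(k + 1, n):
                # The division by the previous pivot is always exact.
                a[i][j] = (a[i][j] * a[k][k] - a[i][k] * a[k][j]) // prev
        prev = a[k][k]
    return sign * a[n - 1][n - 1]


def digit_count(x):
    """Number of decimal digits of a nonzero integer, ignoring its sign."""
    return len(str(abs(x)))


sample = [[1234, 5678, 9012], [3456, 7890, 1235], [6789, 1011, 1213]]
d = bareiss_det(sample)
print(d, digit_count(d), "digits; n*m =", 3 * 4)
```

Printing the digit count next to n·m for many random matrices gives a quick, informal check of the trend; it is not a proof, and matrices exceeding nm digits can be constructed deliberately.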

Table  5  shows  what  happens  when  the  Hessenberg  matrix  is  initially  filled  with  4-digit 
rational  numbers  (the  numerators  and  denominators  are  both  4-digit  integers)  and  the  matrix 
is  first  “derationalized.”  Derationalization  is  the  process  of  converting  a  matrix  of  rational 
numbers  into  a  matrix  of  integers  by  multiplying  the  matrix  by  the  least  common  multiple  of 
the  denominators  of  its  rational  number  entries.  This  number  (call  it  d )  is  an  implicit  divisor 
of  the  integer  matrix  and  if  the  matrix  is  involved  in  subsequent  calculations,  d  needs  to  be 
taken  into  account.  In  this  particular  example,  the  characteristic  polynomial  will  again,  in 
general,  be  non-monic  but  in  this  case,  dividing  the  coefficients  by  the  leading  coefficient  is 
not  guaranteed  to  be  exact  (since  this  is  really  the  characteristic  polynomial  of  a  rational 
number  matrix)  and  so  is  not  performed. 
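Derationalization as described above amounts to one least common multiple and one scaling. A minimal sketch, with a hypothetical function name and Python's Fraction type standing in for MACSYMA rational numbers:

```python
from fractions import Fraction
from math import gcd


def derationalize(matrix):
    """Return (integer_matrix, d), where d is the least common multiple of
    all denominators and integer_matrix = d * matrix."""
    d = 1
    for row in matrix:
        for entry in row:
            den = Fraction(entry).denominator
            d = d * den // gcd(d, den)      # running least common multiple
    ints = [[int(Fraction(entry) * d) for entry in row] for row in matrix]
    return ints, d


m = [[Fraction(1, 2), Fraction(1, 3)],
     [Fraction(1, 4), Fraction(5, 6)]]
print(derationalize(m))
```

The returned d is the implicit divisor mentioned in the text: any later result computed from the integer matrix still has to be rescaled by the appropriate power of d.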

In Table 5, the “prime” rational numbers (rational numbers whose numerators and
denominators are prime) provided the worst actual results of expression swell. The observation
that “prime” rational numbers typically generate the greatest expression growth can also be
noted in other calculations that start off with matrices of rational numbers, completely
contrary to what was observed earlier for calculations that started with integer matrices. A
nice  discovery  is  that  the  exprswell  calculations  produced  good  bounds  on  the  worst  case 
expression  swell.  This  may  be  due  to  a  couple  of  considerations.  The  prime  computations 
will  be  particularly  large  since  the  denominators  of  the  initial  rational  number  entries  will 
all  be  big  and  relatively  prime  to  each  other  so  that  derationalization  will  create  nearly  the 
maximum  possible  integer  entries  for  a  matrix  of  a  given  size  undergoing  this  transformation. 
The  exprswell  calculations  will  act  as  if  they  are  computing  the  characteristic  polynomials 
of  integer  Hessenberg  matrices  with  large  effective  values  of  m  (essentially,  the  original  m 
times the number of nonzero elements in the matrix). Thus, any excesses in the exprswell
computations (which seem to depend only on n) will be overwhelmed (at least, at these
values of n) by the large effective values of m.

2 All the calculations in this paper were performed on Sun 3/160 workstations with 4 megabytes of memory
running under a version of the UNIX operating system (Sun OS 3.4).

Table 6 presents the results of another characteristic polynomial computation via
an intermediate Hessenberg transformation. This time, the entries of the initial general
matrices  were  4-digit  rational  numbers  and  all  calculations  were  done  in  rational  arithmetic. 
Note  that  the  numbers  in  this  table  represent  the  number  of  digits  in  the  maximal  rational 
number  (i.e.  the  largest  value  obtained  by  summing  the  number  of  digits  in  the  numerator 
and  denominator  of  each  rational  number  under  consideration). 

Again, the calculations that started off with “prime” rational number matrices produced
the greatest growth in expression size. Also, like the computations whose results were shown
in Table 3, expression swell decreased dramatically between the intermediate results found in
the entries of the Hessenberg matrices and the final numbers that comprised the coefficients
of the characteristic polynomials. These latter numbers, upon examination of additional
data for m = 5 and 6 (not shown here), appear to be quite nicely bounded by $2n^2 m$, which
is very similar to the bound derived from Hadamard’s inequality. The bounds computed
from the MACSYMA implementation of rational worst case expression swell arithmetic are
huge and provide virtually no useful information.

As an experiment to see what effect GCDs have on rational number computations, the
above calculations on “prime” rational number matrices were repeated with the GCD
function forced always to return one, thus allowing no cancellation of common factors. Table 7
compares these results with those from Table 6, in which GCDs were taken freely.
It is quite clear from this comparison of maximal numbers of digits that GCDs, which are
taken every time a new rational number is formed in MACSYMA, have important effects in
minimizing expression swell which, in the samples reviewed in Table 7, is growing
exponentially for the “no GCDs” cases. The results for “no GCDs” are complete
only through n = 6 because the MACSYMA computations for bigger cases ran out of
available memory.
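The effect of suppressing GCDs can be seen on a much smaller example than the matrix computations above. The sketch below is a toy illustration with hypothetical names, not a reproduction of the MACSYMA experiment: it sums 1 + 1/2 + ... + 1/n once with unreduced numerator and denominator pairs and once with Python's Fraction type, which reduces after every operation, and it measures size as in Table 6 (digits of numerator plus digits of denominator).

```python
from fractions import Fraction


def add_unreduced(x, y):
    """Add two rationals stored as (numerator, denominator), never reducing."""
    (a, b), (c, d) = x, y
    return (a * d + c * b, b * d)


def harmonic_unreduced(n):
    """1 + 1/2 + ... + 1/n with the GCD effectively forced to return one."""
    total = (0, 1)
    for k in range(1, n + 1):
        total = add_unreduced(total, (1, k))
    return total


def harmonic_reduced(n):
    """The same sum with common factors cancelled at every step."""
    return sum(Fraction(1, k) for k in range(1, n + 1))


def rational_size(num, den):
    """Digits in the numerator plus digits in the denominator."""
    return len(str(abs(num))) + len(str(abs(den)))


num, den = harmonic_unreduced(10)
r = harmonic_reduced(10)
print(rational_size(num, den), rational_size(r.numerator, r.denominator))
```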

   n    Hadamard's    Cofactor     Sums of powers    Gaussian
        inequality    expansion    of traces         elimination

   3        14           13             16               16
   4        18           18             21               25
   5        23           23             26               36
   6        27           27             32               49
   7        32           32             37               64
   8        36           37             42               81
   9        41           42             47              100
  10        45           47             52              121
  20       100           99            123              441
  30       150          153            184              961
  40       200          208            245             1681
  50       250          265            306             2601
  60       300          322            367             3721
  70       350          381            427             5041
  80       400          439            488             6561
  90       450          499            548             8281
 100       500          558            609            10201

Table 2. Determinant of an integer general matrix (m = 4).

   n    exprswell    primes    random numbers     N_samples    min → max

   3        14         13       11.42 ± 0.75         100         9 → 12
   4        43         27       30.73 ± 2.10         100        24 → 35
   5       130         59       74.98 ± 2.66         100        68 → 81
   6       391         98      142.38 ± 4.26         100       130 → 152
   7      1174        155      249.75 ± 6.75         100       233 → 262
   8      3523        240      366.50 ± 9.53         100       343 → 388
   9     10570        334      585.30 ± 11.79        100       556 → 612
  10     31711        469      825.92 ± 14.22         13       797 → 844

Table 3(a). Division free transformation of an integer general matrix into Hessenberg form
(m = 4).


   n    exprswell    primes    random numbers      N_samples    min → max

   3                             30.09 ± 2.28
   4                            115.60 ± 7.90
   5                            364.62 ± 13.36
   6                            840.63 ± 25.23
   7                           1731.99 ± 47.74                  1616 → 1824
   8                           2911.08 ± 77.09                  2720 → 3081
   9                           5243.04 ± 106.39                 4977 → 5483
  10                           8243.42 ± 147.58                 7942 → 8422

Table 3(b). Characteristic polynomial of the Hessenberg matrix.


   n    exprswell    primes    random numbers    N_samples    min → max

   3        17          7       11.40 ± 0.84         10        10 → 12
   4        25          7       15.20 ± 0.63         10        14 → 16
   5        34          8       18.70 ± 0.67         10        18 → 20
   6        44          8       22.70 ± 0.95         10        21 → 24
   7        55         11       26.50 ± 0.97         10        24 → 27
   8        67         11       30.60 ± 0.52         10        30 → 31
   9        80         13       34.20 ± 0.63         10        33 → 35
  10        94         14       37.90 ± 0.57         10        37 → 39

Table 3(c). Above divided by its leading coefficient.


   n    exprswell    primes    random numbers    N_samples    min → max

   3        17          9       11.46 ± 0.54        100        10 → 12
   4        25         14       15.27 ± 0.60        100        13 → 16
   5        34         15       19.13 ± 0.68        100        17 → 20
   6        44         21       23.00 ± 0.70        100        21 → 24
   7        55         23       26.51 ± 0.75        100        24 → 28
   8        67         29       30.35 ± 0.73        100        28 → 32
   9        80         33       34.12 ± 0.76        100        33 → 36
  10        94         38       37.91 ± 0.79        100        36 → 39
  20       289                  75.40 ± 0.89          5        74 → 76
  30       584                 113.40 ± 1.14          5       112 → 115
  40       979                 150.60 ± 1.34          5       150 → 153
  50      1474                 187.60 ± 1.14          5       186 → 189
  60      2069                 227.60 ± 1.52          5       226 → 230
  70      2764                 264.40 ± 1.14          5       263 → 266
  80      3559                 301.80 ± 1.30          5       301 → 304
  90      4454                 341.60 ± 1.95          5       339 → 344
 100      5449                 378.00 ± 2.83          5       374 → 380

Table 4. Characteristic polynomial of an integer Hessenberg matrix (m = 4).


   n    exprswell    primes    random numbers       N_samples    min → max

   3       107         93       78.07 ± 5.23           100        65 → 89
   4       225        205      162.47 ± 11.43          100       123 → 187
   5       404        376      283.81 ± 19.35          100       236 → 331
   6       656        620      451.50 ± 30.81          100       346 → 507
   7       993        946      657.26 ± 45.02          100       536 → 746
   8      1427       1368      934.24 ± 56.08          100       804 → 1066
   9      1970       1895     1245.57 ± 80.66          100      1080 → 1426
  10      2634       2540     1612.45 ± 112.96         100      1245 → 1932

Table 5. Characteristic polynomial of a “derationalized” Hessenberg matrix, initially filled
with 4-digit rational numbers.

   n       exprswell    primes    random numbers       N_samples    min → max

   3              82       61       50.15 ± 4.84          100        36 → 60
   4             911      216      176.29 ± 11.92         100       137 → 202
   5           13534      622      504.29 ± 27.15         100       438 → 564
   6          255497     1369     1096.85 ± 46.77          93       983 → 1223
   7         5852292     2588     2013.28 ± 98.13         100      1722 → 2206
   8       157563119     4313     3275.51 ± 130.41         57      3027 → 3584
   9      4874278234     6671     4971.33 ± 170.71         12      4730 → 5310
  10    170327525637     9717     7194.50 ± 198.44          6      6932 → 7488

Table 6(a). Transformation of a general matrix, initially filled with 4-digit rational numbers,
into Hessenberg form.


   n          exprswell    primes    random numbers     N_samples    min → max

   3                711       69      60.34 ± 4.05         100        46 → 69
   4              15812      125     105.24 ± 5.19         100        91 → 117
   5             493670      198     163.02 ± 6.45         100       145 → 178
   6           19254949      286     231.30 ± 8.34          93       208 → 250
   7          898326065      390     309.65 ± 11.57        100       281 → 334
   8        48860659874      508     399.82 ± 14.61         57       372 → 440
   9      3039845981556      642     498.25 ± 11.41         12       484 → 518
  10    213099621017855      791     611.50 ± 11.43          6       596 → 631

Table 6(b). Characteristic polynomial of the Hessenberg matrix.


   n    Hessenberg matrix        Characteristic polynomial
        GCDs      no GCDs        GCDs      no GCDs

   3      61          61                       122
   4     216         379                       565
   5     622        2360                      3129
   6    1369       14203                     18365
   7    2588       82576

Table 7. Transformation of a general matrix, initially filled with 4-digit “prime” rational
numbers, into Hessenberg form and the characteristic polynomial of the Hessenberg
matrix.

6  Concluding  Remarks

Expression  swell  is  a  significant  and  in  the  long  run  an  inevitable  problem  of  symbolic 
computation.  Since  expression  swell  is  ultimately  unavoidable,  one  goal  should  be  to  make 
it  at  least  manageable.  One  way  to  accomplish  this  is  to  develop  a  collection  of  procedures 
which  will  allow  a  user  to  be  able  to  predict  the  progress  of  expression  growth  for  a  given 
calculation.  In  this  paper,  an  attempt  has  been  made  to  develop  some  tools  to  pursue  the 
above  goal  for  a  certain  class  of  computations.  The  application  of  these  tools  has  had  mixed 
success. 

One  tool  for  charting  expression  growth  is  to  perform  a  variety  of  calculations  with  varying 
initial  conditions  and  measure  the  sizes  of  the  final  and  various  intermediate  results.  This 
procedure,  although  tedious,  has  produced  some  interesting  conclusions  about  the  maximum 
size  of  coefficients  in  the  characteristic  polynomials  of  certain  integer  and  rational  number 
matrices  where  the  nonzero  entries  were  initially  uniform  in  size. 

A  second  tool  is  worst  case  (and  best  case)  expression  swell  arithmetic.  This  can  be 
used  both  for  deriving  general  bounds  on  expression  swell  over  a  class  of  calculations  and 
also  for  determining  such  bounds  for  specific  computations.  Some  of  the  bounds  looked 
at  above  have  produced  good  estimates  on  real  maximal  expression  size  while  others  have 
given  outrageous  overestimates.  At  this  preliminary  stage  in  the  development  of  a  useful 
expression  swell  arithmetic,  some  observations  can  be  made  based  on  the  experience  gained 
here. 

For  integer  worst  case  expression  swell  arithmetic,  the  best  results  (the  tightest  bounds) 
have  come  about  when  analyses  have  been  performed  on  algorithms  or  inequalities  that  have 
minimized  the  mix  of  operations  needed  to  obtain  an  estimate.  For  example,  the  analysis 
of  the  Gaussian  elimination  algorithm  of  Section  3  and  many  of  the  computed  bounds  in 
the  previous  section  required  a  complex,  multi-stage  calculation  and  in  many  cases,  this 
resulted  in  excessive  estimates,  even  for  small  values  of  the  matrix  dimension.  On  the  other 
hand,  simple  formulas  like  Hadamard’s  inequality,  where  the  various  arithmetical  operations 
are  reasonably  consolidated  gave  good  bounds  on  the  expression  swell.  A  major  reason  for 
this  behavior  is  that  expression  swell  arithmetic  ignores  past  history.  Thus,  if  a  calculation 
produces a worst case outcome that just barely exceeds n digits (e.g. $11 \odot n - 2$), the result
will normally be treated as if it were the largest possible n-digit integer in the very next
calculation, effectively neglecting the number’s origin. Thus, the more steps there are in a
computation,  the  more  often  these  jumps  in  (interpreted)  value  can  occur.  Of  course,  this 
phenomenon  also  allows  the  analysis  to  be  greatly  simplified. 

Defining $\mathcal{N}(n)$ in terms of an arbitrary integer base b instead of 10 (the latter was chosen
only for convenience) could help to soften the effect of jumps in value if a base smaller than
10 was selected. This is because integers having the same number of base b digits will form
more and smaller sets as b decreases, thus producing finer divisions of the integers. Therefore,
expression swell bounds can be made more precise and jumps in value can be lessened (for
instance, $1111_{10}$ would jump to $9999_{10}$ using a decimal definition of $\mathcal{N}(n)$, but it would only
jump to $2047_{10}$ with a binary definition, where $\mathcal{N}$ would now count the number of bits in n).
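The jump in value described here is simple to compute for any base. In the sketch below the function names are hypothetical, and the worst case value of a positive integer is taken to be the largest integer with the same number of base b digits.

```python
def digit_count(x, base=10):
    """Number of base-`base` digits of a positive integer x."""
    count = 0
    while x > 0:
        x //= base
        count += 1
    return max(count, 1)


def worst_case_value(x, base=10):
    """Largest integer with as many base-`base` digits as x."""
    return base ** digit_count(x, base) - 1


print(worst_case_value(1111, 10))   # 9999
print(worst_case_value(1111, 2))    # 2047
```

With base 2 the integer 1111 occupies 11 bits, so its worst case stand-in is $2^{11} - 1 = 2047$, less than twice the true value, whereas the decimal definition inflates it by a factor of about nine.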

Another  factor  in  why  the  expression  swell  arithmetic  developed  here  sometimes  tends 
to  greatly  overestimate  expression  growth  is  that  no  provision  has  been  (or  is  easily  able 
to  be)  made  to  account  for  the  effects  of  subtractive  cancellation  (which  can  be  significant 
for  procedures  such  as  the  solution  of  a  triangular  system  of  equations)  and  cancellation 
of common factors in non-exact quotients. A theorem from number theory due to Ernesto
Cesàro states that the probability that two integers chosen at random are relatively prime is
$6/\pi^2$ or about 61% [Knu69]. Performing rational arithmetic operations again and again on
elements  of  a  given  matrix  involves  numbers  that  are,  after  a  while,  far  from  random  and  so 
GCDs  play  a  significant  role,  as  was  seen  in  Table  7. 
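The 61% figure can be checked by brute force on a finite range. This is only a numerical illustration with a hypothetical function name; it counts coprime pairs among the integers 1 through a chosen limit instead of sampling at random.

```python
from math import gcd, pi


def coprime_fraction(limit):
    """Fraction of pairs (a, b) with 1 <= a, b <= limit and gcd(a, b) = 1."""
    hits = sum(1
               for a in range(1, limit + 1)
               for b in range(1, limit + 1)
               if gcd(a, b) == 1)
    return hits / limit ** 2


print(coprime_fraction(100), 6 / pi ** 2)
```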

The  other  major  reason  why  the  rational  worst  case  expression  swell  arithmetic  did  so 
poorly  is  that  rational  operations  involve  extensive  integer  arithmetic  so  that  the  jump  in 
value  effect  is  greatly  magnified.  Nonetheless,  this  arithmetic  did  provide  bounds  on  the  size 
of  other  smaller  expressions  involved  in  the  calculations  which  were  more  reasonable,  as  well 
as  indicating  the  relative  ranking  by  size  of  the  various  expressions  present  at  a  given  stage 
(e.g.  matrix  elements  or  polynomial  coefficients).  These  last  remarks  also  apply  to  integer 
expression  swell  arithmetic. 

Worst  case  expression  swell  arithmetic  is  an  attempt  to  effectively  systematize  asymptotic 
analysis  for  a  certain  class  of  problems  while  providing  a  greater  flexibility  in  its  usage  (e.g. 
the  elements  in  a  matrix  need  not  be  considered  initially  uniform  in  size  in  order  to  obtain 
a bound). In this paper, only numerical calculations were considered, but these or like
techniques could also be applied to the coefficients and exponents of polynomial or other
computations (as asymptotic analysis occasionally has been). One has to be careful with worst
(or best) case analyses for complicated algorithms as what is worst case at one stage may
be best case (or more usually, a mix) at another. Finally, these analyses are useful
not only for bounding expression swell (and thus computer memory usage) but also for setting
limits on CPU time consumption. For example, the time required to multiply $n_1$ by $n_2$ using
the simplest algorithm is $O(\mathcal{N}(n_1) \cdot \mathcal{N}(n_2))$ [AkrSS].

A  VARIATIONAL  METHOD  FOR
FINDING  HOMOCLINIC  ORBITS 
IN  THE  LARGE. 

I.  EKELAND 

CEREMADE,  Universite  de  Paris-Dauphine. 


§1.  A  simple  equation. 

Let us start from the following one-dimensional equation:

$$(1) \qquad \ddot{q} - q + q^3 = 0 , \quad q(t) \in \mathbb{R}$$

Solving it is a simple task. We know that the total energy is preserved:

$$\tfrac12 \dot{q}(t)^2 - \tfrac12 q(t)^2 + \tfrac14 q(t)^4 = \text{constant}$$

so that the trajectories of equation (1), in $(q, \dot{q})$-space, are just the level
curves of the function $\tfrac12 \dot{q}^2 - \tfrac12 q^2 + \tfrac14 q^4$. Figure 1 shows the existence
of two (by symmetry) continuous families of closed level curves,
corresponding to periodic solutions of equation (1). The boundary curves
correspond to solutions q(t) with the property that:

$$q(t) \to 0 , \quad \dot{q}(t) \to 0 \quad \text{as } t \to \pm\infty$$

Such solutions are nowadays called homoclinic, although I would
prefer to call them doubly asymptotic to the origin, according to Poincaré’s
original terminology.
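Both claims, conservation of energy along trajectories and the existence of a solution doubly asymptotic to the origin, can be checked numerically. The sketch below is an illustration under stated assumptions: the closed form $q(t) = \sqrt{2}\,\operatorname{sech} t$ is not given in the text but is readily verified to solve $\ddot{q} = q - q^3$ with zero energy, and the integrator is a plain fourth-order Runge-Kutta step with hypothetical function names.

```python
import math


def energy(q, v):
    """Total energy (1/2) v^2 - (1/2) q^2 + (1/4) q^4, with v the velocity."""
    return 0.5 * v * v - 0.5 * q * q + 0.25 * q ** 4


def homoclinic(t):
    """Closed-form solution q = sqrt(2) sech(t) and its time derivative."""
    q = math.sqrt(2) / math.cosh(t)
    v = -math.sqrt(2) * math.sinh(t) / math.cosh(t) ** 2
    return q, v


def rk4_step(q, v, dt):
    """One Runge-Kutta step for the system q' = v, v' = q - q^3."""
    def f(q, v):
        return v, q - q ** 3
    k1q, k1v = f(q, v)
    k2q, k2v = f(q + 0.5 * dt * k1q, v + 0.5 * dt * k1v)
    k3q, k3v = f(q + 0.5 * dt * k2q, v + 0.5 * dt * k2v)
    k4q, k4v = f(q + dt * k3q, v + dt * k3v)
    return (q + dt * (k1q + 2 * k2q + 2 * k3q + k4q) / 6,
            v + dt * (k1v + 2 * k2v + 2 * k3v + k4v) / 6)


# A periodic orbit inside the homoclinic loop: the energy barely drifts.
q, v = 1.2, 0.0
e0 = energy(q, v)
for _ in range(1000):
    q, v = rk4_step(q, v, 0.01)
print(e0, energy(q, v))

# The homoclinic orbit sits on the zero level curve and tends to the origin.
print(energy(*homoclinic(0.5)), homoclinic(20.0))
```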

For future generalization, it will be convenient to rephrase equation
(1) in the Hamiltonian formalism. Introducing the function

$$H(p, q) = \tfrac12 p^2 - \tfrac12 q^2 + \tfrac14 q^4$$

we rewrite equation (1) as a system:

$$\dot{q} = p = \frac{\partial H}{\partial p} , \qquad \dot{p} = q - q^3 = -\frac{\partial H}{\partial q}$$

§2.  First  extension  :  more  dimensions. 

We would like to find homoclinic orbits in higher-dimensional
situations. The preceding argument breaks down because the level sets
H(p,q) = h no longer are trajectories. In fact, no general result was
known until the recent paper by V. Coti Zelati, I. Ekeland and E. Séré
(ref. [1]) which I am now proceeding to describe.

Consider a smooth Hamiltonian $H : \mathbb{R}^{2n} \to \mathbb{R}$ of the following form:

$$H(x) = \tfrac12 (Ax, x) + R(x)$$

under the assumptions that

(H1) $A^* = A$ and $JA$ is hyperbolic
(no eigenvalue on the imaginary axis)

(H2) R is strictly convex

(H3) R is superquadratic, that is, for some $\alpha > 2$
and for suitable constants $K > k > 0$, we have:

$$R(x) \le \frac{1}{\alpha} (R'(x), x)$$

$$k|x|^\alpha \le R(x) \le K|x|^\alpha .$$
Define $J \in \mathcal{L}(\mathbb{R}^{2n})$ as the standard symplectic matrix, so that $J^* = -J = J^{-1}$. We are interested in the Hamiltonian system

$$(2) \qquad \dot x = JH'(x)$$

It has the constant (equilibrium) solution $x = 0$. Periodic solutions can be found by the duality methods described in [2]. We want to find homoclinic solutions, i.e.

$$(3) \qquad x(t) \to 0 \quad \text{when } t \to \pm\infty$$
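Assumption (H1) can be checked by hand in the smallest case $n = 1$. The sketch below is only an illustration under stated assumptions: it takes the sign convention $J = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$ with $x = (q, p)$, and two hypothetical matrices $A$, a saddle and a centre. It computes the eigenvalues of $JA$ from the trace and determinant and tests whether any of them has zero real part.

```python
import cmath

def matmul_2x2(x, y):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def eigenvalues_2x2(m):
    """Eigenvalues of a 2x2 matrix from its trace and determinant."""
    (a, b), (c, d) = m
    tr = a + d
    det = a * d - b * c
    disc = cmath.sqrt(tr * tr - 4.0 * det)
    return ((tr + disc) / 2.0, (tr - disc) / 2.0)

def is_hyperbolic(m, tol=1e-12):
    """True if no eigenvalue of m lies on the imaginary axis."""
    return all(abs(lam.real) > tol for lam in eigenvalues_2x2(m))

J = [[0.0, 1.0], [-1.0, 0.0]]          # assumed sign convention, x = (q, p)
A_SADDLE = [[-1.0, 0.0], [0.0, 1.0]]   # quadratic part p^2/2 - q^2/2
A_CENTER = [[1.0, 0.0], [0.0, 1.0]]    # quadratic part p^2/2 + q^2/2

saddle_ok = is_hyperbolic(matmul_2x2(J, A_SADDLE))   # eigenvalues +1 and -1
center_ok = is_hyperbolic(matmul_2x2(J, A_CENTER))   # eigenvalues +i and -i
```

With these choices the saddle passes the test and the centre fails it, which matches the intuition that homoclinic orbits can only emanate from an equilibrium with genuine stable and unstable directions.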

One approach is through the classical action principle. Associated with equation (2) is the action integral:

$$\Phi(x) = \int_{-\infty}^{+\infty} \left[ \tfrac12 (J\dot x, x) + H(x) \right] dt = \int_{-\infty}^{+\infty} \left[ \tfrac12 (J\dot x + Ax, x) + R(x) \right] dt$$

and the solutions of the boundary-value problem (2)-(3) are the extremals of the integral $\Phi(x)$ over a suitable space of curves. However, as we already noted in [2], the functional $\Phi$ does not readily lend itself to analysis. Taking advantage of the convexity of $R$, we will replace $\Phi$ by a more tractable functional.

Note first that the equation

$$J\dot x + Ax = u$$

has a unique solution $x$ such that $x(t) \to 0$ when $t \to \pm\infty$, for any $u \in L^\beta$. This fact crucially uses the assumption that $JA$ is hyperbolic, and enables us to define a continuous linear map

$$\mathcal{L} : L^\beta(\mathbb{R}) \to L^\alpha(\mathbb{R}) \cap W^{1,\beta}(\mathbb{R})$$

by $x = \mathcal{L}u$. Here $\beta = \alpha/(\alpha - 1)$ is the conjugate exponent ($1 < \beta < 2$) and

$$W^{1,\beta} = \{ x \in L^\beta \mid \dot x \in L^\beta \} .$$

It should be noted that $\mathcal{L}$ is not a compact operator (as it would be on any finite interval). This is the reason why the Palais-Smale condition fails in this problem, as we will see later on.

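Introduce the Fenchel conjugate $R^*$ of the convex function $R$:

$$R^*(y) = \max \{ (x, y) - R(x) \mid x \in \mathbb{R}^{2n} \}$$

and define a functional $\psi$ on $L^\beta$ by:

$$\psi(u) = \int_{-\infty}^{+\infty} \left[ \tfrac12 (\mathcal{L}u, u) + R^*(u) \right] dt$$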
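To make the Fenchel conjugate concrete, here is a minimal sketch in the scalar case, where the inner product $(x, y)$ reduces to $xy$. The model function $R(x) = |x|^\alpha/\alpha$ with $\alpha = 4$ is a hypothetical example chosen because it is strictly convex and superquadratic; its conjugate is $R^*(y) = |y|^\beta/\beta$ with $\beta = \alpha/(\alpha-1)$. The code compares a brute-force maximisation with this closed form and checks that the two gradients are inverse to each other.

```python
import math

ALPHA = 4.0                      # hypothetical superquadratic exponent
BETA = ALPHA / (ALPHA - 1.0)     # conjugate exponent, here 4/3

def R(x):
    """Model convex function R(x) = |x|^alpha / alpha."""
    return abs(x) ** ALPHA / ALPHA

def R_star_closed_form(y):
    """Known conjugate of R: |y|^beta / beta."""
    return abs(y) ** BETA / BETA

def R_star_on_grid(y, x_max=5.0, steps=20001):
    """Brute-force max of x*y - R(x) over a uniform grid on [-x_max, x_max]."""
    best = float("-inf")
    for i in range(steps):
        x = -x_max + 2.0 * x_max * i / (steps - 1)
        best = max(best, x * y - R(x))
    return best

def grad_R(x):
    """Derivative of R: sign(x) * |x|^(alpha - 1)."""
    return math.copysign(abs(x) ** (ALPHA - 1.0), x)

def grad_R_star(y):
    """Derivative of R*: sign(y) * |y|^(beta - 1)."""
    return math.copysign(abs(y) ** (BETA - 1.0), y)

# Largest gap between the grid maximum and the closed form at a few points.
gap = max(abs(R_star_on_grid(y) - R_star_closed_form(y)) for y in (-1.5, 0.5, 2.0))
```

The identity `grad_R_star(grad_R(x)) == x` (up to rounding) is the scalar form of the reciprocity between $\nabla R$ and $\nabla R^*$ for a strictly convex function.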


PROPOSITION 1. $\psi$ is well-defined and $C^1$. If $u$ is a critical point of $\psi$, i.e. if $\psi'(u) = 0$, then $x = -\mathcal{L}u$ is a solution of (2)-(3).


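Formal PROOF: Write $0 = \psi'(u) = \mathcal{L}u + \nabla R^*(u)$. Hence:

$$\nabla R^*(u) = -\mathcal{L}u .$$

By the Legendre reciprocity formula, this can be rewritten as:

$$u = \nabla R(-\mathcal{L}u)$$

or, introducing $x = -\mathcal{L}u$:

$$-J\dot x - Ax = \nabla R(x)$$

that is, $\dot x = J(Ax + \nabla R(x)) = JH'(x)$, since $J^{-1} = -J$. $\square$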


The question now is to find critical points of $\psi$. Simple estimates show that $\psi$ has a local minimum at the origin, with $\psi(0) = 0$, while points $u \in L^\beta$ can be found, with $\|u\|$ arbitrarily large, such that $\psi(u) < 0$. By the Ambrosetti-Rabinowitz theorem (see [2]), we conclude that there is a sequence $u_n \in L^\beta$ with

$$(4) \qquad \psi(u_n) \to c > 0$$

$$(5) \qquad \psi'(u_n) \to 0 .$$

At this stage, one would like to conclude that $u_n$ has a convergent subsequence, $u_{n_k} \to u$ with $\psi'(u) = 0$. This is the so-called Palais-Smale condition. Unfortunately it does not hold in this problem. The analytical reason is that the operator $\mathcal{L}$ is not compact; the underlying geometrical reason is that the problem is translation-invariant, and the symmetry group $\mathbb{R}$ is not compact.

To see what could happen, suppose there actually is some critical point $u \ne 0$, corresponding to a homoclinic orbit $x = -\mathcal{L}u$. Set:

$$(\theta * u)(t) = u(t + \theta), \quad \text{for } \theta \in \mathbb{R} .$$

Define a sequence $u_n$ in $L^\beta$ by:

$$u_n = u + n * u, \quad n \to +\infty .$$

In other words, $u_n$ is the sum of two bumps which split apart. It is not difficult to see that $u(t) \to 0$ exponentially fast when $t \to \pm\infty$, so that, as $n$ increases, there is less and less interaction between $u$ and $n * u$, each of which solves $\psi' = 0$. In the limit, we get:

$$\psi'(u_n) \to 0 \quad \text{when } n \to +\infty .$$
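The loss of interaction between the two bumps can be seen in a small numerical experiment. The sketch below is a stand-in, not a computation with the actual functional: it uses the hypothetical profile $u(t) = 1/\cosh t$, which decays exponentially like the true critical point, and measures the interaction by the overlap integral $\int u(t)\,u(t+n)\,dt$, a crude proxy chosen for illustration.

```python
import math

def bump(t):
    """Hypothetical exponentially decaying profile, u(t) = 1 / cosh(t)."""
    return 1.0 / math.cosh(t)

def overlap(shift, half_width=60.0, steps=24001):
    """Trapezoid approximation of the integral of u(t) * u(t + shift)."""
    h = 2.0 * half_width / (steps - 1)
    total = 0.0
    for i in range(steps):
        t = -half_width + i * h
        weight = 0.5 if i in (0, steps - 1) else 1.0
        total += weight * bump(t) * bump(t + shift)
    return total * h

# Overlap between the bump and its translate for increasing separations.
overlaps = {n: overlap(float(n)) for n in (0, 5, 10, 20)}
```

For this particular profile the overlap has the closed form $2n/\sinh n$, so it decays essentially like $4n\,e^{-n}$: the two bumps stop seeing each other exponentially fast, even though $u_n$ itself does not converge.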


The concentration-compactness lemma of Pierre-Louis Lions (see [3], [4]) tells us that this is exactly what does happen. In fact, from (4)-(5) we conclude that there is some subsequence $u_{n_k}$, finitely many critical points $u^1, \ldots, u^N$ of $\psi$, and corresponding sequences $p_k^1, \ldots, p_k^N$ in $\mathbb{Z}$ such that:

$$\Bigl\| u_{n_k} - \sum_{i=1}^{N} p_k^i * u^i \Bigr\| \to 0$$

$$|p_k^i - p_k^j| \to \infty, \quad i \ne j$$

$$c = \sum_{i=1}^{N} \psi(u^i)$$

Each of the $u^i$ solves $\psi' = 0$, so that the corresponding $x^i = -\mathcal{L}u^i$ solves (2)-(3). We have proved:

THEOREM 1. Under assumptions (H1) to (H3), the system (2) has at least one homoclinic orbit.


§3.  Second  extension  :  non-autonomous  case. 

Let  us  now  consider  the  equation  : 

$$(6) \qquad \dot x = JH'(t, x) = JAx + JR'(t, x)$$


under assumptions (H1) to (H3). The latter has to hold uniformly with respect to $t$:

$$R(t,x) \le \frac{1}{\alpha}\,(R'(t,x), x)$$

$$k|x|^\alpha \le R(t,x) \le K|x|^\alpha .$$

Assume in addition that $R$ is time-periodic:

$$(H4) \qquad \exists\, T : \ R(t+T, x) = R(t, x) \quad \forall (t,x) .$$


We may then introduce the time-$T$ map in phase space. This is the map $f : \mathbb{R}^{2n} \to \mathbb{R}^{2n}$ which associates $x(T)$ with $x(0)$ along solutions of (6):

$$x(0) = x_0, \quad x(T) = x_1 \quad \Longrightarrow \quad x_1 = f(x_0) .$$

Because of (H3), the origin is a fixed point for $f$:

$$f(0) = 0 .$$

Assumption (H1) tells us that 0 is a hyperbolic fixed point. By the standard theory of dynamical systems, there are two $n$-dimensional manifolds branching off from 0, the stable one $E^s$ and the unstable one $E^u$:

$$E^s = \{ x \in \mathbb{R}^{2n} \mid f^k(x) \to 0 \text{ when } k \to \infty \}$$

$$E^u = \{ x \in \mathbb{R}^{2n} \mid f^k(x) \to 0 \text{ when } k \to -\infty \} .$$
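The two definitions are easiest to read on a linear toy model. The sketch below assumes the hypothetical map $f(a, b) = (2a, b/2)$ on $\mathbb{R}^2$, which is not the time-$T$ map of equation (6) but has the same kind of hyperbolic fixed point at the origin. Here the stable set is the $b$-axis and the unstable set is the $a$-axis.

```python
import math

def f_power(x, k):
    """k-th iterate of the toy map f(a, b) = (2a, b/2); k may be negative."""
    a, b = x
    return (a * 2.0 ** k, b * 0.5 ** k)

def norm(x):
    """Euclidean norm of a point of the plane."""
    return math.hypot(x[0], x[1])

stable_point = (0.0, 1.0)     # forward iterates tend to 0
unstable_point = (1.0, 0.0)   # backward iterates tend to 0
generic_point = (1.0, 1.0)    # escapes in both time directions

forward_stable = norm(f_power(stable_point, 50))
backward_unstable = norm(f_power(unstable_point, -50))
forward_generic = norm(f_power(generic_point, 50))
backward_generic = norm(f_power(generic_point, -50))
```

In this linear model the two sets meet only at the origin. Homoclinic points are a nonlinear phenomenon: they require the curved manifolds $E^s$ and $E^u$ to come back and intersect away from 0.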

Any homoclinic orbit $x(t)$ gives us a sequence of homoclinic points $x(nT)$, $n \in \mathbb{Z}$:

$$\forall n \in \mathbb{Z}, \quad x(nT) \in E^s \cap E^u .$$


The manifolds $E^s$ and $E^u$ therefore have infinitely many intersection points. The first to notice this fact was Poincaré (see [6]), and the intricate picture which arises from his analysis is now classical.

[Figure: intersecting stable and unstable manifolds $E^s$ and $E^u$]

It turns out that, if the intersection of $E^s$ and $E^u$ is transversal, there are many more points in this intersection than the $x(nT)$. In other words, if there is one homoclinic orbit, there should be many more.

This argument relies on transversality of the intersection $E^s \cap E^u$, a fact that cannot be checked except in very special situations (e.g. Melnikov theory, see [5]) and can at best be hoped to be generic. We have therefore wondered whether we can do better by variational methods. This turns out to be the case:

THEOREM 2. (Coti Zelati, Ekeland, Séré [1])

Under assumptions (H1) to (H4), there are at least two homoclinic orbits $x_1$ and $x_2$ which are geometrically distinct:

$$\forall n \in \mathbb{Z}, \quad (nT) * x_1 \ne x_2 \qquad \square$$

THEOREM 3. (Séré [7])

Under assumptions (H1) to (H4) there are infinitely many homoclinic orbits $x_n$, $n \in \mathbb{N}$, pairwise geometrically distinct:

$$i \ne j \implies \forall n \in \mathbb{Z}, \quad (nT) * x_i \ne x_j \qquad \square$$

Clearly Theorem 3 contains Theorem 2. The proofs, however, are distinct, betraying the fact (which already arises from a careful investigation of Poincaré's argument) that the second solution is somehow more fundamental than the remaining ones. The second solution is found by a min-max argument around level $2c$. The other solutions are found by perturbation arguments around much higher levels.

The main theoretical advance in both situations is the introduction of a new condition, termed (PSS) (Palais-Smale-Séré), which goes as follows.


DEFINITION 4. A function $\psi$ is said to satisfy (PSS) if any sequence $u_n$ such that:

$$\psi(u_n) \to c, \quad \psi'(u_n) \to 0, \quad \|u_{n+1} - u_n\| \to 0$$

has a convergent subsequence. $\square$




It is remarkable that condition (PSS) (instead of (PS)) is enough for the deformation lemma to hold.

PROPOSITION 5. Define $\psi : L^\beta \to \mathbb{R}$ as above, and assume there are finitely many critical points of $\psi$ (up to a time translation by some multiple of $T$). Then (PSS) holds for $\psi$. $\square$


The idea behind the proof is that a splitting such as the one we described, with two bumps separating and going away from each other while $\psi'$ goes to zero, cannot occur continuously. In fact, if $\psi'(u) = 0$, and if we set:

$$u_\theta = u + \theta * u$$

with $\theta \to \infty$, we will have $\psi'(u_{nT}) \to 0$ as $n \to \infty$, since the equation is $T$-periodic, but $\psi'(u_\theta)$ will remain bounded away from 0 as long as $\theta$ is bounded away from multiples of $T$.

We now describe the two min-max procedures that enable us to find the first two homoclinic orbits.

The first one, already alluded to in the preceding section, consists in choosing some point $v \in L^\beta$ with $\|v\|$ large such that $\psi(v) < 0$, considering all continuous paths connecting 0 and $v$:

$$\Gamma = \{ \gamma \in C^0([0,1]; L^\beta) \mid \gamma(0) = 0, \ \gamma(1) = v \}$$

and defining:

$$c = \inf_{\gamma \in \Gamma} \max \{ \psi \circ \gamma(t) \mid 0 \le t \le 1 \} .$$

Then $c > 0$ and (PSS) implies that it is a critical value (no splitting occurs):

$$\exists\, u : \ \psi'(u) = 0 \text{ and } \psi(u) = c .$$
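The min-max value $c$ can be illustrated on a finite-dimensional toy problem, where compactness is automatic and none of the difficulties above arise. The sketch below assumes the hypothetical function $\psi_{\text{toy}}(x, y) = x^2 - x^4 + y^2$ on $\mathbb{R}^2$, which has a local minimum at the origin and is negative at $v = (2, 0)$. Every path from 0 to $v$ must cross the line $x = 1/\sqrt{2}$, where $\psi_{\text{toy}} \ge 1/4$, so the min-max level is $1/4$, attained at the saddle point $(1/\sqrt{2}, 0)$.

```python
import math

def psi_toy(x, y):
    """Toy function with a local minimum at 0 and negative values far away."""
    return x * x - x ** 4 + y * y

def max_along_path(path, samples=2001):
    """Maximum of psi_toy over a uniformly sampled path t -> (x, y), t in [0, 1]."""
    return max(psi_toy(*path(i / (samples - 1))) for i in range(samples))

V = (2.0, 0.0)   # end point with psi_toy(V) = -12 < 0

def straight_path(t):
    """Segment from the origin to V; it crosses the saddle point."""
    return (V[0] * t, 0.0)

def detour_path(t):
    """A path with the same end points that climbs away from the saddle."""
    return (V[0] * t, math.sin(math.pi * t))

c_straight = max_along_path(straight_path)   # close to the min-max level 1/4
c_detour = max_along_path(detour_path)       # strictly larger
```

The straight path realises the infimum over paths, while the detour pays a higher maximum. In the infinite-dimensional problem the same construction produces only a sequence satisfying (4)-(5), and a condition such as (PSS) is needed to turn it into an actual critical point.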

The second one consists in introducing a set of continuous maps $\sigma$ from the square $K = [0,1]^2$ into $L^\beta$:

7(s,  0  =  0  V  s 

<7  €  C°(K;  L^)  7(0,1)  =  *  =  7(1,1) 

7(s  +  l,t)  =  T  *7(5,0
and in defining:

$$d = \inf_{\sigma} \max \{ \psi \circ \sigma(s,t) \mid (s,t) \in K \}$$


It can be shown that either $d = c$ (in which case there are infinitely many critical points on that level) or $d > c$ (in which case, by (PSS), there is a critical point $v$ with $\psi(v) = d \ne c = \psi(u)$, so $u \ne v$). In both cases there is a second solution.